Comparing ea89bac176..0df485c285 - mesa

fran/mesa

Author	SHA1	Message	Date
Dylan Baker	0df485c285	docs: Add mesa 19.0.8 sha256 sums	2019-06-26 13:42:45 -07:00
Dylan Baker	1625c02d65	docs: Add docs for 19.0.8	2019-06-26 13:08:54 -07:00
Dylan Baker	1704569ea3	version: bump to 19.0.8	2019-06-26 09:43:56 -07:00
Kenneth Graunke	bde2f6398d	egl/x11: calloc dri2_surf so it's properly zeroed Commit `2282ec0a` refactored drawable creation across various platforms into a new dri2_create_drawable helper function. The GBM code in platform_drm.c code passed in dri2_surf->gbm_surf as the loaderPrivate, while most other backends passed in dri2_surf directly. To try and handle this, the patch checked if dri2_surf->gbm_surf was non-NULL, and if so, presumed that the caller is the DRM platform and we should use the dri2_surf->gbm_surf pointer. This worked for most platforms, which calloc their dri2_surf structure, zeroing the data. Unfortunately, platform_x11.c used malloc, leaving most of the dri2_surf as garbage. In particular, dri2_surf->gbm_surf was often non-NULL, causing dri2_create_drawable to try and use it, passing a garbage pointer to the createNewDrawable hook, usually leading to a SIGBUS or SIGSEGV when trying to dereference that bad pointer. Since most callers calloc the data, make platform_x11.c follow suit. Fixes crashes with i915_dri.so when running dEQP-GLES2. Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (cherry picked from commit `4e3297f7d4`)	2019-06-26 09:42:49 -07:00
Dylan Baker	4c43f9c667	docs: Add SHA256 sums for 19.0.7	2019-06-24 16:21:34 -07:00
Dylan Baker	7efa08da14	Docs add 19.0.7 release notes	2019-06-24 14:56:04 -07:00
Dylan Baker	bc5ea092c9	Bump version for 19.0.7 release	2019-06-24 12:39:38 -07:00
Lionel Landwerlin	77074659c5	intel/perf: improve dynamic loading config detection We're currently trying to detect dynamic loading config support by trying to remove to test config (hard coded in the i915 driver) and checking we get ENOENT. This can fail if the test config was updated in Mesa but not yet in i915. A better way to do this is to pick an invalid ID and check for ENOENT. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `c162127440`)	2019-06-21 19:43:14 +00:00
Lionel Landwerlin	5375846e8d	intel/perf: fix EuThreadsCount value in performance equations EuThreadsCount is supposed to be the number of threads per EU, not the total number of threads in the whole device. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `1fc7b95127` ("i965: Add Gen8+ INTEL_performance_query support") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `0430c6d18a`)	2019-06-21 19:43:14 +00:00
Gert Wollny	4e3f82cde4	virgl: Assume sRGB write control for older guest kernels or virglrenderer hosts When the host virglrenderer is an older version that doesn't check the sRGB write control feature, or when the guest kernel doesn't support CAPS v2, then the guest will only report support for GL 2.1 on a GL 3.3 host, even though it was supporting 3.3 with earlier guest mesa versions. By also checking the host feature check version this regression can be avoided. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110921 Fixes: `2845939d6a` virgl: Set sRGB write control CAP based on host capabilities Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> (cherry-picked from commit `2b87753a84`)	2019-06-21 09:18:04 +00:00
Gert Wollny	8b26c9159d	virgl: Add a caps feature check version When we add new feature checks on the host side that is used to enable a cap conditionally that was enabled unconditionally before we might end up with a feature regression when a new mesa version is used with an old virglrenderer version that doesn't check for that cap. To work around this problem add a version id to the caps that corresponds to the features that are actually checked on the host and check that version too when enabling the cap. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Pohsien Wang <pwang@chromium.org> (cherry-picked from commit `a0edceb00d`)	2019-06-21 09:18:04 +00:00
Dylan Baker	a5609f46f5	cherry-ignore: add additional 19.1 only patches	2019-06-20 10:53:32 -07:00
Bas Nieuwenhuizen	359852a541	meson: Allow building radeonsi with just the android platform. Just as was allowed by autotools. Fixes: `108d257a16` "meson: build libEGL" Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `d1c04835ab`)	2019-06-20 10:43:54 -07:00
Bas Nieuwenhuizen	6f0e3e8a41	anv: Fix vulkan build in meson. Apparently the android part was never ported to meson. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `755c633b8d`)	2019-06-20 10:43:26 -07:00
Bas Nieuwenhuizen	25c3188704	radv: Fix vulkan build in meson. Apparently the android part was never ported to meson. CC: <mesa-stable@lists.freedesktop.org> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `4c300bd328`)	2019-06-20 10:43:20 -07:00
Samuel Pitoiset	41e3639b3a	radv: fix FMASK expand with SRGB formats Found while working on DCC for MSAA. Fixes: `6b976024a8` ("radv: add support for FMASK expand") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `a7f75377ab`)	2019-06-20 10:43:02 -07:00
Mathias Fröhlich	0eb21e88b0	egl: Don't add hardware device if there is no render node v2. Do not offer a hardware drm backed egl device if no render node is available. The current implementation will fail on this egl device. On top it issues a warning that is actually missleading. There are finally more error paths that can fail on the way to a hardware backed egl device. Fixing all of them would kind of require opening the drm device and see if there is a usable driver associated with the device. The taken approach avoids a full probe and fixes at least this kind of problem on kvm virtualization hosts I observe here. Fixes: `dbb4457d98` ("egl: add EGL_EXT_device_drm support") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de> (cherry picked from commit `5743a36b2b`)	2019-06-20 10:42:55 -07:00
Kenneth Graunke	fbef0f9ea1	glsl: Fix out of bounds read in shader_cache_read_program_metadata The VaryingNames array has NumVaryings entries. But BufferStride is a small array of MAX_FEEDBACK_BUFFERS (4) entries. Programs with more than 4 varyings would read out of bounds. Also, BufferStride is set based on the shader itself, which means that it's inherently already included in the hash, and doesn't need to be included again. At the point when shader_cache_read_program_metadata is called, the linker hasn't even set those fields yet. So, just drop it entirely. Fixes valgrind errors in KHR-GL45.transform_feedback.linking_errors_test. Fixes: `6d830940f7` glsl/shader_cache: Allow shader cache usage with transform feedback Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `3c10a2726b`)	2019-06-20 10:42:49 -07:00
Jason Ekstrand	1fe8661d21	anv: Set STATE_BASE_ADDRESS upper bounds on gen7 This should fix floating-point border color on all gen7 HW. Integer is still thoroughly busted on gen7 because it doesn't exist on IVB and it's crazy on HSW. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `9672b7044c`)	2019-06-20 10:42:42 -07:00
Bas Nieuwenhuizen	57767250dd	radv: Decompress DCC when the image format is not allowed for buffers. Otherwise the buffer loads/stores in the bufimage meta operations fail. If we decompress DCC then we can use the "canonical" format compatible with the not-supported format. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `4107590911`)	2019-06-20 10:42:01 -07:00
Haihao Xiang	ce97d04f38	i965: support UYVY for external import only It is similar with YUYV Fixes: `165e704719` ("i965/i915: Add UYVY as the supported format") Signed-off-by: Haihao Xiang <haihao.xiang@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `8ead5bebdb`)	2019-06-20 10:41:50 -07:00
Kevin Strasser	62daf3cde2	st/mesa: Add rgbx handling for fp formats Add missing cases for fp32 and fp16 formats. Fixes: `c68334ffc0` "st/mesa: add floating point formats in st_new_renderbuffer_fb()" Signed-off-by: Kevin Strasser <kevin.strasser@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `845ec8576a`)	2019-06-20 10:41:43 -07:00
Kevin Strasser	7903658b36	gallium/winsys/kms: Fix dumb buffer bpp The bpp in the dumb buffer creation request is hardcoded to 32, which is an incorrect assumption as the caller is free to pick any pipe format. Use the bpp supplied to us through util_format_get_blocksizebits(). Fixes: `3b176c441b` "gallium: Add a dumb drm/kms winsys backed swrast provider" Signed-off-by: Kevin Strasser <kevin.strasser@intel.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `ec0a68e50d`)	2019-06-20 10:41:38 -07:00
Samuel Pitoiset	405e4b8881	radv: fix VK_EXT_memory_budget if one heap isn't available When the visible VRAM size is equal to the VRAM size only two heaps are exposed. This fixes dEQP-VK.api.info.device.memory_budget. Cc: 19.0 19.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `d378151246`)	2019-06-20 10:41:30 -07:00
Samuel Pitoiset	1d5b8cb256	radv: fix occlusion queries on VegaM The number of render backends is 16 but the enabled mask is 0xaaaa. As noticed by Bas, allowing disabled render backends might break the OCCLUSION_QUERY packet. We don't use it yet but keep this in mind. This fixes dEQP-VK.query_pool.* and dEQP-VK.multiview.*. Cc: 19.0 19.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `2ef9d2738c`)	2019-06-20 10:33:19 -07:00
Bas Nieuwenhuizen	449b24aa06	radv: Prevent out of bound shift on 32-bit builds. uintptr_t is 32-bits then and shifting it by 32 bits results in undefined behavior IIRC. Fixes: `b3c8de1c55` "radv: save all descriptor pointers into the trace BO" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `39c71e0025`)	2019-06-20 10:32:36 -07:00
Samuel Pitoiset	c7ff7af262	radv: fix setting CB_SHADER_MASK for dual source blending CB_SHADER_MASK was computed without the second color buffer format which looks totally wrong to me. While we are at it, copy a comment from RadeonSI. Cc: 19.0 19.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `e9316fdfd4`)	2019-06-20 10:32:25 -07:00
Emil Velikov	8066847b8e	mapi: correctly handle the full offset table Earlier commit converted ES1 and ES2 to a new, much simpler, dispatch generator. At the same time, GL/glapi and the driver side are still using the old code. There is a hidden ABI between GL*.so and glapi.so, former referencing entry-points by offset in the _glapi_table. Hence earlier commit added the full table of entry-points, alongside a marker for other cases like indirect GL(X) and driver-size remapping. Yet the patches did not handle things fully, thus it was possible to get different interpretations of the dispatch table after the marker. This commit fixes that adding an indicative error message to catch future bugs. While here correct the marker (MAX_OFFSETS) comment. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110302 Fixes: `cf317bf093` ("mapi: add all _glapi_table entrypoints tostatic_data.py") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `a379b1c0ee`)	2019-06-20 10:32:11 -07:00
Samuel Pitoiset	a4f5616c60	radv: fix alpha-to-coverage when there is unused color attachments When alphaToCoverage is enabled, we should always write the alpha channel of MRT0 if it's unused. This now matches RadeonSI. This fixes the new CTS: dEQP-VK.pipeline.multisample.alpha_to_coverage_unused_attachment.samples_*.alpha_invisible Cc: 19.0 19.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl (cherry picked from commit `91aa25f462`)	2019-06-20 10:31:52 -07:00
Nanley Chery	7768f52c04	anv/cmd_buffer: Initalize the clear color struct for CNL+ On CNL+, the clear color struct is composed of RGBA channel values and fields which are either reserved by the HW or used to control fast-clears. Currently anv initializes the channel values to zero and allows the other fields to be undefined. Satisfy the MBZ field requirements by removing an optimization that doesn't hold true for CNL+ and pulling in the number of dwords to initialize from ISL. Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `b4198e792c`)	2019-06-20 10:30:57 -07:00
Charmaine Lee	ae0caaf194	svga: Remove unnecessary check for the pre flush bit for setting vertex buffers This fixes the missing rebind when the can_pre_flush bit is not set and the vertex buffers are the same as what have been sent. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Neha Bhende <bhenden@vmware.com> Signed-off-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> (cherry picked from commit `f29b8fde91`)	2019-06-20 10:04:47 -07:00
Deepak Rawat	6f37df6622	winsys/svga/drm: Fix 32-bit RPCI send message Depending on whether compiled with frame-pointer or not, the temporary memory location used for the bp parameter in these macros are referenced relative to the stack pointer or the frame pointer. Hence we can never reference that parameter when we've modified either the stack pointer or the frame pointer, because then the compiler would generate an incorrect stack reference. Fix this by pushing the temporary memory parameter on a known location on the stack before modifying the stack- and frame pointers. Also in case of failuire RPCI channel is not closed which lead to vmx running out of channels. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Deepak Rawat <drawat@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> (cherry picked from commit `72fc886826`)	2019-06-20 10:04:35 -07:00
Nataraj Deshpande	ea2e23b54f	anv: Fix check for isl_fmt in assert Checking isl_fmt returned value in assert seems appropriate instead of format variable. Fixes: `f1654fa7e3` "anv/android: support creating images from external format" Signed-off-by: Nataraj Deshpande <nataraj.deshpande@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> (cherry picked from commit `d6724471a5`)	2019-06-20 10:04:25 -07:00
Jason Ekstrand	a6f1bdc5c7	nir/propagate_invariant: Don't add NULL vars to the hash table Fixes: `8410cf66d` "nir/propagate_invariant: Skip unknown vars" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> (cherry picked from commit `d96878a66a`)	2019-06-20 10:04:16 -07:00
Dylan Baker	3b7363f499	docs: Add SHA256 sums for 19.0.6	2019-06-05 16:37:20 -07:00
Dylan Baker	095d22b2cb	docs: Add relnotes for 19.0.6	2019-06-05 16:32:35 -07:00
Dylan Baker	3a8337416f	VERSION: bump to 19.0.6	2019-06-05 16:30:14 -07:00
Deepak Rawat	f9ac7bb8fb	winsys/drm: Fix out of scope variable usage In this particular instance, struct member were used outside of the block where it was defined. Fix this by moving the definition outside of block. Signed-off-by: Deepak Rawat <drawat@vmware.com> Fixes: `569f838987` ("winsys/svga: Add support for new surface ioctl, multisample pattern") Reviewed-by: Brian Paul <brianp@vmware.com> (cherry picked from commit `828e1b0b4c`)	2019-06-05 12:03:25 -07:00
Emil Velikov	0b9b177f4d	egl/dri: flesh out and use dri2_create_drawable() Wrap the loader->createNewDrawable() dance into a helper and use it throughout the codebase. This addresses a cases like surfaceless (SL) on swrast (SL on kms_swrast is fine) where we'd attempt using the wrong driver and crash out. v2: fixup quirky GBM (Mathias) v3: fixup GBM for real (Marek) Cc: mesa-stable@lists.freedesktop.org Cc: Mathias Fröhlich <Mathias.Froehlich@web.de> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de> (v1) Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (v2) Signed-off-by: Marek Olšák <marek.olsak@amd.com> (v2) Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `2282ec0ad6`)	2019-06-05 12:03:25 -07:00
Marek Olšák	f678b855a3	ac: fix a typo in ac_build_wg_scan_bottom Cc: 19.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `c9b64b58de`)	2019-06-05 12:03:25 -07:00
Dylan Baker	7379a9b44f	gallium: wrap u_screen in extern "C" for c++ Some drivers (notabily SWR) are written in C++, and as such they need access to C headers with extern "C". So lets add that.	2019-06-04 13:42:33 -05:00
Alok Hota	24a5b56d69	gallium/aux: add PIPE_CAP_MAX_VARYINGS to u_screen Allows drivers using `u_pipe_screen_get_param_defaults` to use a fallback value for the new pipe cap. Default value of 8 based on GL 2.1 MAX_VARYING_FLOATS Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-04 13:42:22 -05:00
Alok Hota	bf27318801	gallium/swr: Param defaults for unhandled PIPE_CAPs Without using this function, we fail the -Wswitch flag when compiling the default debugoptimized mode in Meson Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-06-04 13:42:15 -05:00
Dylan Baker	9d7e71b6c3	cherry-ignore: add another 19.1 only patch	2019-06-04 09:15:14 -07:00
Samuel Pitoiset	871c79ed0b	radv: do not use gfx fast depth clears for layered depth/stencil images The driver should only fast depth clears with the graphics path when the view covers all image layers, otherwise this might corrupt layers when HTILE is enabled. Cc: 19.0 19.1 mesa-stable@lists.freedesktop.org Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `8a35eb0602`)	2019-06-04 09:08:51 -07:00
Sagar Ghuge	9261f14e77	intel/compiler: Fix assertions in brw_alu3 v2: Fix assertion for src1 (Ian Romanick) Fixes: `3b967e17` (intel/compiler: Avoid false positive assertions) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Suggested-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `3016756398`)	2019-06-04 09:08:46 -07:00
Timothy Arceri	7c40c5675d	st/glsl: make sure to propagate initialisers to driver storage This essentially reverts `20234cfe3a`. Fixes piglit test: tests/spec/arb_get_program_binary/execution/uniform-after-restore.shader_test Fixes: `20234cfe3a` "st/mesa: don't propagate uniforms when restoring from cache" Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110784 (cherry picked from commit `fea36a8f43`)	2019-06-04 09:08:40 -07:00
Marek Olšák	083159c78e	u_blitter: don't fail mipmap generation for depth formats containing stencil Bugzilla: https://bugzilla.freedesktop.org/show_bug.cgi?id=109754 Cc: 19.0 19.1 <mesa-stable@lists.freedesktop.org> Tested-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> (cherry picked from commit `4b11ed443b`)	2019-06-04 09:08:16 -07:00
Rob Clark	6bf0ccb6f5	freedreno/a6xx: fix GPU crash on small render targets Fixes dEQP-GLES2.functional.multisampled_render_to_texture.readpixels Signed-off-by: Rob Clark <robdclark@chromium.org> Acked-by: Eric Anholt <eric@anholt.net> (cherry picked from commit `8eaa2d5021`)	2019-06-04 09:06:39 -07:00
Rob Clark	214409b09e	freedreno/ir3: set more barrier bits Blob is also setting the .l bit, and it seems to solve some intermittent failures with a couple of deqp's: dEQP-GLES31.functional.image_load_store.2d.qualifiers.coherent_r32i dEQP-GLES31.functional.image_load_store.2d.qualifiers.volatile_r32f Signed-off-by: Rob Clark <robdclark@chromium.org> Acked-by: Eric Anholt <eric@anholt.net> (cherry picked from commit `f9fa456e1d`)	2019-06-04 09:06:08 -07:00
Rob Clark	8a616376ab	freedreno/ir3: dynamic UBO indexing vs 64b pointers Fixes dEQP-GLES31.functional.shaders.opaque_type_indexing.ubo.uniform_fragment and similar things with multiple UBOs Signed-off-by: Rob Clark <robdclark@gmail.com> (cherry picked from commit `6e781a01b9`)	2019-06-04 09:03:50 -07:00
Bas Nieuwenhuizen	96788ffcec	nir: Actually propagate progress in nir_opt_move_load_ubo. Found with Jasons new metadata rework (https://gitlab.freedesktop.org/mesa/mesa/merge_requests/950). Fixes: `af355aaa07` "nir: add nir_opt_move_load_ubo() optimization pass" Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `e24a7840f6`)	2019-05-31 08:58:31 -07:00
Jan Zielinski	279b31bc35	swr/rast: fix 32-bit compilation on Linux Removing unused but problematic code from simdlib header to fix compilation problem on 32-bit Linux. Reviewed-by: Alok Hota <alok.hota@intel.com> (cherry picked from commit `cf673747ce`)	2019-05-31 08:57:14 -07:00
Dylan Baker	d0f09e72b4	cherry-ignore: add another 19.1 only patch	2019-05-30 10:14:25 -07:00
Dylan Baker	f6fc9f6a2c	cherry-ignore: Add a commit that was manually backported	2019-05-28 11:24:34 -07:00
Samuel Pitoiset	c65c77074f	radv: allocate more space in the CS when emitting events If the driver waits for CP DMA to be idle and emit an EOP event we need more space. This fixes a crash with Quake Champions. Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `47a10edefb`)	2019-05-28 11:24:33 -07:00
Timothy Arceri	a7e8255c9b	Revert "st/mesa: expose 0 shader binary formats for compat profiles for Qt" This reverts commit `55376cb31e`. It's been over a year and both QT 5.9.5 and 5.11.0 contained a fix for the original issue. It seems i965 only ever applied this workaround to the 18.0 branch. Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `11e16ca7ce`)	2019-05-28 11:24:33 -07:00
Chenglei Ren	970770c5bd	anv/android: fix missing dependencies issue during parallel build The libmesa_anv_gen* modules require anv_extensions.h, patch makes sure it gets generated as a dependency before building them. Signed-off-by: Chenglei Ren <chenglei.ren@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Cc: <mesa-stable@lists.freedesktop.org> (cherry picked from commit `13b38ca1e4`)	2019-05-28 11:24:33 -07:00
Dave Airlie	bcd9270dbc	Revert "mesa: unreference current winsys buffers when unbinding winsys buffers" This reverts commit `12bf7cfecf`. This commits caused lots of problems: https://bugs.freedesktop.org/show_bug.cgi?id=110721 https://bugs.freedesktop.org/show_bug.cgi?id=110761 Fixes: `12bf7cfecf` ("mesa: unreference current winsys buffers when unbinding winsys buffers") Pushing without review as we need to get it into next stable. (cherry picked from commit `7fe5a8e874`)	2019-05-28 11:24:33 -07:00
Christian Gmeiner	82b79b032e	etnaviv: use the correct uniform dirty bits Found during code inspection. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> (cherry picked from commit `78fb5594be`)	2019-05-28 11:24:33 -07:00
Lionel Landwerlin	77b0da0cd3	vulkan: fix build dependency issue with generated files On machines with many cores, you can run into that issue : ../mesa-9999/src/vulkan/overlay-layer/overlay.cpp:42:10: fatal error: vk_enum_to_str.h: No such file or directory v2: Move declare_dependency around (Eric) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by: Jan Ziak Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Rebased by Dylan Conflicts: src/amd/vulkan/meson.build src/freedreno/vulkan/meson.build src/intel/vulkan/meson.build src/vulkan/overlay-layer/meson.build src/vulkan/wsi/meson.build	2019-05-28 11:24:33 -07:00
Philipp Zabel	c49c3a1b81	etnaviv: fill missing offset in etna_resource_get_handle Without this gbm_bo_get_offset() can return 0 where it shouldn't. Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Cc: <mesa-stable@lists.freedesktop.org> (cherry picked from commit `1ccb8a071b`)	2019-05-28 11:24:33 -07:00
Dylan Baker	7e48844421	docs: Add Sha256 sums for 19.0.5	2019-05-21 14:17:09 -07:00
Dylan Baker	9e3d9c4e13	docs: Add release notes for 19.0.5	2019-05-21 14:10:20 -07:00
Dylan Baker	bec0a67629	bump version to 19.0.5	2019-05-21 09:18:03 -07:00
Caio Marcelo de Oliveira Filho	e64fc93148	nir: Fix clone of nir_variable state slots When num_state_slots is 0, don't create the array. This was triggering the following assert when running vkcube with NIR_TEST_CLONE=1 vkcube: ../src/compiler/nir/nir_split_per_member_structs.c:66: split_variable: Assertion `var->state_slots == NULL' failed. Fixes: `9fbd390dd4` "nir: Add support for cloning shaders" Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `005cc9ae37`)	2019-05-21 09:11:22 -07:00
Charmaine Lee	1302f20ddb	mesa: unreference current winsys buffers when unbinding winsys buffers This fixes surface leak when no winsys buffers are bound. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Brian Paul <brianp@vmware.com> (cherry picked from commit `12bf7cfecf`)	2019-05-21 09:11:16 -07:00
Charmaine Lee	d1443da4f0	st/mesa: purge framebuffers with current context after unbinding winsys buffers With commit `c89e8470e5`, framebuffers are purged after unbinding context, but this change also introduces a heap corruption when running Rhino application on VMware svga device. Instead of purging the framebuffers after the context is unbound, this patch first ubinds the winsys buffers, then purges the framebuffers with the current context, and then finally unbinds the context. This fixes heap corruption. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Brian Paul <brianp@vmware.com> (cherry picked from commit `b480adfa5e`)	2019-05-21 09:11:03 -07:00
Eric Engestrom	03cb07168f	meson: expose glapi through osmesa Suggested-by: Pierre Guillou <pierre.guillou@lip6.fr> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109659 Fixes: `f121a669c7` "meson: build gallium based osmesa" Fixes: `cbbd5bb889` "meson: build classic osmesa" Cc: Brian Paul <brianp@vmware.com> Cc: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Tested-by: Chuck Atkins <chuck.atkins@kitware.com> (cherry picked from commit `ccb8ea7acf`)	2019-05-20 09:52:56 -07:00
Ian Romanick	dfe2258cc1	Revert "nir: add late opt to turn inot/b2f combos back to bcsel" This reverts commit `7acc865226`. With these optimizations in place, the extra constant folding added in the next commit extends some live ranges of 0.0 and ±1.0 constants, and that causes several hundred shaders to have more spills and fills. I believe this optimization we made basically irrelevant by `7725d60938` "intel/fs: Emit better code for b2f(inot(a)) and b2i(inot(a))". All Gen7.5+ platforms had similar results. (Ice Lake shown) total instructions in shared programs: 17225303 -> 17224634 (<.01%) instructions in affected programs: 879402 -> 878733 (-0.08%) helped: 679 HURT: 1 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.03% max: 0.93% x̄: 0.24% x̃: 0.05% HURT stats (abs) min: 10 max: 10 x̄: 10.00 x̃: 10 HURT stats (rel) min: 0.45% max: 0.45% x̄: 0.45% x̃: 0.45% 95% mean confidence interval for instructions value: -1.02 -0.95 95% mean confidence interval for instructions %-change: -0.26% -0.22% Instructions are helped. total cycles in shared programs: 360842595 -> 360828542 (<.01%) cycles in affected programs: 110443594 -> 110429541 (-0.01%) helped: 389 HURT: 265 helped stats (abs) min: 1 max: 7525 x̄: 162.81 x̃: 28 helped stats (rel) min: <.01% max: 18.66% x̄: 1.11% x̃: 0.11% HURT stats (abs) min: 1 max: 7614 x̄: 185.96 x̃: 48 HURT stats (rel) min: <.01% max: 25.08% x̄: 0.95% x̃: 0.10% 95% mean confidence interval for cycles value: -75.65 32.67 95% mean confidence interval for cycles %-change: -0.49% -0.06% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 12159 -> 12161 (0.02%) spills in affected programs: 13 -> 15 (15.38%) helped: 0 HURT: 1 total fills in shared programs: 25207 -> 25208 (<.01%) fills in affected programs: 25 -> 26 (4.00%) helped: 0 HURT: 1 Ivy Bridge total instructions in shared programs: 12082019 -> 12082013 (<.01%) instructions in affected programs: 1033 -> 1027 (-0.58%) helped: 6 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.41% max: 0.83% x̄: 0.61% x̃: 0.59% 95% mean confidence interval for instructions value: -1.00 -1.00 95% mean confidence interval for instructions %-change: -0.78% -0.45% Instructions are helped. total cycles in shared programs: 179849270 -> 179849157 (<.01%) cycles in affected programs: 4735 -> 4622 (-2.39%) helped: 4 HURT: 0 helped stats (abs) min: 2 max: 74 x̄: 28.25 x̃: 18 helped stats (rel) min: 0.13% max: 6.53% x̄: 2.85% x̃: 2.36% 95% mean confidence interval for cycles value: -82.73 26.23 95% mean confidence interval for cycles %-change: -7.98% 2.28% Inconclusive result (value mean confidence interval includes 0). Sandy Bridge total instructions in shared programs: 10882750 -> 10882748 (<.01%) instructions in affected programs: 266 -> 264 (-0.75%) helped: 2 HURT: 0 Iron Lake total cycles in shared programs: 188609440 -> 188609448 (<.01%) cycles in affected programs: 4320 -> 4328 (0.19%) helped: 0 HURT: 2 GM45 total cycles in shared programs: 129016868 -> 129016872 (<.01%) cycles in affected programs: 2302 -> 2306 (0.17%) helped: 0 HURT: 1 Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `d2a9ba03e3`) Conflicts resolved by Dylan Conflicts: src/compiler/nir/nir_opt_algebraic.py	2019-05-17 15:30:00 -07:00
Dylan Baker	0ed91c772d	cherry-ignore: Add more 19.1 patches	2019-05-17 15:28:12 -07:00
Gert Wollny	cab826d5a8	Revert "softpipe/buffer: load only as many components as the the buffer resource type provides" This reverts commit `865b9ddae4`. The buffer always reports format PIPE_FORMAT_R8_UNORM so with this patch only one component would be supported. The original issue is still relevant, but the fix should be different. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `0f598ed7b3`)	2019-05-17 15:24:45 -07:00
Jason Ekstrand	93d278a73a	anv: Only consider minSampleShading when sampleShadingEnable is set From the Vulkan 1.1.107 spec: Sample shading is enabled for a graphics pipeline: - If the interface of the fragment shader entry point of the graphics pipeline includes an input variable decorated with SampleId or SamplePosition. In this case minSampleShadingFactor takes the value 1.0. - Else if the sampleShadingEnable member of the VkPipelineMultisampleStateCreateInfo structure specified when creating the graphics pipeline is set to VK_TRUE. In this case minSampleShadingFactor takes the value of VkPipelineMultisampleStateCreateInfo::minSampleShading. Otherwise, sample shading is considered disabled. In other words, if sampleShadingEnable is set to VK_FALSE, we should ignore minSampleShading. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `1c92358bd8`)	2019-05-17 15:20:39 -07:00
Samuel Pitoiset	7be21f6575	radv: add a workaround for Monster Hunter World and LLVM 7&8 The load/store optimizer pass doesn't handle WaW hazards correctly and this is the root cause of the reflection issue with Monster Hunter World. AFAIK, it's the only game that are affected by this issue. This is fixed with LLVM r361008, but we need a workaround for older LLVM versions unfortunately. Cc: "19.0" "19.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `d7501834cd`)	2019-05-17 15:20:14 -07:00
Nanley Chery	e8d9b33986	anv: Fix some depth buffer sampling cases on ICL+ Don't attempt sampling with HiZ if the sampler lacks support for it. On ICL, the HW docs state that sampling with HiZ is not supported and that instances of AUX_HIZ in the RENDER_SURFACE_STATE object will be interpreted as AUX_NONE. Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> (cherry picked from commit `629806b55b`)	2019-05-17 15:20:07 -07:00
Caio Marcelo de Oliveira Filho	fe7f221ddb	nir: Fix nir_opt_idiv_const when negatives are involved First, allow the case for negative powers of two. Then ensure that we use the absolute value of the non-constant value to calculate the quotient -- this was hinted in the code by the name 'uq'. This fixes an issue when 'd' is positive and 'n' is negative. The ishr will propagate the negative sign and we'll use nir_ineg() again, incorrectly. v2: First version used only ishr, but that isn't sufficient, since it never can produce a zero as a result. (Jason) Allow negative powers of two. (Caio) Fixes: `74492ebad9` "nir: Add a pass for lowering integer division by constants" Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `8a995f2b5e`)	2019-05-17 15:20:00 -07:00
Jason Ekstrand	0d645c98f2	intel/fs/ra: Stop adding RA interference to too many SENDS nodes We only have one node per VGRF so this was adding way too much interference. No idea how we didn't catch this before. Shader-db results on Kaby Lake: total instructions in shared programs: 15311100 -> 15311100 (0.00%) instructions in affected programs: 0 -> 0 helped: 0 HURT: 0 total cycles in shared programs: 355468050 -> 355543197 (0.02%) cycles in affected programs: 2472492 -> 2547639 (3.04%) helped: 17 HURT: 20 Fixes: `014edff0d2` "intel/fs: Add interference between SENDS sources" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `096ad8a809`)	2019-05-17 15:19:23 -07:00
Jason Ekstrand	1621a5ab55	intel/fs/ra: Only add dest interference to sources that exist Fixes: `83dedb6354` "i965: Add src/dst interference for certain" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `88cac12230`)	2019-05-17 15:19:16 -07:00
Gert Wollny	2274d43fa6	softpipe/buffer: load only as many components as the the buffer resource type provides Otherwise we risk to read past the end of the buffer. In addition, change the loop counters to unsigned to be consistent with the types. Fixes: `afa8707ba9` softpipe: add SSBO/shader atomics support. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `865b9ddae4`)	2019-05-17 15:19:10 -07:00
Józef Kucia	5cac14f77a	radv: clear vertex bindings while resetting command buffer Only vertex inputs accessed by vertex shader must have valid buffers bound. Signed-off-by: Józef Kucia <joseph.kucia@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Fixes: `5010436e09` "radv: bail out when binding the same vertex buffers" (cherry picked from commit `24af0f1318`)	2019-05-13 11:27:45 -07:00
Marek Olšák	4f8992efac	st/mesa: fix 2 crashes in st_tgsi_lower_yuv src/mesa/state_tracker/st_tgsi_lower_yuv.c:68: void reg_dst(struct tgsi_full_dst_register , const struct tgsi_full_dst_register , unsigned int): assertion "dst->Register.WriteMask" failed The second crash was due to insufficient allocated size for TGSI instructions. Cc: 19.0 19.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Rob Clark <robdclark@gmail.com> (cherry picked from commit `83435e748f`)	2019-05-13 11:27:40 -07:00
Kenneth Graunke	aaf9a11f79	i965: Fix memory leaks in brw_upload_cs_work_groups_surface(). This was taking a reference to the 64kB upload buffer and never returning it, leaking a reference each time this atom triggered. This leaked lots of 64kB upload BOs, eventually running us out of of VMA space. This would usually happen when using mpv to watch a movie, after 20-40 minutes. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110134 Fixes: `63d7b33f51` i965/cs: Setup surface binding for gl_NumWorkGroups Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> (cherry picked from commit `3f60810de0`)	2019-05-13 11:27:24 -07:00
Dylan Baker	1221311fb1	cherry-ignore: add patches for panfrost there is no panfrost in 19.0	2019-05-10 10:31:22 -07:00
Leo Liu	1ad8a0e751	winsys/amdgpu: add VCN JPEG to no user fence group There is no user fence for JPEG, the bug triggering kernel WARN_ON(flags & AMDGPU_FENCE_FLAG_64BIT) Signed-off-by: Leo Liu <leo.liu@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `ceba9ff294`)	2019-05-10 10:30:02 -07:00
Lionel Landwerlin	f4ab855312	anv: Use corresponding type from the vector allocation We didn't notice this issue much because the 2 struct share a similar layout, expect for the additional fields... We run into that issue in Anv : ==15236== Invalid write of size 8 ==15236== at 0x8CF3939C: anv_state_table_expand_range (anv_allocator.c:211) ==15236== by 0x8CF394D5: anv_state_table_grow (anv_allocator.c:264) ==15236== by 0x8CF3967E: anv_state_table_add (anv_allocator.c:312) ==15236== by 0x8CF3B13C: anv_state_pool_alloc_no_vg (anv_allocator.c:1167) ==15236== by 0x8CF3B2B0: anv_state_pool_alloc (anv_allocator.c:1190) ==15236== by 0x8CF60871: alloc_surface_state (anv_image.c:1122) ==15236== by 0x8CF61FF9: anv_CreateImageView (anv_image.c:1519) ==15236== by 0x8BCBD2ED: vkCreateImageView (trampoline.c:1358) ==15236== Address 0x8994ef10 is 0 bytes after a block of size 128 alloc'd ==15236== at 0x4C2FB0F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==15236== by 0x8D2578E6: u_vector_init (u_vector.c:47) ==15236== by 0x8CF3929A: anv_state_table_init (anv_allocator.c:168) ==15236== by 0x8CF3A99A: anv_state_pool_init (anv_allocator.c:921) ==15236== by 0x8CF56517: anv_CreateDevice (anv_device.c:1909) ==15236== by 0x8BCB4FBA: terminator_CreateDevice (loader.c:6073) ==15236== by 0x8DD2CB3D: ??? (in /home/djdeath/.steam/ubuntu12_64/libVkLayer_steam_fossilize.so) ==15236== by 0x8DF4D241: vkCreateDevice (in /home/djdeath/.steam/ubuntu12_64/steamoverlayvulkanlayer.so) ==15236== by 0x8BCB35C6: loader_create_device_chain (loader.c:5449) ==15236== by 0x8BCBC230: vkCreateDevice (trampoline.c:838) v2: Rename mmap_cleanups to avoid confusion (Caio) v3: s/fail_mmap_cleanups/fail_cleanups/ (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110648 Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> (cherry picked from commit `f2f6ac1c08`)	2019-05-10 10:29:53 -07:00
Dylan Baker	ab41ddb671	docs: Add SHA256 sums for mesa 19.0.4	2019-05-09 13:45:19 -07:00
Dylan Baker	a17ac460a2	Docs: add 19.0.4 release notes	2019-05-09 13:29:15 -07:00
Dylan Baker	c045d81c3d	VERSION: bump for 19.0.4 release	2019-05-09 13:18:08 -07:00
Samuel Pitoiset	b5698550dd	radv: fix setting the number of rectangles when it's dyanmic We need to know the number of rectangles. This fixes new CTS dEQP-VK.draw.discard_rectangles.dynamic_*. Fixes: `5db0bf9994` ("radv: Implement VK_EXT_discard_rectangles.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `53dfff1c4d`)	2019-05-09 10:18:08 -07:00
Dylan Baker	1281a2207c	meson: Force the use of config-tool for llvm meson git now has a cmake find method for llvm, but it lacks a couple of features that we use from the config tool version. Until that reaches parity we need to use the config-tool version. CC: 19.0 19.1 <<mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `0d59459432`)	2019-05-09 10:18:01 -07:00
Lionel Landwerlin	ba47599c79	anv: rework queries writes to ensure ordering memory writes We use a mix of MI & PIPE_CONTROL commands to write our queries' data (results & availability). Those commands' memory write order is not guaranteed with regard to their order in the command stream, unless CS stalls are inserted between them. This is problematic for 2 reasons : 1. We copy results from the device using MI commands even though the values are generated from PIPE_CONTROL, meaning we could copy unlanded values into the results and then copy the availability that is inconsistent with the values. 2. We allow the user to poll on the availability values of the query pool from the CPU. If the availability lands in memory before the values then we could return invalid values. This change does 2 things to address this problem : - We use either PIPE_CONTROL or MI commands to write both queries values and availability, so that the ordering of the memory writes guarantees that if availability is visible, results are also visible. - For the occlusion & timestamp queries we apply a CS stall before copying the results on the device, to ensure copying with MI commands see the correct values of previous PIPE_CONTROL writes of availability (required by the Vulkan spec). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by: Iago Toral Quiroga <itoral@igalia.com> Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `a07d06f103`)	2019-05-09 00:49:12 +01:00
Timothy Arceri	0ebd8e316d	Revert "glx: Fix synthetic error generation in __glXSendError" This reverts commit `e91ee763c3`. This seems to have broken a number of wine games. Lets revert everything for now and try again later. Acked-by: Adam Jackson <ajax@redhat.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110632 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110590 (cherry picked from commit `a01b393c39`)	2019-05-08 16:40:19 -07:00
Kenneth Graunke	529981bc19	i965: leave the top 4Gb of the high heap VMA unused This ports commit `9e7b0988d6` from anv to i965. Thanks to Lionel for noticing that it was missing! Fixes: `01058a5522` i965: Add virtual memory allocator infrastructure to brw_bufmgr. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `d568fcd0a0`)	2019-05-08 16:40:19 -07:00
Kenneth Graunke	be928454f0	i965: Force VMA alignment to be a multiple of the page size. This should happen regardless, but let's be paranoid. Fixes: `01058a5522` i965: Add virtual memory allocator infrastructure to brw_bufmgr. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `17210c63a9`)	2019-05-08 16:40:19 -07:00
Kenneth Graunke	123e5c142b	i965: Fix BRW_MEMZONE_LOW_4G heap size. The STATE_BASE_ADDRESS "Size" fields can only hold 0xfffff in pages, and 0xfffff * 4096 = 4294963200, which is 1 page shy of 4GB. So we can't use the top page. Fixes: `01058a5522` i965: Add virtual memory allocator infrastructure to brw_bufmgr. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `15f134c628`)	2019-05-08 16:40:19 -07:00
Dylan Baker	f2b085df41	cherry-ignore: Add more patches The radv patch was manually backported by Samuel (thanks!), and the two android build system patches were de-nominated by their submitter, as they would require many additional patches to be pulled in	2019-05-08 16:40:19 -07:00
Samuel Pitoiset	29e6f5c11b	radv: apply the indexing workaround for atomic buffer operations on GFX9 Because the new raw/struct intrinsics are buggy with LLVM 8 (they weren't marked as source of divergence), we fallback to the old instrinsics for atomic buffer operations only. This means we need to apply the indexing workaround for GFX9. The load/store operations still use the new LLVM 8 intrinsics. The fact that we need another workaround is painful but we should be able to clean up that a bit once LLVM 7 support will be dropped. This fixes a GPU hang with AC Odyssey and some rendering problems with Nioh. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110573 Fixes: `31164cf5f7` ("ac/nir: only use the new raw/struct image atomic intrinsics with LLVM 9+") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-05-08 16:40:19 -07:00
Emil Velikov	b157a98978	configure.ac: check for libdrm when using VL with X11 The X11 specific code uses libdrm, yet we are missing the dependency. This has gone unnoticed since all drivers which use VL already mandate the library. Note: this is applicable only for the stable branches. Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Tested-by: Alyssa Ross <hi@alyssa.is>	2019-05-08 16:40:19 -07:00
Dylan Baker	7011e21890	cherry-ignore: Add another patch	2019-05-08 16:40:19 -07:00
Timothy Arceri	553f29b6c3	radeonsi: add config entry for Counter-Strike Global Offensive This fixes rendering issues with gun scopes which is rather important. Cc: "19.0" "19.1" <mesa-stable@lists.freedesktop.org> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100239 (cherry picked from commit `49025292fb`)	2019-05-08 16:40:19 -07:00
Erik Faye-Lund	0fedb0fdc6	draw: flush when setting stream-out targets We need to re-prepare the middle-end state to pick up changes to this state to react correctly to pausing/resuming stream-out. So let's add a flush here. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Fixes: `ec8cbd79ac` "draw/softpipe: EXT_transform_feedback support (v2)" Reviewed-by: Roland Scheidegger <sroland@vmware.com> (cherry picked from commit `d84b85bc28`)	2019-05-08 16:40:19 -07:00
Timothy Arceri	ced513b691	util/drirc: add workarounds for bugs in Doom 3: BFG This makes the game playable on radeonsi. Cc: "19.0" "19.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110143 (cherry picked from commit `1af72fa4d6`)	2019-05-08 16:40:19 -07:00
Chuck Atkins	444e938285	meson: Fix missing glproto dependency for gallium-glx Signed-off-by: Chuck Atkins <chuck.atkins@kitware.com> Cc: mesa-stable <mesa-stable@lists.freedesktop.org> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> (cherry picked from commit `a381dbf253`)	2019-05-08 16:40:19 -07:00
Dave Airlie	ad15e3049d	util/bitset: fix bitset range mask calculations. The MASK macro is used in the RANGE macro, and it should return the pre-bitset word mask for the (b) value. i.e. BITSET_MASK(0) should be undefined since it's meaningless. BITSET_MASK(31) should give 0x7fffffff BITSET_MASK(32) should give 0xffffffff BITSET_MASK(33) should give 0x00000001 BITSET_MASK(64) should give 0xffffffff However then BITSET_RANGE ends up broken for cases where it's (b) value is the 0,32,64 value as in that case the lower mask would be 0 not 0xffffffff. This fixes the unit tests that I've added, and my code that uses bitsets. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Fixes: `bb38cadb1c` "More GLSL code" Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> (cherry picked from commit `512a31a412`)	2019-05-08 16:40:19 -07:00
Samuel Pitoiset	f767301bba	radv: only need to force emit the TCS regs on Vega10 and Raven1 Other GFX9 chips aren't affected. Cc: "19.0" "19.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `62001f3dff`)	2019-05-08 16:40:19 -07:00
Francisco Jerez	50174f01f2	intel/fs: Cap dst-aligned region stride to maximum representable hstride value. This is required in combination with the following commit, because otherwise if a source region with an extended 8+ stride is present in the instruction (which we're about to declare legal) we'll end up emitting code that attempts to write to such a region, even though strides greater than four are still illegal for the destination. Tested-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `7f9f6263c1`)	2019-05-08 16:40:18 -07:00
Francisco Jerez	c3ba3370e6	intel/fs: Lower integer multiply correctly when destination stride equals 4. Because the "low" temporary needs to be accessed with word type and twice the original stride, attempting to preserve the alignment of the original destination can potentially lead to instructions with illegal destination stride greater than four. Because the CHV/BXT alignment restrictions are now being enforced by the regioning lowering pass run after lower_integer_multiplication(), there is no real need to preserve the original strides anymore. Note that this bug can be reproduced on stable branches, but back-porting would be non-trivial, because the fix relies on the regioning lowering pass recently introduced. Tested-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `e2f475ddff`)	2019-05-08 16:40:18 -07:00
Bas Nieuwenhuizen	d6afb4a7c6	radv: Disable VK_EXT_descriptor_indexing. We did not implement the required non-uniform indexing features. This patch is to disable the extension on 19.0. For 19.1 the necessary functionality has been implemented. Fixes: `0e10790558` "radv: Enable VK_EXT_descriptor_indexing." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-05-08 16:40:18 -07:00
Samuel Pitoiset	f861f042b8	radv: set WD_SWITCH_ON_EOP=1 when drawing primitives from a stream output buffer According to RadeonSI, this seems to be required by the hardware to avoid GPU hangs. I think I just forgot to set that bit when I implemented VK_EXT_transform_feedback. This fixes a GPU hang with Space Engineers and DXVK. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110291 Fixes: `b4eb029062` ("radv: implement VK_EXT_transform_feedback") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `08be23bfde`)	2019-05-08 16:40:18 -07:00
Brian Paul	cbce2f47fa	svga: add SVGA_NO_LOGGING env var (v2) valgrind crashes when we try to initialize host logging. This env var can be used to disable logging. v2: rebase onto "svga: move host logging to winsys". Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Neha Bhende <bhenden@vmware.com> (cherry picked from commit `f0f7c3b03a`)	2019-05-08 16:40:18 -07:00
Charmaine Lee	646c77d718	svga: move host logging to winsys This patch adds a host_log interface to svga_winsys and moves the host logging code to the winsys layer. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Neha Bhende <bhenden@vmware.com> (cherry picked from commit `9c5f407b0b`)	2019-05-08 16:40:18 -07:00
Daniel Stone	e8a0371d25	vulkan/wsi/wayland: Respect non-blocking AcquireNextImage If the client has requested that AcquireNextImage not block at all, with a timeout of 0, then don't make any non-blocking calls. This will still potentially block infinitely given a non-infinte timeout, but the fix for that is much more involved. Signed-off-by: Daniel Stone <daniels@collabora.com> Cc: mesa-stable@lists.freedesktop.org Cc: Chad Versace <chadversary@chromium.org> Cc: Jason Ekstrand <jason@jlekstrand.net> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108540 Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `9826e04eca`)	2019-05-08 16:40:18 -07:00
Rhys Perry	3bd0b970fc	radv: fix set_output_usage_mask() with composite and 64-bit types It previously used var->type instead of deref_instr->type and didn't handle 64-bit outputs. This fixes lots of transform feedback CTS tests involving transform feedback and geometry shaders (mostly dEQP-VK.transform_feedback.fuzz.random_geometry.*) v2: fix writemask widening when comp != 0 v3: fix 64-bit variables when comp != 0, again Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Cc: 19.0 19.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `13c423629e`)	2019-05-08 16:40:18 -07:00
Samuel Pitoiset	3618405c6a	radv: implement a workaround for VK_EXT_conditional_rendering Per the Vulkan spec 1.1.107, the predicate is a 32-bit value. Though the AMD hardware treats it as a 64-bit value which means it might fail to discard. I don't know why this extension has been drafted like that but this definitely not fit with AMD. The hardware doesn't seem to support a 32-bit value for the predicate, so we need to implement a workaround. This fixes an issue when DXVK enables conditional rendering with RADV, this also fixes the Sasha conditionalrender demo. Fixes: `e45ba51ea4` ("radv: add support for VK_EXT_conditional_rendering") Reported-by: Philip Rebohle <philip.rebohle@tu-dortmund.de> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `6ac10e07c2`)	2019-05-08 16:40:18 -07:00
Samuel Pitoiset	585c25be1e	radv: fix color conversions for normalized uint/sint formats The hardware actually rounds before conversion. This now matches what values are used when performing fast clears vs slow clears. This fixes a rendering issue with Far Cry 3&4. This also fixes a bunch of CTS tests that use a 8-bit UNORM format (only when the 512*512 image size hint is manually disabled). Cc: "19.0" "19.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `e03e7c510f`)	2019-05-08 16:40:18 -07:00
Samuel Pitoiset	015e8974ab	radv: do not need to force emit the TCS regs on Vega20 This chip doesn't need the fixup. This fixes a bunch of dEQP-VK.tessellation tests and avoid random GPU hangs. Cc: "19.0" "19.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `6162543999`)	2019-05-08 16:40:18 -07:00
Dave Airlie	227df9464e	r600: reset tex array override even when no view bound If no view is bound we still should reset the override to 0 and array mode. This should fix misrendering in firefox WebRender since the pbo sampler was removed. Fixes: `1250383e36` (st/mesa: remove sampler associated with buffer texture in pbo logic) (cherry picked from commit `e2fecf57e3`)	2019-05-08 16:40:18 -07:00
Erik Faye-Lund	5bebe2a00d	softpipe: setup pixel_offset for all primitive types If we don't update this for all primitive-types, we end up rendering slightly offset points and lines up until the point where the first triangle gets drawn. This is obviously not correct, and violates OpenGL's repeatability rule. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Fixes: `ca9c413647` ("softpipe: Respect gl_rasterization_rules in primitive setup.") Reviewed-By: Gert Wollny <gert.wollny@collabora.com> (cherry picked from commit `f753ac355e`)	2019-05-08 16:40:18 -07:00
Andrii Simiklit	4b0f059938	egl: return correct error code for a case req ver < 3 with forward-compatible The EGL_KHR_create_context spec says: "If an OpenGL context is requested and the values for attributes EGL_CONTEXT_MAJOR_VERSION_KHR and EGL_CONTEXT_MINOR_VERSION_KHR, when considered together with the value for attribute EGL_CONTEXT_OPENGL_FORWARD_COMPATIBLE_BIT_KHR, specify an OpenGL version and feature set that are not defined, than an EGL_BAD_MATCH error is generated." This case is already correctly handled a bit below in the same source file. The correct handling was added by commit: `63beb3df` Reported-by: Ian Romanick <idr@freedesktop.org> Here: https://bugzilla.freedesktop.org/show_bug.cgi?id=92552#c9 Fixes: `11cabc45b7` "egl: rework handling EGL_CONTEXT_FLAGS" Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> (cherry picked from commit `5c581b3dd6`)	2019-05-08 16:40:18 -07:00
Axel Davy	fabaae4d8d	st/nine: Fix D3DWindowBuffer_release for old wine nine support No-one reported bugs for that, but is seems `c442dd7890` and previous commits used APIs not defined until nine minor version 3. This patch should prevent crash in this case. Also turn off the resize feature in this case, as we won't prevent a buffer leak anymore. Cc: "19.0" mesa-stable@lists.freedesktop.org Signed-off-by: Axel Davy <davyaxel0@gmail.com> (cherry picked from commit `64a45ba7f8`)	2019-05-08 16:40:18 -07:00
Ian Romanick	6e710c1fb4	mesa: Add missing display list support for GL_FOG_COORDINATE_SOURCE Fixes: `fe5d67d95f` ("Implement EXT_fog_coord and EXT_secondary_color.") Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de> Cc: Brian Paul <brianp@vmware.com> (cherry picked from commit `bfc6486819`)	2019-05-08 16:40:18 -07:00
Alejandro Piñeiro	3fe72ca214	docs: document MESA_GLSL=errors keyword Added with commit `0161691f35`, still checked on shaderapi.c _mesa_get_shader_flag method. Fixes: `0161691f35` "mesa: add GLSL_REPORT_ERRORS debug flag" Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `9b6a00e66e`)	2019-05-08 16:40:18 -07:00
Ross Burton	05c7d8c551	Revert "meson: drop GLESv1 .so version back to 1.0.0" This patch claimed that the autotools build generates libGLESv1_CM.so.1.0.0, but it doesn't: es1api_libGLESv1_CM_la_LDFLAGS = \ -no-undefined \ -version-number 1:1 \ $(GC_SECTIONS) \ $(LD_NO_UNDEFINED) Revert commit `cc15460e18` to ensure that the autotools and meson builds produce the same libraries. Fixes: `cc15460e18` "meson: drop GLESv1 .so version back to 1.0.0" Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> (cherry picked from commit `1c1efa4ca9`)	2019-05-08 16:40:18 -07:00
Juan A. Suarez Romero	96fe97d133	radv: enable descriptor indexing capabilities This enables the remaining capabilities in SPV_EXT_descriptor_indexing. Fixes: `0e10790558` "radv: Enable VK_EXT_descriptor_indexing." Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `06c9d7f9f9`)	2019-05-08 16:40:18 -07:00
Juan A. Suarez Romero	f807986a38	spirv: add missing SPV_EXT_descriptor_indexing capabilities Add ShaderNonUniformEXT, UniformBufferArrayNonUniformIndexingEXT, SampledImageArrayNonUniformIndexingEXT, StorageBufferArrayNonUniformIndexingEXT, StorageImageArrayNonUniformIndexingEXT, InputAttachmentArrayNonUniformIndexingEXT, UniformTexelBufferArrayNonUniformIndexingEXT and StorageTexelBufferArrayNonUniformIndexingEXT capabilities. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `bbbe00a101`)	2019-05-08 16:40:18 -07:00
Hal Gentz	b66205a140	glx: Fix synthetic error generation in __glXSendError To quote Uli Schlachter, who understands this stuff more than I do: > The function __glXSendError() in mesa's src/glx/glx_error.c invents an X11 > protocol error out of thin air. For the sequence number it uses dpy->request. > This is the sequence number of the last request that was sent. _XError() will > then update dpy->last_request_read based on the sequence number of the error > that just "came in". > > If now another something comes in with a sequence number less than > dpy->last_request_read, since sequence numbers are monotonically increasing, > widen() will incorrectly add 1<<32 to the sequence number and things might go > downhill afterwards. `__glXSendErrorForXcb` was also patched, as that's the function that `glXCreateContextAttribsARB` actually uses. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=99781 Cc: mesa-stable@lists.freedesktop.org Fixes: `ad503c41` 'apple: Initial import of libGL for OSX from AppleSGLX svn repository' Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Hal Gentz <zegentzy@protonmail.com> (cherry picked from commit `e91ee763c3`)	2019-05-08 16:40:18 -07:00
Dylan Baker	e416ce522c	cherry-ignore: Add more backported patches	2019-05-08 16:40:18 -07:00
Jason Ekstrand	c8dbe93b0a	anv/descriptor_set: Destroy sets before pool finalization Fixes: `105002bd2d` "anv: destroy descriptor sets when pool gets..." Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-05-08 16:40:18 -07:00
Tapani Pälli	fa1bbc00a6	anv: expose VK_EXT_queue_family_foreign on Android VK_ANDROID_external_memory_android_hardware_buffer requires this extension. It is safe to enable it since currently aux usage is disabled for ahw buffers. Fixes following dEQP extension dependency test on Android: dEQP-VK.api.info.device#extensions Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `376c3e8f87`)	2019-05-08 16:40:18 -07:00
Samuel Pitoiset	cd9e4fbec1	radv: add missing VEGA20 chip in radv_get_device_name() Otherwise it returns "AMD RADV unknown". Cc: 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `07745f9494`)	2019-05-08 16:40:18 -07:00
Emil Velikov	d19148b7d9	vulkan/wsi: don't use DUMB_CLOSE for normal GEM handles Currently we get normal GEM handles from PrimeFDToHandle, yet we close then with DUMB_CLOSE. Use GEM_CLOSE instead. Fixes: `da997ebec9` ("vulkan: Add KHR_display extension using DRM [v10]") Cc: Jason Ekstrand <jason@jlekstrand.net> Cc: Keith Packard <keithp@keithp.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `bd0c4e360d`)	2019-05-08 16:40:18 -07:00
Emil Velikov	37a3a80c2e	vulkan/wsi: check if the display_fd given is master As effectively required by the extension, we need to ensure we're master Currently drivers employ vendor specific solutions, which check if the device behind the fd is capable, yet none of them do the master check. In the radv case, if acceleration is available. Instead of duplicating the check in each driver, keep it where it's needed and used. Note this copies libdrm's drmIsMaster() to avoid depending on bleeding edge version of the library. v2: set the fd to -1 if not master (Bas) Fixes: `da997ebec9` ("vulkan: Add KHR_display extension using DRM [v10]") Cc: Andres Rodriguez <andresx7@gmail.com> Cc: Jason Ekstrand <jason@jlekstrand.net> Cc: Keith Packard <keithp@keithp.com> Reported-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `c962a78f18`)	2019-05-08 16:40:18 -07:00
Dylan Baker	ef1f065f0a	cherry-ignore: Add a patch that was manually backported	2019-05-08 16:40:18 -07:00
Kenneth Graunke	d81cf30f95	i965: Ignore uniform storage for samplers or images, use binding info gl_nir_lower_samplers_as_deref creates new top level sampler and image uniforms which have been split from structure uniforms. i965 assumed that it could walk through gl_uniform_storage slots by starting at var->data.location and walking forward based on a simple slot count. This assumed that structure types were walked in a particular order. With samplers and images split out of structures, it becomes impossible to assign meaningful locations. Consider: struct S { sampler2D a; sampler2D b; } s[2]; The gl_uniform_storage locations for these follow this map: 0 => a[0], 1 => b[0], 2 => a[0], 3 => b[0]. But the new split variables look like: sampler2D lowered_a[2]; sampler2D lowered_b[2]; and there is no way to know that there's effectively a stride to get to the location for successive elements of a[] or b[]. So, working with location becomes effectively impossible. Ultimately, the point of looking at uniform storage was to pull out the bindings from the opaque index fields. gl_nir_lower_samplers_as_derefs can obtain this information while doing the splitting, however, and sets up var->data.binding to have the desired values. We move gl_nir_lower_samplers before brw_nir_lower_image_load_store so gl_nir_lower_samplers_as_derefs has the opportunity to set proper image bindings. Then, we make the uniform handling code skip sampler(-array) variables, and handle image param setup based on var->data.binding. Fixes Piglit tests/spec/glsl-1.10/execution/samplers/uniform-struct, this time without regressing dEQP-GLES2.functional.uniform_api.random.3. Fixes: `f003859f97` nir: Make gl_nir_lower_samplers use gl_nir_lower_samplers_as_deref Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (backported from commit `6981069fc8`, without prog_to_nir using derefs or texture bitfield gathering)	2019-05-08 16:40:18 -07:00
Kenneth Graunke	6b74d33928	glsl: Don't look at sampler uniform storage for internal vars Passes like nir_lower_drawpixels add additional sampler variables, and set an explicit binding which never changes. These extra samplers don't have proper uniform storage associated with them, and there is no way to update bindings via the API. So, for any 'hidden' variables, just trust that there's an explicit binding set. Reviewed-by: Eric Anholt <eric@anholt.net> (cherry picked from commit `529a0711c1`)	2019-05-08 16:40:18 -07:00
Kenneth Graunke	06ca5c0a64	glsl: Allow gl_nir_lower_samplers*() without a gl_shader_program I would like to be able to run gl_nir_lower_samplers() to turn texture and sampler variable dereferences into indexes and offsets, even for ARB programs, and built-in shaders. This would make sampler handling more consistent across the various types of shaders. For GLSL programs, the gl_nir_lower_samplers_as_deref() pass looks up the variable bindings in the shader program's uniform storage. But ARB programs and built-in shaders don't have a gl_shader_program, and uniform storage doesn't exist. In this case, we simply skip that lookup, and trust var->data.binding to be set correctly by whoever created the shader. Reviewed-by: Eric Anholt <eric@anholt.net> (cherry picked from commit `d34e434989`)	2019-05-08 16:40:18 -07:00
Marek Olšák	1a3478dc56	radeonsi/gfx9: rework the gfx9 scissor bug workaround (v2) Needed to track context rolls caused by streamout and ACQUIRE_MEM. ACQUIRE_MEM can occur outside of draw calls. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110355 v2: squashed patches and done more rework Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `440135e5a0`) Conflicts resolved by Dylan	2019-05-08 16:40:18 -07:00
Nicolai Hähnle	9f1d5d63ee	radeonsi: add si_debug_options for convenient adding/removing of options Move the definition of radeonsi_clear_db_cache_before_clear there, as well as radeonsi_enable_nir. This removes the AMD_DEBUG=nir option. We currently still have two places for options: the driconf machinery and AMD_DEBUG/R600_DEBUG. If we are to have a single place for options, then the driconf machinery should be preferred since it's more flexible. The only downside of the driconf machinery was that adding new options was quite inconvenient. With this change, a simple boolean option can be added with a single line of code, same as for AMD_DEBUG. One technical limitation of this particular implementation is that while almost all driconf features are available, the translation machinery doesn't pick up the description strings for options added in si_debvug_options. In practice, translations haven't been provided anyway, and this is intended for developer options, so I'm not too worried. It could always be added later if anybody really cares. v2: - use bool instead of uint8_t for options - si_debug_options.inc -> si_debug_options.h Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `8bef4df196`) autotools dist fixed by Dylan (not needed in master)	2019-05-08 16:40:10 -07:00
Marek Olšák	c15519b882	radeonsi/gfx9: set that window_rectangles always roll the context Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `bc0d924507`)	2019-04-25 13:38:50 -07:00
Timothy Arceri	a038a271f9	nir: fix nir_remove_unused_varyings() We were only setting the used mask for the first component of a varying. Since the linking opts split vectors into scalars this has mostly worked ok. However this causes an issue where for example if we split a struct on one side of the interface but not the other, then we can possibly end up removing the first components on the side that was split and then incorrectly remove the whole struct on the other side of the varying. With this change we simply mark all 4 components for each slot used by a struct. We could possibly make this more fine gained but that would require a more complex change. This fixes a bug in Strange Brigade on RADV when tessellation is enabled, all credit goes to Samuel Pitoiset for tracking down the cause of the bug. Fixes: `f1eb5e6399` ("nir: add component level support to remove_unused_io_vars()") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `b155f74d7b`) Conflicts resolved by Dylan	2019-04-25 13:38:10 -07:00
Jon Turney	a8aa18aa23	meson: Force '.so' extension for DRI drivers DRI driver loadable modules are always installed with install_megadriver.py with names ending with '.so', irrespective of platform. Force the name the loadable module is built with to match, so install_megadriver.py doesn't spin trying to remove non-existent symlinks. Fixes: `c77acc3c` "meson: remove meson-created megadrivers symlinks" (cherry picked from commit `5d310015c5`)	2019-04-25 13:24:26 -07:00
Ian Romanick	aa340450a0	glsl: Silence may unused parameter warnings in glsl/ir.h Every file that included glsl/ir.h had a warning like: src/compiler/glsl/ir.h: In member function ‘virtual bool ir_rvalue::is_lvalue(const _mesa_glsl_parse_state) const’: src/compiler/glsl/ir.h:236:64: warning: unused parameter ‘state’ [-Wunused-parameter] virtual bool is_lvalue(const struct _mesa_glsl_parse_state state = NULL) const ^ Cc: Samuel Pitoiset <samuel.pitoiset@gmail.com> Fixes: `fa4ebf6b8d` ("glsl: add _mesa_glsl_parse_state object to is_lvalue()") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `3b087f668f`)	2019-04-24 15:28:39 -07:00
Caio Marcelo de Oliveira Filho	d4bbf9f838	spirv: Handle SpvOpDecorateId This operation decorate with an Id instead of a Literal or String. It is used by HlslCounterBufferGOOGLE (provided by SPV_GOOGLE_hlsl_functionality1). Even if we don't do anything with that decoration, we must be able to parse SPIR-V that uses it. Fixes: `891886da2f` "spirv: Add no-op support for VK_GOOGLE_hlsl_functionality1" Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `7e2684ce01`)	2019-04-24 15:28:39 -07:00
Lionel Landwerlin	35970dea11	anv: fix argument name for vkCmdEndQuery Doesn't fix anything but it's not the right function prototype. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `673f33c77d` ("anv: Implement CmdBegin/EndQueryIndexed") Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> (cherry picked from commit `0fb0058f18`)	2019-04-24 15:28:39 -07:00
Lionel Landwerlin	08f3ce4c7d	anv: leave the top 4Gb of the high heap VMA unused In `628c9ca908` I forgot to apply the same -4Gb of the high address of the high heap VMA. This was previously computed in the HIGH_HEAP_MAX_ADDRESS. Many thanks to James for pointing this out. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by: Xiong, James <james.xiong@intel.com> Fixes: `628c9ca908` ("anv: store heap address bounds when initializing physical device") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `9e7b0988d6`)	2019-04-25 05:22:46 +08:00
Lionel Landwerlin	3a90f8df50	anv: store heap address bounds when initializing physical device We can then reuse those bounds to initialize the VMA heaps at logical device creation. This fixes an issue on EHL which has only 36bits of VMA. We were incorrectly using the fixed 48bits upper bound to initialize the logical device heap, resulting in addresses beyong the device's limits. v2: Don't confuse heap size (limited by system memory) and VMA size (limited by number of addressing bits the platform has) v3: Fix low heap vma_size :( (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by: James Xiong <james.xiong@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (v1) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (v2) (cherry picked from commit `628c9ca908`)	2019-04-25 05:22:46 +08:00
Dylan Baker	f163900f7c	docs: Add SHA256 sums for mesa 19.0.3	2019-04-24 10:44:27 -07:00
Dylan Baker	c8cdee5dc3	docs: add relnotes for 19.0.3	2019-04-24 10:39:04 -07:00
Dylan Baker	5cb685a3b8	Bump version for 19.0.3	2019-04-24 10:36:18 -07:00
Marek Olšák	44ddb884c8	radeonsi: use CP DMA for the null const buffer clear on CIK This is a workaround for a thread deadlock that I have no idea why it occurs. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108879 Fixes: `9b331e462e` Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `b58e5fb6f3`)	2019-04-24 08:50:50 -07:00
Samuel Pitoiset	ba1bf6c3ea	radv: do not load vertex attributes that are not provided by the pipeline Per the Vulkan spec this is definitely invalid but X4 Foundations does that and it ends up by hanging the GPU. Found while enabling validation layers with the game. The issue will be reported to the developers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit 381e38aaaa47c5aa38bc4f504b325fb68b7caea8)	2019-04-24 08:50:34 -07:00
Kenneth Graunke	f223fb98e9	Revert "glsl: Set location on structure-split sampler uniform variables" This reverts commit `9e0c744f07`, which regressed dEQP-GLES2.functional.uniform_api.random.3. It turns out that the newly produced location is meaningless and impossible to consume by drivers that want to look at gl_uniform_storage, so it's probably better to leave it unset (0) than a number that looks usable. Leave a tombstone^Wcomment to discourage the next person from making the obvious looking fix. See the next commit for a longer description of the problem. This breaks tests/spec/glsl-1.10/execution/samplers/uniform-struct on i965, which was originally fixed by the revert. The next commit will fix it again. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `47303b466c`)	2019-04-23 09:18:20 -07:00
Lubomir Rintel	91671ec1f4	gallivm: disable NEON instructions if they are not supported The LLVM project made some questionable decisions about defaults for armv7 (e.g. they enable NEON that is not there on NVIDIA and Marvell platforms). On top of that, getHostCPUFeatures() doesn't disable missing machine attributes. Finally, -neon alone is not sufficient to disable emmision of NEON instructions. Signed-off-by: Lubomir Rintel <lkundrak@v3.sk> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `e983a975c6`)	2019-04-23 09:17:58 -07:00
Lubomir Rintel	b509068164	gallivm: guess CPU features also on ARM getHostCPUFeatures() is also available on ARM, for even longer time than for x86. Use it -- it potentially enables instructions that may speed things up. Signed-off-by: Lubomir Rintel <lkundrak@v3.sk> Cc: <mesa-stable@lists.freedesktop.org> Closes: https://gitlab.freedesktop.org/mesa/mesa/merge_requests/518 Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `bc6bfc861f`)	2019-04-23 09:17:51 -07:00
Jason Ekstrand	2397f5d99d	anv: Add a #define for the max binding table size This also fixes a bug where we mis-calculate maximum binding table sizes and may return true in vkGetDescriptorSetLayoutSupport even for sets too large to fit in a binding table. Fixes: `ddc4069122` "anv: Implement VK_KHR_maintenance3" Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> (cherry picked from commit `a5a0dc08f1`)	2019-04-22 09:06:44 -07:00
Lionel Landwerlin	ac1ffeab1d	intel/devinfo: fix missing num_thread_per_eu on ICL There was an assumption that num_thread_per_eu would be set in the Gen8 features. Since this is mostly the same of all gen8->11 (except GEN9_LP that overwrites it) let's just factor it out. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable@lists.freedesktop.org Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Anuj Phogat anuj.phogat@gmail.com (cherry picked from commit `773e6aa9fd`)	2019-04-22 09:06:38 -07:00
Eric Anholt	229c4abde3	nir: Fix deref offset calculation for structs. We were calcuating the offset for the field within the struct, and just dropping it on the floor. Fixes a regression in KHR-GLES3.shaders.struct.local.nested_struct_array_dynamic_index_fragment and a few of its friends since the scratch lowering commit. Fixes: `e8e159e9df` ("nir/deref: Add helpers for getting offsets") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `9ac5ec2f90`)	2019-04-22 09:06:32 -07:00
Samuel Pitoiset	b5ea4378c3	ac/nir: only use the new raw/struct image atomic intrinsics with LLVM 9+ They are buggy with LLVM 8 because they weren't marked as source of divergence, see r358579. Fixes: `dd0172e865` ("radv: Use structured intrinsics instead of indexing workaround for GFX9.")" Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `31164cf5f7`)	2019-04-22 09:06:26 -07:00
Lionel Landwerlin	23abb7d310	anv: fix uninitialized pthread cond clock domain Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `843775bab7` ("anv: Rework fences") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `dfd79079da`)	2019-04-18 16:18:46 -07:00
Juan A. Suarez Romero	32e08b2397	meson: Add dependency on genxml to anvil genfiles This fixes a race condition where anv_gen_files are executed before genxml files, which causes a build failure v2: add dependency on idep_genxml (Lionel) Fixes: `d1992255bb` ("meson: Add build Intel "anv" vulkan driver") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `b74e605cf4`)	2019-04-17 12:47:45 -07:00
Danylo Piliaiev	bde36e0736	intel/compiler: Do not reswizzle dst if instruction writes to flag register If we write to the flag register changing the swizzle would change what channels are written to the flag register. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110201 Fixes: `4cd1a0be` Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: <ian.d.romanick@intel.com> (cherry picked from commit `04508f57d1`)	2019-04-16 09:43:38 -07:00
Chia-I Wu	3400359432	virgl: fix fence fd version check Fixes: `d1a1c21e76` ("virgl: native fence fd support") Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `c45c889f95`)	2019-04-16 09:43:32 -07:00
Roland Scheidegger	ce4b6974cd	gallivm: fix bogus assert in get_indirect_index 0 is a valid value as max index, and the code handles it fine. This isn't commonly seen, as it will only happen with array declarations of size 1. Fixes piglit tests/shaders/complex-loop-analysis-bug.shader_test Fixes: `a3c898dc97` "gallivm: fix improper clamping of vertex index when fetching gs inputs" Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110441 Reviewed-by: Brian Paul <brianp@vmware.com> (cherry picked from commit `88e0bbf24a`)	2019-04-16 09:43:25 -07:00
Bas Nieuwenhuizen	0ffd4c744d	ac: Move has_local_buffers disable to radeonsi. In radv we had a separate flag to actually use it + an env option to experimentally use it. The common code setting has_local_buffers to false of course broke that experimental option. Also the "enable on APU" did not make sense for RADV as it is still disabled by default. Fixes: `b21a4efb55` "radv/winsys: allow local BOs on APUs" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `af9534b9f3`)	2019-04-16 09:43:18 -07:00
Rhys Perry	77dbb70e5c	nir,ac/nir: fix cube_face_coord Seems it was missing the "/ ma + 0.5" and the order was swapped. Fixes: `a1a2a8dfda` ('nir: add AMD_gcn_shader extended instructions') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `8671cfe2a2`)	2019-04-16 09:43:10 -07:00
Andres Gomez	332da02f27	glsl/linker: location aliasing requires types to have the same width From the OpenGL 4.60.5 spec, section 4.4.1 Input Layout Qualifiers, Page 67, (Location aliasing): " Further, when location aliasing, the aliases sharing the location must have the same underlying numerical type and bit width (floating-point or integer, 32-bit versus 64-bit, etc.) and the same auxiliary storage and interpolation qualification." Additionally, we have improved the linker error descriptions. Specifically, when taking structs into account we were producing a linker error because we assumed that all components in each location were used and that would cause component aliasing. This is not accurate of the actual problem. Now, the failure specifies that the underlying numerical type incompatibility is the cause for the failure. Fixes the following piglit test: tests/spec/arb_enhanced_layouts/linker/component-layout/vs-to-fs-width-mismatch-double-float.shader_test v2: - Do not assert if we see invalid numerical types. These come straight from shader code, so we should produce linker errors if shaders attempt to do location aliasing on variables that are not numerical such as records. - While we are at it, improve error reporting for the case of numerical type mismatch to include the shader stage. v3: - Allow location aliasing of images and samplers. If we get these it means bindless support is active and they should be handled as 64-bit integers (Ilia) - Make sure we produce link errors for any non-numerical type for which we attempt location aliasing, not just structs. v4: - Rebased with minor fixes (Andres). - Added fixing tag to the commit log (Andres). v5: - Remove the helper function and check individually for the underlying numerical type and bit width (Timothy). - Implicitly, assume that any non-treated type which is checked for its underlying numerical type is either integer or float and has a defined bit width (Timothy). - Implicitly, assume that structs are the only non-treated non-numerical type (Timothy). - Improve the linker error descriptions and commit log (Andres). Fixes: `13652e7516` ("glsl/linker: Fix type checks for location aliasing") Cc: Ilia Mirkin <imirkin@alum.mit.edu> Cc: Timothy Arceri <tarceri@itsqueeze.com> Cc: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `75a3dd97aa`) [Andres Gomez: is_record() instead of is_struct() and brought glsl_base_type_get_bit_size] Signed-off-by: Andres Gomez <agomez@igalia.com>	2019-04-12 17:18:40 -07:00
Kenneth Graunke	2e63686268	glsl: Set location on structure-split sampler uniform variables gl_nir_lower_samplers_as_deref splits structure uniform variables, creating new variables for individual fields. As part of that, it calculates a new location. It then never set this on the new variables. Thanks to Michael Fiano for finding this bug. Fixes crashes on i965 with Piglit's new tests/spec/glsl-1.10/execution/samplers/uniform-struct test, which was reduced from the failing case in Michael's app. Fixes: `f003859f97` nir: Make gl_nir_lower_samplers use gl_nir_lower_samplers_as_deref Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `9e0c744f07`)	2019-04-12 13:36:49 -07:00
Jason Ekstrand	f9eaa873cf	anv/pipeline: Fix MEDIA_VFE_STATE::PerThreadScratchSpace on gen7 We were always programming it with the Broadwell convention which is too large by a factor of two on Haswell and just plain wrong on IVB and BYT. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `7eaaff18cb`)	2019-04-12 13:36:49 -07:00
Eric Engestrom	aacefed521	meson: remove meson-created megadrivers symlinks Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110356 Fixes: `aa7afe324c` "meson: strip rpath from megadrivers" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Tested-by: Mike Lothian <mike@fireburn.co.uk> Reviewed-by: Eric Anholt <eric@anholt.net> (cherry picked from commit `c77acc3ceb`)	2019-04-12 13:36:49 -07:00
Dylan Baker	d41acb4c9e	docs: Add sha256 sums for 19.0.2	2019-04-10 20:40:42 -07:00
Dylan Baker	2964ee3ad0	docs: Add release notes for 19.0.2	2019-04-10 20:34:09 -07:00
Dylan Baker	349759165c	VERSION: bump version for 19.0.2	2019-04-10 20:30:30 -07:00
Boyuan Zhang	20db3b0e46	st/va: reverse qt matrix back to its original order The quantiser matrix that VAAPI provides has been applied with inverse z-scan. However, what we expect in MPEG2 picture description is the original order. Therefore, we need to reverse it back to its original order. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110257 Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `d507bcdcf2`)	2019-04-09 08:36:40 -07:00
Lionel Landwerlin	57b7dbbb21	intel: add dependency on genxml generated files Drivers using genxml will start compilation before generated files are created, so add a dependency to it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `48e48b8560`) Conflicts resolved by Dylan Conflicts: src/gallium/drivers/iris/meson.build	2019-04-09 08:35:49 -07:00
Caio Marcelo de Oliveira Filho	b493686860	nir: Take if_uses into account when repairing SSA If a def is used as an condition before its definition, we should also consider this a case to repair. When repairing, make sure we rewrite any if conditions too. Found in while inspecting a SPIR-V conversion from a 'continue block' that contains a conditional branch. We pull the continue block up to the beggining of the loop, and the condition in the branch ends up defined afterwards. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Fixes: `364212f1ed` "nir: Add a pass to repair SSA form" (cherry picked from commit `c037dbb0ef`)	2019-04-08 09:30:03 -07:00
Eric Anholt	73bc3248f4	v3d: Don't try to use the TFU blit path if a scissor is enabled. We'll need to do a render-based blit for scissors, since the TFU (as seen in this conditional) can only update a whole surface. Fixes: `976ea90bdc` ("v3d: Add support for using the TFU to do some blits.") Fixes piglit fbo-scissor-blit. (cherry picked from commit `4c70f276bc`)	2019-04-05 09:08:03 -07:00
Eric Anholt	d1f4c96919	v3d: Bump the maximum texture size to 4k for V3D 4.x. 4.1 and 4.2 both have the same 16k limit, but it I'm seeing GPU hangs in the CTS at 8k and 16k. 4k at least lets us get one 4k display working. Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `62360e92ec`)	2019-04-05 09:07:57 -07:00
Eric Anholt	b7769cdfb7	dri3: Return the current swap interval from glXGetSwapIntervalMESA(). We were caching only the value set with glXSwapIntervalSGI(), missing out on the default setting of the swap interval by the loader. This fixes glxgears's warning about being vblank synchronized by default. Fixes: `9777c4234b` ("loader: drop the [gs]et_swap_interval callbacks") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `edc7deec42`)	2019-04-02 09:14:20 -07:00
Marek Olšák	e46e3bfd13	radeonsi: fix assertion failure by using the correct type src/gallium/drivers/radeonsi/si_state_viewport.c:196: si_emit_guardband: Assertion `vp_as_scissor.maxx <= max_viewport_size[vp_as_scissor.quant_mode] && vp_as_scissor.maxy <= max_viewport_size[vp_as_scissor.quant_mode]' failed. The comparison was unsigned, so negative maxx or maxy would fail. Fixes: `3c540e0a74` "radeonsi: Fix guardband computation for large render targets" (cherry picked from commit `3ad2a9b3fa`)	2019-04-01 09:47:45 -07:00
Leo Liu	a4d5161d42	radeon/vcn/vp9: search the render target from the whole list The number of render targets could be more than max of references, so we search the full list of the render pictures for the current render target index https://bugs.freedesktop.org/show_bug.cgi?id=109648 Signed-off-by: Leo Liu <leo.liu@amd.com> Tested-by: James Zhu <James.Zhu@amd.com> Acked-by: James Zhu<James.Zhu@amd.com> Cc: <mesa-stable@lists.freedesktop.org> (cherry picked from commit `d4e0fbc92f`)	2019-04-01 09:47:39 -07:00
Eric Engestrom	a1c30b8b78	meson: strip rpath from megadrivers More specifically, use the library file that has been post-processed by Meson when creating the hardlinks. Bugs: https://bugs.freedesktop.org/show_bug.cgi?id=108766 Fixes: `3218056e0e` "meson: Build i965 and dri stack" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> (cherry picked from commit `aa7afe324c`)	2019-04-01 09:47:34 -07:00
Karol Herbst	9987a3d448	nir/print: fix printing the image_array intrinsic index Fixes: `0de003be03` ("nir: Add handle/index-based image intrinsics") Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `6ffc72472c`)	2019-03-29 08:32:00 -07:00
Samuel Pitoiset	891c4ff633	radv: do not always initialize HTILE in compressed state Especially when performing a transtion from UNDEFINED->GENERAL, the driver shouldn't initialize HTILE metadata in compressed state because it doesn't decompress when the src layout is GENERAL. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110259 Fixes: `3a2e93147f` ("radv: always initialize HTILE when the src layout is UNDEFINED") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `62a9d757e6`)	2019-03-29 08:31:53 -07:00
Samuel Pitoiset	a175dffe84	radv: skip updating depth/color metadata for conditional rendering I don't think we should update metadata when conditional rendering is enabled. For some reasons, some CTS breaks only on SI. This fixes the following CTS on SI: dEQP-VK.conditional_rendering.draw_clear.clear.depth.* Cc: 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `6596eb2b30`)	2019-03-28 12:14:46 -07:00
Leo Liu	29bfb1af10	radeon/vcn: add H.264 constrained baseline support VCN supports this profile as well as UVD, so add it Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> CC: <mesa-stable@lists.freedesktop.org> (cherry picked from commit `f8ef8b56a6`)	2019-03-28 12:14:39 -07:00
Jason Ekstrand	dc6f00d53e	Revert "anv/radv: release memory allocated by glsl types during spirv_to_nir" This reverts commit `4e1bbb000c`. It turns out that some DXVK apps due to some implementation detail of DXVK or other create and destroy instances in an interleaved way. Freeing the glsl_type memory without being a bit more careful causes use-after-free issues. Looks like we need to try again. (cherry picked from commit `ce47999cee`)	2019-03-27 11:49:05 -07:00
Dylan Baker	ba3eb3c938	docs: Add SHA256 sums for mesa 19.0.1	2019-03-27 10:10:37 -07:00
Dylan Baker	08fbf25ce1	Add release notes for 19.0.1	2019-03-27 10:02:21 -07:00
Dylan Baker	499053e5d7	bump version for 19.0.1	2019-03-27 09:56:53 -07:00
Bas Nieuwenhuizen	bb66e61727	ac/nir: Return frag_coord as integer. To preserve the invariant that nir ssa defs are integers or pointers in LLVM. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `82075e3c42`)	2019-03-26 12:14:04 -07:00
Dylan Baker	964f7a7063	bin/install_megadrivers.py: Fix regression for set DESTDIR The previous patch tried to address a bug when DESTDIR is '', however, it introduces a bug when DESTDIR is not '', and fakeroot is used. This patch does fix that, and has been tested with the arch pkg-build to ensure it isn't regressed. Fixes: 093a1ade4e24b7dd701a093d30a71efd669fe9c8 ("bin/install_megadrivers.py: Correctly handle DESTDIR=''") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110221 Reviewed-by: Eric Engestrom <eric@engestrom.ch> (cherry picked from commit `ed96038e55`)	2019-03-25 09:44:28 -07:00
Dylan Baker	561fd519a7	bin/install_megadrivers.py: Correctly handle DESTDIR='' Currently if destdir is set to '' then the resulting libdir will have it's first character replaced by / instead of / being prepended to the string. This was the result of ensuring that that DESTDIR wouldn't be ignored if libdir was absolute, since the only cases that meson allows the libdir to be absolute is if the prefix is /, this won't be a problem. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110211 Fixes: `ae3f45c11e` ("bin/install_megadrivers: fix DESTDIR and -D*-path") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `4188dd7879`)	2019-03-25 09:43:24 -07:00
Józef Kucia	db6c05f5db	mesa: Fix GL_NUM_DEVICE_UUIDS_EXT Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (cherry picked from commit `1d996ef714`)	2019-03-22 10:44:58 -07:00
Tapani Pälli	a9a600f216	anv/radv: release memory allocated by glsl types during spirv_to_nir Fixes leaks for each glsl_type generated: ==32470== 384 bytes in 3 blocks are possibly lost in loss record 18 of 18 ==32470== at 0x483880B: malloc (vg_replace_malloc.c:309) ==32470== by 0x4C43F4A: ralloc_size (ralloc.c:119) ==32470== by 0x4C44014: rzalloc_size (ralloc.c:151) ==32470== by 0x4C44258: rzalloc_array_size (ralloc.c:215) ==32470== by 0x4D38957: glsl_type::glsl_type(glsl_struct_field const, unsigned int, char const) (glsl_types.cpp:114) ==32470== by 0x4D3BEED: glsl_type::get_struct_instance(glsl_struct_field const, unsigned int, char const) (glsl_types.cpp:1146) ==32470== by 0x4D42ECC: glsl_struct_type (nir_types.cpp:501) ==32470== by 0x4CDB5A1: vtn_handle_type (spirv_to_nir.c:1269) ==32470== by 0x4CE53DD: vtn_handle_variable_or_type_instruction (spirv_to_nir.c:4018) ==32470== by 0x4CD8CFF: vtn_foreach_instruction (spirv_to_nir.c:365) ==32470== by 0x4CE5E6B: spirv_to_nir (spirv_to_nir.c:4490) ==32470== by 0x497AF10: anv_shader_compile_to_nir (anv_pipeline.c:173) v2: move release call to vkDestroyInstance v3: apply fix also to radv driver Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `4e1bbb000c`)	2019-03-22 10:44:50 -07:00
Józef Kucia	96b0478c41	radv: Fix driverUUID Fixes: `14cad8786a` ("radv: generate the same driver UUID as radeonsi") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `c077d5d7de`)	2019-03-22 10:44:43 -07:00
Danylo Piliaiev	764131ff0a	glsl: Cross validate variable's invariance by explicit invariance only 'invariant' qualifier is propagated on variables which are used to calculate other invariant variables, however when we are matching variable's declarations we should take into account only explicitly declared invariance because invariance propagation is an implementation specific detail. Thus new flag is added to ir_variable_data which indicates 'invariant' qualifier being explicitly set in the shader. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100316 Fixes: `89b60492` ('glsl: Add a pass to propagate the "invariant" and "precise" qualifiers') Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> (cherry picked from commit `ea9bde151f`)	2019-03-22 10:44:25 -07:00
Dave Airlie	09f08a2fce	softpipe: fix texture view crashes I noticed we crashed piglit arb_texture_view-rendering-formats when run on softpipe. This fixes the clear tiles to use the surface format not the underlying storage format. This fixes a bunch of srgb piglits as well. Fixes: `396ac41fc2` (softpipe: add integer support) Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> (cherry picked from commit `04189565a0`)	2019-03-22 10:44:16 -07:00
Jason Ekstrand	d3941aa8e7	intel/nir: Lower array-deref-of-vector UBO and SSBO loads This fixes a serious performance issue with DXVK: https://github.com/doitsujin/dxvk/issues/937 This was caused by a recent change that to improve performance on RADV which back-fired on ANV and killed performance for some apps: `e5a06d3f4a` Throwing in this bit of lowering lets us come along and CSE those UBO loads (or copy-prop for SSBO load) and get one load where we previously would have gotten several. VkPipeline-db results on Kaby Lake: total instructions in shared programs: 5115361 -> 5073185 (-0.82%) instructions in affected programs: 1754333 -> 1712157 (-2.40%) helped: 5331 HURT: 63 total cycles in shared programs: 2544501169 -> 2481144545 (-2.49%) cycles in affected programs: 2531058653 -> 2467702029 (-2.50%) helped: 9202 HURT: 4323 total loops in shared programs: 3340 -> 3331 (-0.27%) loops in affected programs: 9 -> 0 helped: 9 HURT: 0 total spills in shared programs: 3246 -> 3053 (-5.95%) spills in affected programs: 384 -> 191 (-50.26%) helped: 10 HURT: 5 total fills in shared programs: 4626 -> 4452 (-3.76%) fills in affected programs: 439 -> 265 (-39.64%) helped: 10 HURT: 5 All of the shaders with hurt spilling were in Rise of the Tomb Raider which also had shaders solidly helped in the spilling department. Not shown in those results (because I've not had success dumping the shaders) is Witcher 3 where this reduces spilling and improves over-all perf by around 20-25%. There were no shader-db changes. Apparently, this just isn't a pattern that happens in OpenGL. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Cc: "19.0" mesa-stable@lists.freedesktop.org (cherry picked from commit `d3386e73c5`) Conflicts resolved by Dylan	2019-03-20 08:51:33 -07:00
Samuel Pitoiset	62b2aea3ee	radv: fix binding transform feedback buffers The mask should be accumulated if two calls are used for binding two buffers at different indexes. Otherwise, the driver only accounts for the last one. Noticed while glancing at this code. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `4fa61273a8`)	2019-03-20 08:44:50 -07:00
Andres Gomez	062d464c4c	Revert "glsl: relax input->output validation for SSO programs" This reverts commit `1aa5738e66`. This patch incorrectly asumed that for SSOs no inner interface matching check was needed. From the ARB_separate_shader_objects spec v.25: " With separable program objects, interfaces between shader stages may involve the outputs from one program object and the inputs from a second program object. For such interfaces, it is not possible to detect mismatches at link time, because the programs are linked separately. When each such program is linked, all inputs or outputs interfacing with another program stage are treated as active. The linker will generate an executable that assumes the presence of a compatible program on the other side of the interface. If a mismatch between programs occurs, no GL error will be generated, but some or all of the inputs on the interface will be undefined." This completes the fix from commit: `3be05dd267` ("glsl/linker: don't fail non static used inputs without matching outputs") Fixes: `1aa5738e66` ("glsl: relax input->output validation for SSO programs") Cc: Tapani Pälli <tapani.palli@intel.com> Cc: Timothy Arceri <tarceri@itsqueeze.com> Cc: Ilia Mirkin <imirkin@alum.mit.edu> Cc: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Cc: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `ab28dca033`)	2019-03-19 10:53:45 -07:00
Andres Gomez	33d331859a	glsl/linker: simplify xfb_offset vs xfb_stride overflow check Current implementation uses a complicated calculation which relies in an implicit conversion to check the integral part of 2 division results. However, the calculation actually checks that the xfb_offset is smaller or a multiplier of the xfb_stride. For example, while this is expected to fail, it actually succeeds: " ... layout(xfb_buffer = 2, xfb_stride = 12) out block3 { layout(xfb_offset = 0) vec3 c; layout(xfb_offset = 12) vec3 d; // ERROR, requires stride of 24 }; ... " Fixes: `2fab85aaea` ("glsl: add xfb_stride link time validation") Cc: Timothy Arceri <tarceri@itsqueeze.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `422882e78f`)	2019-03-19 10:53:40 -07:00
Andres Gomez	068e9a8f45	glsl/linker: don't fail non static used inputs without matching outputs If there is no Static Use of an input variable, the linker shouldn't fail whenever there is no defined matching output variable in the previous stage. From page 47 (page 51 of the PDF) of the GLSL 4.60 v.5 spec: " Only the input variables that are statically read need to be written by the previous stage; it is allowed to have superfluous declarations of input variables." Now, we complete this exception whenever the input variable has an explicit location. Previously, `18004c338f` ("glsl: fail when a shader's input var has not an equivalent out var in previous") took care of the cases in which the input variable didn't have an explicit location. v2: do the location based interface matching check regardless on whether it is a separable program or not (Ilia). Fixes: `1aa5738e66` ("glsl: relax input->output validation for SSO programs") Cc: Timothy Arceri <tarceri@itsqueeze.com> Cc: Iago Toral Quiroga <itoral@igalia.com> Cc: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Cc: Tapani Pälli <tapani.palli@intel.com> Cc: Ian Romanick <ian.d.romanick@intel.com> Cc: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `3be05dd267`)	2019-03-19 10:53:33 -07:00
Andres Gomez	1b4712719e	glsl: correctly validate component layout qualifier for dvec{3,4} From page 62 (page 68 of the PDF) of the GLSL 4.50 v.7 spec: " A dvec3 or dvec4 can only be declared without specifying a component." Therefore, using the "component" qualifier with a dvec3 or dvec4 should result in a compiling error. v2: enhance the error message (Timothy). Fixes: `94438578d2` ("glsl: validate and store component layout qualifier in GLSL IR") Cc: Timothy Arceri <tarceri@itsqueeze.com> Cc: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `a96093136b`)	2019-03-19 10:53:27 -07:00
Bas Nieuwenhuizen	da17740ea7	radv: Use correct image view comparison for fast clears. The if is actually returning true on success, enabling fast clears, so we need to have the test succeed when the iview dimensions are right. Fixes: `d5400a5ec2` "radv: provide a helper for comparing an image extents." Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `a777c3d7cb`)	2019-03-19 10:53:20 -07:00
Jason Ekstrand	cf2e4490c3	nir: Add a new pass to lower array dereferences on vectors This pass was originally written for lowering TCS output reads and writes but it is also applicable just about anything including UBOs, SSBOs, and shared variables. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> (cherry picked from commit `35b8f6f40b`)	2019-03-18 11:36:41 -07:00
Jason Ekstrand	fa137cd655	nir/builder: Add a vector extract helper This one's a tiny bit better than what we had in spirv_to_nir because it emits a binary tree rather than a linear walk. It also doesn't leave around unneeded bcsel instructions for a constant index and returns an undef for constant OOB access. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> (cherry picked from commit `fe9a6c0f14`)	2019-03-18 11:36:26 -07:00
Dylan Baker	12745f5dc0	cherry-ignore: Add commit that doesn't apply	2019-03-18 11:34:41 -07:00
Danylo Piliaiev	ddea2a99c5	anv: Treat zero size XFB buffer as disabled Vulkan spec doesn't explicitly forbid zero size transform feedback buffers. Having zero size xfb caused SurfaceSize overflow and triggered assert in debug build. The only way to have zero size SO_BUFFER is to disable SO_BUFFER as stated in hardware spec. From SKL PRM, Vol 2a, "3DSTATE_SO_BUFFER": "If set, stream output to SO Buffer is enabled, if 3DSTATE_STREAMOUT::SO Function ENABLE is also enabled. If clear, the SO Buffer is considered "not bound" and effectively treated as a zero- length buffer for the purposes of SO output and overflow detection. If an enabled stream's Stream to Buffer Selects includes this buffer it is by definition an overflow condition. That stream will cause no writes to occur, and only SO_PRIM_STORAGE_NEEDED[<stream>] will increment." Fixes: `36ee2fd61c` "anv: Implement the basic form of VK_EXT_transform_feedback" Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `ecb98c6898`)	2019-03-18 10:19:46 -07:00
Tapani Pälli	f028945c01	isl: fix automake build when sse41 is not supported Fixes: `864cc419eb` "intel/isl: move tiled_memcpy static libs from i965 to isl" Cc: mesa-stable@lists.freedesktop.org Reported-by: Milav Soni <milav.soni@teqdiligent.com> Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `a1cd0040b6`)	2019-03-18 10:19:41 -07:00
Mark Janes	6c7f03bb5b	mesa: properly report the length of truncated log messages _mesa_log_msg must provide the length of the string passed into the KHR_debug api. When the string formatted by _mesa_gl_vdebugf exceeds MAX_DEBUG_MESSAGE_LENGTH, the length is incorrectly set to the number of characters that would have been written if enough space had been available. Fixes: `3025680578` ("mesa: Add support for GL_ARB_debug_output with dynamic ID allocation.") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (cherry picked from commit `eb1a869a5d`)	2019-03-15 14:58:39 -07:00
Sergii Romantsov	ee18a3ec10	d3d: meson: do not prefix user provided d3d-drivers-path The user can select the location where there d3d drivers are installed by the d3d-drivers-path meson option. By default path will be $prefix/$libdir/d3d. Currently we add $prefix to the user provided path. Resulting in an incorrect or even missing path. Based on logic of Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109698 CC: Kenneth Graunke <kenneth@whitecape.org> CC: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `dcc4866419`)	2019-03-14 10:38:20 -07:00
Samuel Pitoiset	06787d23cb	radv: always initialize HTILE when the src layout is UNDEFINED HTILE should always be initialized when transitioning from VK_IMAGE_LAYOUT_UNDEFINED to other image layouts. Otherwise, if an app does a transition from UNDEFINED to GENERAL, the driver doesn't initialize HTILE and it tries to decompress the depth surface. For some reasons, this results in VM faults. Cc: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107563 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `3a2e93147f`)	2019-03-14 09:50:51 -07:00
Plamena Manolova	35029d4361	i965: Disable ARB_fragment_shader_interlock for platforms prior to GEN9 ARB_fragment_shader_interlock depends on memory fences to ensure fragment ordering and this ordering guarantee is only supported from GEN9 onwards. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109980 Fixes: `939312702e` "i965: Add ARB_fragment_shader_interlock support." Signed-off-by: Plamena Manolova <plamena.n.manolova@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `19ab082001`)	2019-03-14 09:50:44 -07:00
Jason Ekstrand	c4f8fb1749	anv/pass: Flag the need for a RT flush for resolve attachments Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `489bf2de23`)	2019-03-14 09:50:39 -07:00
Kevin Strasser	0dd88cf9ae	egl/dri: Avoid out of bounds array access indexConfigAttrib iterates over every index in the dri driver, possibly exceeding __DRI_ATTRIB_MAX. In other words, if the dri driver has newer attributes libEGL will end up reading from uninitialized memory through dri2_to_egl_attribute_map[]. Signed-off-by: Kevin Strasser <kevin.strasser@intel.com> Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `70b36c0ef9`)	2019-03-13 14:26:36 -07:00
Jason Ekstrand	3a18f13ba5	glsl/list: Add a list variant of insert_after Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> (cherry picked from commit `20c4578c55`)	2019-03-13 14:26:36 -07:00
Jason Ekstrand	95b001cb19	glsl/lower_vector_derefs: Don't use a temporary for TCS outputs Tessellation control shader outputs act as if they have memory backing them and you can have multiple writes to different components of the same vector in-flight at the same time. When this happens, the load vec store pattern that gets used by ir_triop_vector_insert doesn't yield the correct results. Instead, just emit a sequence of conditional assignments. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `bd17bdc56b`)	2019-03-13 14:26:36 -07:00
Kenneth Graunke	f2e5ca1d81	intel/fs: Fix opt_peephole_csel to not throw away saturates. We were not copying the saturate bit from the original instruction to the new replacement instruction. This caused major misrendering in DiRT Rally on iris, where comparisons leading to discards failed due to the missing saturate, causing lots of extra garbage pixels to be drawn in text rendering, trees, and so on. This did not show up on i965 because st/nir performs a more aggressive version of nir_opt_peephole_select, yielding more b32csel operations. Fixes: `52c7df1643` i965/fs: Merge CMP and SEL into CSEL on Gen8+ Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `3570d15b6d`)	2019-03-13 14:26:36 -07:00
Eric Anholt	f953d0f52f	v3d: Fix leak of the renderonly struct on screen destruction. This makes v3d match vc4's destroy path. Fixes: `e113b21cb7` ("v3d: Add renderonly support.") (cherry picked from commit `486b181fd7`)	2019-03-13 14:26:36 -07:00
Samuel Pitoiset	93386fbc5e	radv: set the maximum number of IBs per submit to 192 This fixes random SteamVR corruption, see https://github.com/ValveSoftware/SteamVR-for-Linux/issues/181 Fixes: `4d30f2c6f4` ("radv/winsys: remove the max IBs per submit limit for the fallback path") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `ae77f12368`)	2019-03-13 14:26:36 -07:00
Dylan Baker	142e37ab34	docs: Add SHA256 sums for 19.0.0	2019-03-13 12:09:08 -07:00
Dylan Baker	5925a57258	docs: Add release notes for 19.0.0	2019-03-13 10:40:19 -07:00
Dylan Baker	f4bc129c09	VERSION: bump for 19.0.0 release	2019-03-13 10:39:47 -07:00
Danylo Piliaiev	3147af410d	anv: Fix destroying descriptor sets when pool gets reset pool->next and pool->free_list were reset before their usage in anv_descriptor_pool_free_set Fixes: 775aabdd "anv: destroy descriptor sets when pool gets reset" Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `9c80be956f`)	2019-03-12 15:10:14 -07:00
Juan A. Suarez Romero	ffeb946c8a	anv: destroy descriptor sets when pool gets reset As stated in Vulkan spec: "Resetting a descriptor pool recycles all of the resources from all of the descriptor sets allocated from the descriptor pool back to the descriptor pool, and the descriptor sets are implicitly freed." This fixes dEQP-VK.api.descriptor_pool.* Fixes: `14f6275c92` "anv/descriptor_set: add reference counting for..." Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Clayton Craft <clayton.a.craft@intel.com> (cherry picked from commit `775aabdd01`)	2019-03-12 15:10:14 -07:00
Tapani Pälli	ef8a38e0e6	anv: destroy descriptor sets when pool gets destroyed Patch maintains a list of sets in the pool and destroys possible remaining sets when pool is destroyed. As stated in Vulkan spec: "When a pool is destroyed, all descriptor sets allocated from the pool are implicitly freed and become invalid." This fixes memory leaks spotted with valgrind: ==19622== 96 bytes in 1 blocks are definitely lost in loss record 2 of 3 ==19622== at 0x483880B: malloc (vg_replace_malloc.c:309) ==19622== by 0x495B67E: default_alloc_func (anv_device.c:547) ==19622== by 0x4955E05: vk_alloc (vk_alloc.h:36) ==19622== by 0x4956A8F: anv_multialloc_alloc (anv_private.h:538) ==19622== by 0x4956A8F: anv_CreateDescriptorSetLayout (anv_descriptor_set.c:217) Fixes: `14f6275c92` ("anv/descriptor_set: add reference counting for descriptor set layouts") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `105002bd2d`) Conflicts resolved by Dylan and Jason	2019-03-12 15:10:02 -07:00
Jason Ekstrand	8d43691b26	intel/nir: Vectorize all IO The IO scalarization pass that we run to help with linking end up turning some shader I/O such as that for tessellation and geometry shaders into many scalar URB operations rather than one vector one. To alleviate this, we now vectorize the I/O once again. This fixes a 10% performance regression in the GfxBench tessellation test that was caused by scalarizing. Shader-db results on Kaby Lake: total instructions in shared programs: 15224023 -> 15220871 (-0.02%) instructions in affected programs: 342009 -> 338857 (-0.92%) helped: 1236 HURT: 443 total spills in shared programs: 23471 -> 23465 (-0.03%) spills in affected programs: 6 -> 0 helped: 1 HURT: 0 total fills in shared programs: 31770 -> 31766 (-0.01%) fills in affected programs: 4 -> 0 helped: 1 HURT: 0 Cycles was just a lot of churn do to moves being different places. Most of the pure churn in instructions was +/- one or two instructions in fragment shaders. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107510 Fixes: `4434591bf5` "intel/nir: Call nir_lower_io_to_scalar_early" Fixes: `8d8222461f` "intel/nir: Enable nir_opt_find_array_copies" Reviewed-by: Connor Abbott <cwabbott0@gmail.com> (cherry picked from commit `6d5d89d25a`)	2019-03-12 09:27:29 -07:00
Jason Ekstrand	47db151b9b	nir: Add a pass for lowering IO back to vector when possible This pass tries to turn scalar and array-of-scalar IO variables into vector IO variables whenever possible. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Cc: "19.0" <mesa-stable@lists.freedesktop.org> (cherry picked from commit `5ef2b8f1f2`)	2019-03-12 09:27:29 -07:00
pal1000	1cf66fa64e	scons: Compatibility with Scons development version string This ensures Mesa3D build doesn't fail in this case as encountered when bisecting Scons source code while regression testing https://bugs.freedesktop.org/show_bug.cgi?id=109443 and when testing 3.0.5.a.2 Technical details: Scons version string has consistently been in this format: MajorVersion.MinorVersion.Patch[.alpha/beta.yyyymmdd] so these formulas should strip alpha/beta flags and return Scons version: - as string - `'.'.join(SCons.__version__.split('.')[:3])` - as tuple of integers - `tuple(map(int, SCons.__version__.split('.')[:3]))` - v2: Fixed Scons version retrieval formulas as string and tuple of integers. - v3: Fixed Scons version string format description. Cc: "19.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> (cherry picked from commit `7f89fd17ed`)	2019-03-12 09:27:29 -07:00
Tapani Pälli	7f86fbc2a9	anv: revert "anv: release memory allocated by glsl types during spirv_to_nir" This reverts commit `47fc359822`. Reason is that patch did not take in to account situation where we might have both OpenGL and Vulkan using glsl_types at the same time. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `bef354321b`)	2019-03-12 09:27:29 -07:00
Samuel Pitoiset	0fe48675e5	radv: fix pointSizeRange limits The values should match the ones that are emitted. This fixes new CTS dEQP-VK.rasterization.primitive_size.points.*. Fixes: `f4e499ec79` ("radv: add initial non-conformant radv vulkan driver") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `6403171843`)	2019-03-12 09:27:29 -07:00
Tapani Pälli	f763433180	anv: release memory allocated by glsl types during spirv_to_nir Fixes leaks for each glsl_type generated: ==32470== 384 bytes in 3 blocks are possibly lost in loss record 18 of 18 ==32470== at 0x483880B: malloc (vg_replace_malloc.c:309) ==32470== by 0x4C43F4A: ralloc_size (ralloc.c:119) ==32470== by 0x4C44014: rzalloc_size (ralloc.c:151) ==32470== by 0x4C44258: rzalloc_array_size (ralloc.c:215) ==32470== by 0x4D38957: glsl_type::glsl_type(glsl_struct_field const, unsigned int, char const) (glsl_types.cpp:114) ==32470== by 0x4D3BEED: glsl_type::get_struct_instance(glsl_struct_field const, unsigned int, char const) (glsl_types.cpp:1146) ==32470== by 0x4D42ECC: glsl_struct_type (nir_types.cpp:501) ==32470== by 0x4CDB5A1: vtn_handle_type (spirv_to_nir.c:1269) ==32470== by 0x4CE53DD: vtn_handle_variable_or_type_instruction (spirv_to_nir.c:4018) ==32470== by 0x4CD8CFF: vtn_foreach_instruction (spirv_to_nir.c:365) ==32470== by 0x4CE5E6B: spirv_to_nir (spirv_to_nir.c:4490) ==32470== by 0x497AF10: anv_shader_compile_to_nir (anv_pipeline.c:173) v2: move release call to vkDestroyInstance Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `47fc359822`)	2019-03-12 09:27:29 -07:00
Eric Anholt	ab43e60540	st/dri: Set the PIPE_BIND_SHARED flag on create_image_with_modifiers. With createImage(), the caller was expected to set a SHARED flag if they needed the ability to get a GEM handle. DRI3, wayland, and gbm all set it, EGL_MESA_drm_image passes it through, and surfaceless doesn't need it because there's no way to request a handle. With the new createImageWithModifiers() DRI method to replace it, the expectation is that you'll always be able to share the buffer, so the flag is unnecessary in its arguments. However, we do need to tell gallium about this expectation. Without this, kmscube's modifiers path using gbm_bo_create_with_modifiers(&modifier, 1) instead of gbm_bo_create(SCANOUT \| SHARED) will call the driver's resource_create() function wtih PIPE_BIND_SHARED unset, so the driver (particularly renderonly drivers) may allocate in such a way that it can't return an answer from gbm_bo_get_handle(). I used to have a hack in v3d using count==1 && modifier==LINEAR to indicate that you wanted SHARED anyway, but that was dropped recently. Fixes: `59527a36e9` ("v3d: Restructure RO allocations using resource_from_handle.") Reviewed-by: Kristian H. Kristensen <hoegsberg@chromium.org> (cherry picked from commit `fafead7b62`)	2019-03-12 09:27:29 -07:00
Brian Paul	4119f81c9d	svga: remove SVGA_RELOC_READ flag in SVGA3D_BindGBSurface() This fixes a rendering issue where UBO updates aren't always picked up by drawing calls. This issue effected the Webots robotics simulator. VMware bug 2175527. Testing Done: Webots replay, piglit, misc Linux games Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> (cherry picked from commit `d4381cf593`)	2019-03-12 09:27:29 -07:00
Jason Ekstrand	750e7f89ce	anv: Take references to push descriptor set layouts Technically, descriptor set layouts aren't required to survive past the function they're passed into so we need to reference them. Cc: "19.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-03-12 09:27:29 -07:00
Jason Ekstrand	5432448878	anv: Refactor descriptor pushing a bit Pull the common code out of the two entrypoints into the helper which fetches the push descriptor set for us. Now that it does more than just get a thing, call it anv_cmd_buffer_push_descriptor_set. Cc: "19.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-03-12 09:27:29 -07:00
Erik Faye-Lund	7c171da284	virgl: remove unused variable This variable is now unused, so let's remove it. Fixes: `db77573d7b` (virgl: modify how we handle GL_MAP_FLUSH_EXPLICIT_BIT) Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org> (cherry picked from commit `44620d4ef7`)	2019-03-08 09:11:31 -08:00
Dylan Baker	8ab2fc8c96	cherry-ignore: Update the cherry-ignore file	2019-03-07 09:15:59 -08:00
Ian Romanick	4621c21d54	intel/fs: Fix extract_u8 of an odd byte from a 64-bit integer In the old code, we would generate the exact same instruction for extract_u8(some_u64, 0) and extract_u8(some_u64, 1). The mask-a-word trick only works for even numbered bytes. This fixes the (new) piglit test tests/spec/arb_gpu_shader_int64/execution/fs-ushr-and-mask.shader_test. v2: Use a SHR instead of an AND. This saves an instruction compared to using two moves. Suggested by Jason. Fixes: `6ac2d16901` ("i965/fs: Fix extract_i8/u8 to a 64-bit destination") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `55e6454d5e`)	2019-03-07 09:09:26 -08:00
Ian Romanick	1cae37f6d3	intel/fs: nir_op_extract_i8 extracts a byte, not a word Fixes: `6ac2d16901` ("i965/fs: Fix extract_i8/u8 to a 64-bit destination") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `4aaf139ea4`)	2019-03-07 09:09:20 -08:00
Jason Ekstrand	0bc0319823	spirv: Pull offset/stride from the pointer for OpArrayLength We can't pull it from the variable type because it might be an array of blocks and not just the one block. While we're here, throw in some error checking. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `f1dbc7e97d`)	2019-03-07 09:09:15 -08:00
Dylan Baker	f852e93815	VERSION: bump version for rc7	2019-03-06 09:22:20 -08:00
Samuel Pitoiset	7c22742731	radv: properly align the fence and EOP bug VA on GFX9 If alignement is 0, offets returned by radv_cmd_buffer_upload_alloc() are always 0. These two virtual addresses were pointing at the same location. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `c2a148692b`)	2019-03-05 12:03:05 -08:00
Eric Engestrom	d7bea557d3	egl: fix libdrm-less builds This function was never used, and isn't properly guarded by HAVE_LIBDRM, breaking the build on systems that don't have libdrm. Let's just remove it. Fixes: `7552fcb7b9` "egl: add base EGL_EXT_device_base implementation" Reported-by: Timo Aaltonen <tjaalton@debian.org> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `bcc4bfc8e8`)	2019-03-05 12:03:05 -08:00
Tapani Pälli	503746b131	anv: retain the is_array state in create_plane_tex_instr_implicit This does not seem to fix anything ATM but is the right thing todo. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Fixes: `f3e91e78a3` ("anv: add nir lowering pass for ycbcr textures") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `33bf3d510c`)	2019-03-05 12:03:05 -08:00
Jason Ekstrand	b6324bdfa5	intel,nir: Lower TXD with min_lod when the sampler index is not < 16 When we have a larger sampler index, we get into the "high sampler" scenario and need an instruction header. Even in SIMD8, this pushes the instruction over the sampler message size maximum of 11 registers. Instead, we have to lower TXD to TXL. Fixes: `cb98e0755f` "intel/fs: Support min_lod parameters on texture..." Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `5c96120b5c`)	2019-03-05 12:03:05 -08:00
Jason Ekstrand	8af5448418	spirv: OpImageQueryLod requires a sampler No idea how this fell through the cracks besides the fact that the sampler bound at 0 almost always works and the CTS isn't amazing. In any case, this appears to have been broken for almost forever. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `ca295ddbfb`)	2019-03-05 12:03:05 -08:00
Jason Ekstrand	014588440b	anv: Count surfaces for non-YCbCr images in GetDescriptorSetLayoutSupport We were accidentally not counting those surfaces Fixes: `ddc4069122` "anv: Implement VK_KHR_maintenance3" Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `5049fbddb4`)	2019-03-05 12:03:05 -08:00
Axel Davy	3342431606	st/nine: Ignore multisample quality level if no ms Apparently instead of returning error when passing a quality level different than 0 for D3DMULTISAMPLE_NONE, we should pass. Fixes: https://github.com/iXit/Mesa-3D/issues/340 Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Axel Davy <davyaxel0@gmail.com> (cherry picked from commit `1d363d440f`)	2019-03-05 12:03:05 -08:00
Axel Davy	eb159aab8a	st/nine: Ignore window size if error Check GetWindowInfo and ignore the computed sizes if there is an error. Fixes a regression caused by earlier commit when using old wine gallium nine patches. Should also address a crash at window destruction. Related issues: https://github.com/iXit/Mesa-3D/issues/331 https://github.com/iXit/Mesa-3D/issues/332 Cc: mesa-stable@lists.freedesktop.org Fixes: `2318ca68bb` ("st/nine: Handle window resize when a presentation buffer is used") Signed-off-by: Axel Davy <davyaxel0@gmail.com> (cherry picked from commit `86666f051e`)	2019-03-05 12:03:05 -08:00
Mauro Rossi	682491f447	android: anv: fix libexpat shared dependency Fixes undefined reference building errors for XML_* functions Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Cc: "19.0" <mesa-stable@lists.freedesktop.org> (cherry picked from commit `ec0f465bc5`)	2019-03-05 12:03:05 -08:00
Mauro Rossi	949d6c3544	android: anv: fix generated files depedencies (v2) Fix anv_extrypoints.{c,h} and anv_extensions.{c,h} missing dependencies Rename the variable labels according to targets and python scripts Align the building rules as per Automake for simplification Fixes building errors during rebuils due to missing dependencies (v2) Fixed a missing $(VULKAN_API_XML) reference Fixes: `9a508b7` ("android: anv/extensions: fix generated sources build") Fixes: `dd088d4bec` ("anv/extensions: Generate a header file with extension tables") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Cc: "19.0" <mesa-stable@lists.freedesktop.org> (cherry picked from commit `14e7e26a09`)	2019-03-05 12:03:05 -08:00
Ilia Mirkin	9bdd64a12d	glsl: fix recording of variables for XFB in TCS shaders This is purely for conformance, since it's not actually possible to do XFB on TCS output varyings. However we do have to make sure we record the names correctly, and this removes an extra level of array-ness from the names in question. Fixes KHR-GL45.tessellation_shader.single.xfb_captures_data_from_correct_stage v2: Add comment to the new program_resource_visitor::process function. (Ilia Mirkin) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108457 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `4eec3a2a36`)	2019-03-05 12:03:05 -08:00
Jose Maria Casanova Crespo	688ce72518	glsl: TCS outputs can not be transform feedback candidates on GLES Avoids regression on: KHR-GLES*.core.tessellation_shader.single.xfb_captures_data_from_correct_stage that is uncovered by the following patch. "glsl: fix recording of variables for XFB in TCS shaders" v2: Rebased over glsl: fix recording of variables for XFB in TCS shaders v3: Move this patch before "glsl: fix recording of variables for XFB in TCS shaders" to avoid temporal regressions. (Illia Mirkin) Cc: 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `bf1f49482d`)	2019-03-05 12:03:05 -08:00
Ray Zhang	8bda94e85a	glx: fix shared memory leak in X11 call XShmDetach to allow X server to free shared memory Fixes: `bcd80be49a` "drisw/glx: use XShm if possible" Signed-off-by: Ray Zhang <zhanglei002@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `b344e32cdf`)	2019-03-05 12:03:05 -08:00
Timothy Arceri	6b067b8dd0	glsl: fix shader cache for packed param list Some types of params such as some builtins are always padded. We need to keep track of this so we can restore the list correctly. Here we also remove a couple of cache entries that are not actually required as they get rebuilt by the _mesa_add_parameter() calls. This patch fixes a bunch of arb_texture_multisample and arb_sample_shading piglit tests for the radeonsi NIR backend. Fixes: `edded12376` ("mesa: rework ParameterList to allow packing") Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `7536af670b`)	2019-03-05 12:03:05 -08:00
Yevhenii Kolesnikov	578716cbf7	i965: Fix allow_higher_compat_version workaround limited by OpenGL 3.0 Added check for higher compat profile being allowed before assigning certain extensions. Fixes: `272fe94942` (mesa: enable ARB_texture_buffer_* extensions in the Compatibility profile) Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Signed-off-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107052 (cherry picked from commit `07f4b4e403`)	2019-03-05 12:03:05 -08:00
Lionel Landwerlin	77ae9a3a42	intel/compiler: use correct swizzle for replacement The optimization in `4cd1a0be76` introduced a replacement of : cmp(8).z.f0.0 vgrf11.y:D, vgrf10.xxxx:D, vgrf2.xyyy:D ... cmp(8).nz.f0.0 null.x:D, vgrf11.yyyy:D, 0D By : cmp(8).z.f0.0 vgrf15.x:D, vgrf10.xxxx:D, vgrf2.yyyy:D ... mov(8) vgrf11.y:D, vgrf15.yyyy:D The first cmp instruction is storing in x while the second mov is sourcing from y. We need to take into account where the replacement on the scan_inst destination is going to store thing so that the replacement mov can source things from the correct location. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4cd1a0be76` ("i965/vec4: Propagate conditional modifiers from more compares to other compares") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109759 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `6e184147dd`)	2019-03-05 12:03:05 -08:00
Jonathan Marek	4b8df622c4	freedreno: use renderonly path for buffers allocated with modifiers Now that freedreno has create_with_modifiers(), this "hack" is needed to make some cases work. Copied from vc4. Fixes: `41ddf1d1` Signed-off-by: Jonathan Marek <jonathan@marek.ca> (cherry picked from commit `e3591b0339`)	2019-03-05 12:03:05 -08:00
Jonathan Marek	548893d5f5	freedreno: a2xx: fix mipmapping for NPOT textures Fixes: `3a273a4a` Signed-off-by: Jonathan Marek <jonathan@marek.ca> (cherry picked from commit `6c0fefb448`)	2019-03-05 12:03:05 -08:00
Jonathan Marek	84effbe02c	freedreno: a2xx: fix fast clear for some gmem configurations In freedreno_gmem.c, gmem_align of 0x8000 is used. Alignment used here should be the same. Fixes: `912a9c8d` Signed-off-by: Jonathan Marek <jonathan@marek.ca> (cherry picked from commit `4f23767590`)	2019-03-05 12:03:05 -08:00
Jonathan Marek	00fd6cf1e2	freedreno: a2xx: add use_hw_binning function Fixes: `cb2322c7` Signed-off-by: Jonathan Marek <jonathan@marek.ca> (cherry picked from commit `8eca6df5ed`)	2019-03-05 12:03:05 -08:00
Jonathan Marek	04fd26cd29	freedreno: a2xx: don't write 4th vertex in mem2gmem There is only room for 3 vertices now (RECT has 3 vertices). Fixes: `6ef7700a` Signed-off-by: Jonathan Marek <jonathan@marek.ca> (cherry picked from commit `357313ab0f`)	2019-03-05 12:03:05 -08:00
Jose Fonseca	17361a3be9	scons: Workaround failures with MSVC when using SCons 3.0.[2-4]. This change applies the workaround suggested by Bill Deegan on the affected SCons versions. It also adds a comment with the URL explaining why we were using customizing the decider and max_drift in the first place, as I had forgotten all about it. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109443 Tested-by: liviuprodea@yahoo.com Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2019-02-28 21:28:22 +00:00
Brian Paul	f3454823d1	mesa: fix display list corner case assertion This fixes a failed assertion in glDeleteLists() for the following case: list = glGenLists(1); glDeleteLists(list, 1); when those are the first display list commands issued by the application. When we generate display lists, we plug in empty lists created with the make_list() helper. This function uses the OPCODE_END_OF_LIST opcode but does not call dlist_alloc() which would set the InstSize[OPCODE_END_OF_LIST] element to non-zero. When the empty list was deleted, we failed the InstSize[opcode] > 0 assertion. Typically, display lists are created with glNewList/glEndList so we set InstSize[OPCODE_END_OF_LIST] = 1 in dlist_alloc(). That's why this bug wasn't found before. To fix this failure, simply initialize the InstSize[OPCODE_END_OF_LIST] element in make_list(). The game oolite was hitting this. Fixes: https://github.com/OoliteProject/oolite/issues/325 Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `6dabcb5bcf`)	2019-02-27 09:03:31 -08:00
Bas Nieuwenhuizen	62d457eee1	radv: Interpolate less aggressively. Seems like dxvk used integer builtins without setting the flat interpolation decoration. I believe in the current spec the app is required to set these, but in the meantime to avoid breaking things in stable releases (and so close to release for 19.0), only expand the interpolation to float16 and struct (which cannot be builtins as our spirv parser lowers the builtin block). Fixes: `f324784104` "radv: Allow interpolation on non-float types." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `c0110477b5`)	2019-02-27 09:02:32 -08:00
Dylan Baker	3e72463ce0	Bump version for rc6	2019-02-26 16:35:55 -08:00
Samuel Pitoiset	efe69eb1ea	radv: fix clearing attachments in secondary command buffers If no framebuffer is bound, get the number of samples and the image format from the render pass. This fixes new CTS dEQP-VK.geometry.layered.*.secondary_cmd_buffer. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `5671f38085`) Conflicts resolved by Dylan Conflicts: src/amd/vulkan/radv_meta_clear.c	2019-02-26 11:43:10 -08:00
Eleni Maria Stea	0cf3c5f2f3	i965: fixed clamping in set_scissor_bits when the y is flipped Calculating the scissor rectangle fields with the y flipped (0 on top) can generate negative values that will cause assertion failure later on as the scissor fields are all unsigned. We must clamp the bbox values again to make sure they don't exceed the fb_height. Also fixed a calculation error. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108999 https://bugs.freedesktop.org/show_bug.cgi?id=109594 v2: - I initially clamped the values inside the if (Y is flipped) case and I made a mistake in the calculation: the clamp of the bbox[2] should be a check if (bbox[2] >= fbheight) bbox[2] = fbheight - 1 instead and I shouldn't have changed the ScissorRectangleYMax calculation. As the fixed code is equivalent with using CLAMP instead of MAX2 at the top of the function when bbox[2] and bbox[3] are calculated, and the 2nd is more clear, I replaced it. (Nanley Chery) v3: - Reversed the CLAMP change in bbox[3] as the API guarantees that the viewport height is positive. (Nanley Chery) v4: - Added nomination for the mesa-stable branch and the link to the second bugzilla bug (Nanley Chery) CC: <mesa-stable@lists.freedesktop.org> Tested-by: Paul Chelombitko <qamonstergl@gmail.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> (cherry picked from commit `fd37a19ac4`)	2019-02-26 10:41:27 -08:00
Emil Velikov	df01e77d64	meson: egl: correctly manage loader/xmlconfig Earlier commit introduced support for haiku yet did not properly annotate the loader/xmlconfig dependencies. Thus we ended up adding inc_loader for each !haiku platform - see `659910eda0` `9a96bf0ecd` `c731508b98` `ec6cb01e21`. One piece remained though - the wayland platform. Hence the following would fail: meson -Dgallium-drivers=etnaviv -Ddri-drivers=''\ -Dtools=etnaviv -Dplatforms=wayland -Dglx=disabled \ build/ Cc: Alexander von Gluck IV <kallisti5@unixzen.com> Reported-by: Boris Brezillon <boris.brezillon@collabora.com> Fixes: `834d221512` ("meson: Add Haiku platform support v4") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Tested-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> (cherry picked from commit `f0a7b463b5`)	2019-02-26 10:41:18 -08:00
Samuel Pitoiset	e32fb87fd0	radv: don't copy buffer descriptors list for samplers Sampler descriptors don't have a buffer list. This fixes some crashes with new CTS dEQP-VK.binding_model.descriptor_copy..sampler_. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `4924dfc851`)	2019-02-26 10:41:09 -08:00
Samuel Pitoiset	b2b148eb32	radv: fix out-of-bounds access when copying descriptors BO list We shouldn't increment the buffer list pointers twice. This fixes some crashes with new CTS dEQP-VK.binding_model.descriptor_copy.*. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `9256e0a09d`)	2019-02-26 10:40:58 -08:00
Tapani Pälli	2cb6cdc2aa	nir: initialize value in copy_prop_vars_block Fixes following valgrind warning: ==27561== Conditional jump or move depends on uninitialised value(s) ==27561== at 0x667856B: value_set_ssa_components (nir_opt_copy_prop_vars.c:78) ==27561== by 0x667A1C4: copy_prop_vars_block (nir_opt_copy_prop_vars.c:797) Fixes: `62332d139c` "nir: Add a local variable-based copy propagation pass" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> (cherry picked from commit `22267feff1`)	2019-02-26 10:40:48 -08:00
Timothy Arceri	dd067d57e1	radeonsi: fix query buffer allocation Fix the logic for buffer full check on alloc. This patch just takes the fix Nicolai attached to the bug report and updates it to work on master. Fixes: `e0f0d3675d` ("radeonsi: factor si_query_buffer logic out of si_query_hw") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109561 (cherry picked from commit `603206d0a6`)	2019-02-26 10:40:40 -08:00
Kenneth Graunke	a4ea63e884	spirv: Eliminate dead input/output variables after translation. spirv_to_nir can generate input/output variables which are illegal for the current shader stage, which would cause nir_validate_shader to balk. After my recent commit to start decorating arrays as compact, dEQP-VK.spirv_assembly.instruction.graphics.module.same_module started hitting validation errors due to outputs in a TCS (not intended for the TCS at all) not being per-vertex arrays. Thanks to Jason Ekstrand for suggesting this approach. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109573 Fixes: `ef99f4c8d1` compiler: Mark clip/cull distance arrays as compact before lowering. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> (cherry picked from commit `6775665e5e`)	2019-02-26 10:40:21 -08:00
Bas Nieuwenhuizen	bdc946c23a	radv: Fix rebase issue in 19.0 for float16 fix. The cherry-pick dropped a chunk. Fixes: `e7351739ff` "radv: Fix float16 interpolation set up." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-02-26 18:38:30 +00:00
Kenneth Graunke	bb3fdedadf	compiler: Mark clip/cull distance arrays as compact before lowering. nir_lower_clip_cull_distance_arrays() marks the combined clip/cull distance array as compact. However, when translating in from GLSL or SPIR-V, we were not marking the original float[] arrays as compact. We should do so. That way, we can detect these corner cases properly. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `ef99f4c8d1`) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-26 16:50:07 +01:00
Jason Ekstrand	c0509143ab	nir/lower_clip_cull: Fix an incorrect assert Copy+paste error. It was supposed to test cull and not clip. Fixes: `4e69fba534` "nir: Rewrite lower_clip_cull_distance_arrays..." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109717 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `f98fd9d15a`)	2019-02-25 14:17:48 -08:00
Bas Nieuwenhuizen	49dad5e8fe	radv: Handle clip+cull distances more generally as compact arrays. Needed for https://gitlab.freedesktop.org/mesa/mesa/merge_requests/248 . That MR keeps the clip and cull arrays split. So we have to handle - compact arrays with location_frac != 0 - VARYING_SLOT_CLIP_DIST1 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `1ef2855692`)	2019-02-25 14:01:31 -08:00
Jason Ekstrand	7114cfc1d2	nir/xfb: Handle compact arrays in gather_xfb_info This makes us properly handle gl_ClipDistance and gl_CullDistance. Fixes: `19064b8c` "nir: Add a pass for gathering transform feedback info" Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> (cherry picked from commit `1a93fc382b`)	2019-02-25 14:00:40 -08:00
Jason Ekstrand	447ef1d9b3	nir/xfb: Work in terms of components rather than slots We needed to better handle cases where a chunk of a variable starts at some non-zero location_frac and rolls over into the next slot but may not be more than 4 dwords. For example, if gl_CullDistance is an array of 3 things and has location_frac = 2, it will span across two vec4s but is not, itself, bigger than a vec4. If you ignore the clip/cull special case, it's not allowed to happen for anything else because the only things that can span more than one slot is dvec3 and dvec4 and they're both bigger than a vec4. The current code uses this attrib_slot thing where we count attribute slots and iterate over them. However, that doesn't work in the case above because gl_CullDistance will have an attrib_slot count of 1 even though it does span two slots. We could fix this by adjusting attrib_slot but we already have comp_mask and it's easier to just handle it that way. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> (cherry picked from commit `558c314504`)	2019-02-25 14:00:34 -08:00
Jason Ekstrand	721011c6f7	nir: Rewrite lower_clip_cull_distance_arrays to do a lot less lowering Instead of going to all the work of to combine them into one array, just make two arrays and use location_frac to colocate them within CLIP0. Then the back-end can sort things out and stack them on top of each other. Thanks to `ef99f4c8`, we also don't need to set compact anymore. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `4e69fba534`) Conflicts resolved by Dylan Conflicts: src/compiler/nir/nir_lower_clip_cull_distance_arrays.c	2019-02-25 13:57:54 -08:00
Jason Ekstrand	fdc25b9b1b	nir/xfb: Properly align 64-bit values Fixes: `19064b8c` "nir: Add a pass for gathering transform feedback info" Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> (cherry picked from commit `8f0fe71cc5`)	2019-02-25 13:29:22 -08:00
Jason Ekstrand	5fcd81c5ed	compiler/types: Add a contains_64bit helper Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> (cherry picked from commit `30b548fc62`)	2019-02-25 13:29:14 -08:00
Maya Rashish	67162ad12f	configure: fix test portability From the bash manual: string1 == string2 string1 = string2 True if the strings are equal. = should be used with the test command for POSIX conformance. (cherry picked from commit `021c496135`)	2019-02-25 13:26:36 -08:00
Bas Nieuwenhuizen	a03101c717	radv: Allow interpolation on non-float types. In particular structs containing floats and 16-bit floating point types. Fixes: `62024fa775` "radv: enable VK_KHR_16bit_storage extension / 16bit storage features" Fixes: `da29594636` "spirv: Only split blocks" Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109735 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `f324784104`)	2019-02-25 13:26:36 -08:00
Bas Nieuwenhuizen	e7351739ff	radv: Fix float16 interpolation set up. float16 types can have non-flat interpolation so set up the HW correctly for that. Fixes: `62024fa775` "radv: enable VK_KHR_16bit_storage extension / 16bit storage features" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `a1fdd4a4a7`) Conflicts resolved by Dylan Conflicts: src/amd/vulkan/radv_nir_to_llvm.c	2019-02-25 13:26:36 -08:00
Sergii Romantsov	697c6c5a19	dri: meson: do not prefix user provided dri-drivers-path The user can select the location where there dri drivers are installed by the dri-drivers-path meson option. By default path will be $prefix/$libdir/dri. Currently we add $prefix to the user provided path. Resulting in an incorrect or even missing path. v2: fixed dri_search_path by default, rebased to master v3: new commit-message (Emil Velikov), cc mesa-stable Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109698 CC: Rafael Antognolli <rafael.antognolli@intel.com> CC: Dylan Baker <dylan@pnwbakers.com> Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Fixes: `306914db92` (meson: Add dridriverdir variable to dri.pc.) Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `f6556ec7d1`)	2019-02-25 13:26:36 -08:00
David Shao	edeb13b550	meson: ensure that xmlpool_options.h is generated for gallium targets that need it Fixes: `68076b8747` "meson: build gallium vdpau state tracker" Fixes: `22a817af8a` "meson: build gallium xvmc state tracker" Fixes: `5a785d51a6` "meson: build gallium va state tracker" Fixes: `0ba909f0f1` "meson: build gallium xa state tracker" Fixes: `1d36dc674d` "meson: build gallium omx state tracker" Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `6fa923a65d`)	2019-02-25 13:26:36 -08:00
Alok Hota	06889aaa63	swr/rast: bypass size limit for non-sampled textures This fixes a bug where SWR will fail to render in cases with large buffer allocations, e.g. very large meshes whose vertex buffers exceed 2GB CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com> (cherry picked from commit `6053499f2e`)	2019-02-25 13:26:36 -08:00
Marek Olšák	1d4fbff786	tgsi: don't set tgsi_info::uses_bindless_images for constbufs and hw atomics This might have decreased performance for radeonsi/tgsi, because most most shaders claimed they used bindless. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> (cherry picked from commit `b326a15eda`)	2019-02-25 13:26:36 -08:00
Juan A. Suarez Romero	9f5622fe14	anv: advertise 8 subpixel precision bits On one side, when emitting 3DSTATE_SF, VertexSubPixelPrecisionSelect is used to select between 8 bit subpixel precision (value 0) or 4 bit subpixel precision (value 1). As this value is not set, means it is taking the value 0, so 8 bit are used. On the other side, in the Vulkan CTS tests, if the reference rasterizer, which uses 8 bit precision, as it is used to check what should be the expected value for the tests, is changed to use 4 bit as ANV was advertising so far, some of the tests will fail. So it seems ANV is actually using 8 bits. v2: explicitly set 3DSTATE_SF::VertexSubPixelPrecisionSelect (Jason) v3: use _8Bit definition as value (Jason) v4: (by Jason) anv: Explicitly set 3DSTATE_CLIP::VertexSubPixelPrecisionSelect This field was added on gen8 even though there's an identically defined one in 3DSTATE_SF. CC: Jason Ekstrand <jason@jlekstrand.net> CC: Kenneth Graunke <kenneth@whitecape.org> CC: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `4f917e6a61`)	2019-02-25 13:26:35 -08:00
Juan A. Suarez Romero	be1200d6a5	genxml: add missing field values for 3DSTATE_SF Fill out "Vertex Sub Pixel Precision Select" possible values. CC: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `3b423eeb2d`)	2019-02-25 13:26:35 -08:00
Lionel Landwerlin	641884c884	intel: fix urb size for CFL GT1 Same 192Kb amount as SKL/KBL GT1 applies. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Fixes: `de7ed0ba55` ("i965/CFL: Add PCI Ids for Coffee Lake.") (cherry picked from commit `1d626fc028`)	2019-02-25 13:26:35 -08:00
Francisco Jerez	32e2f645e4	intel/fs: Implement extended strides greater than 4 for IR source regions. Strides up to 32B can be implemented for the source regions of most instructions by leveraging either the vertical or the horizontal stride of the hardware Align1 region. The main motivation for this is that currently the lower_integer_multiplication() pass will happily double the stride of one of the 32-bit sources, which can blow up if the stride of the original source was already the maximum value allowed by the hardware. An alternative would be to use the regioning legalization pass in order to lower such strides into the composition of multiple legal strides, but that would be somewhat less efficient. This showed up as a regression from my commit `cbea91eb57` in Vulkan 1.1 CTS tests on CHV/BXT platforms, however it was really a pre-existing problem that had affected conformance on other platforms without native support for integer multiplication. CHV/BXT were getting around it because the code I removed in that commit had the "fortunate" side effect of emitting narrower regions that didn't hit the hardware stride limit after lowering. Beyond fixing the regression this fixes ~90 additional Vulkan 1.1 subgroup CTS tests on ICL (that's why this patch is marked for inclusion in mesa-stable even though the original regressing patch was not). According to Jason, a nearly equivalent change had been committed previously as `e8c9e65185` and then (mistakenly?) reverted as `a31d038208`. Cc: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109328 Reported-by: Mark Janes <mark.a.janes@intel.com> Tested-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `e03be78252`)	2019-02-25 13:26:35 -08:00
Francisco Jerez	d45d35768b	intel/fs: Exclude control sources from execution type and region alignment calculations. Currently the execution type calculation will return a bogus value in cases like: mov_indirect(8) vgrf0:w, vgrf1:w, vgrf2:ud, 32u Which will be considered to have a 32-bit integer execution type even though the actual indirect move operation will be carried out with 16-bit precision. Similarly there's no need to apply the CHV/BXT double-precision region alignment restrictions to such control sources, since they aren't directly involved in the double-precision arithmetic operations emitted by these virtual instructions. Applying the CHV/BXT restrictions to control sources was expected to be harmless if mildly inefficient, but unfortunately it exposed problems at codegen level for virtual instructions (namely the SHUFFLE instruction used for the Vulkan 1.1 subgroup feature) that weren't prepared to accept control sources with an arbitrary strided region. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109328 Reported-by: Mark Janes <mark.a.janes@intel.com> Fixes: `efa4e4bc5f` "intel/fs: Introduce regioning lowering pass." Tested-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `c3c27762f7`)	2019-02-25 13:26:35 -08:00
Andrii Simiklit	4cfdd5a1f2	i965: re-emit index buffer state on a reset option change. Seems like we forget to update the index buffer (ib) status and IndexedDrawCutIndexEnable or CutIndexEnable flag is left unchanged it leads to ignoring of glEnable/glDisable functions for GL_PRIMITIVE_RESTART in some cases. The index buffer (ib) status should be re-emmited after the reset option change to avoid some unexpected behavior. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109451 Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Signed-off-by: Andrii Simiklit <asimiklit.work@gmail.com> (cherry picked from commit `f4f4ec941e`)	2019-02-25 09:13:48 -08:00
Rodrigo Vivi	6abb6bd87a	intel: Add more PCI Device IDs for Coffee Lake and Ice Lake. Align with kernel commits: 5e0f5a58b167 ("drm/i915/cfl: Adding another PCI Device ID.") 03ca3cf8e9aa ("drm/i915/icl: Adding few more device IDs for Ice Lake") Cc: José Roberto de Souza <jose.souza@intel.com> Cc: Kenneth Graunke <kenneth@whitecape.org> Cc: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `56c3b4971d`)	2019-02-20 08:19:31 -08:00
Carlos Garnacho	c4799332b8	wayland/egl: Ensure EGL surface is resized on DRI update_buffers() Fullscreening and unfullscreening a totem window while playing a video sometimes results in the video subsurface not changing size along. This is also reproducible with epiphany. If a surface gets resized while we have an active back buffer for it, the resized dimensions won't get neither immediately applied on the resize callback, nor correctly synchronized on update_buffers(), as the (now stale) surface size and currently attached buffer size still do match. There's actually 2 things to synchronize here, first the surface query size might not be updated yet to the wl_egl_window's (i.e. resize_callback happened while there is a back buffer), and second the wayland buffers would need dropping if new surface size differs with the currently attached buffer. These are done in separate steps now. https://bugzilla.redhat.com/show_bug.cgi?id=1650929 https://bugs.freedesktop.org/show_bug.cgi?id=109594 Fixes: `a9fb331ea7` ("wayland/egl: update surface size on window resize") Signed-off-by: Carlos Garnacho <carlosg@gnome.org> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Tested-by: Bastien Nocera <hadess@hadess.net> Tested-by: Denys Kostin <denys.kostin@globallogic.com> (cherry picked from commit `30a01cd923`)	2019-02-20 08:14:21 -08:00
Bas Nieuwenhuizen	73fa079b1d	radv: Sync ETC2 whitelisted devices. Fixes: `4bb6c49375` "radv: Allow ETC2 on RAVEN and VEGA10 instead of all GFX9." Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `7631feaa00`)	2019-02-20 08:14:13 -08:00
Mario Kleiner	2ea2b2273b	drirc: Add sddm-greeter to adaptive_sync blacklist. This is the sddm login screen. Fixes: `a9c36dbf9c` ("drirc: Initial blacklist for adaptive sync") Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com> Cc: 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `afb15d14ca`)	2019-02-20 08:14:01 -08:00
Marek Olšák	dd03e1d5de	driconf: add Civ6Sub executable for Civilization 6 I'm getting Civ6Sub instead of Civ6. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `bff8da6c59`)	2019-02-20 08:13:50 -08:00
Marek Olšák	c837cd6546	radeonsi: always enable NIR for Civilization 6 to fix corruption Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104602 Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `ae21bdf47c`)	2019-02-20 08:13:34 -08:00
Marek Olšák	a5f16a42a5	radeonsi: add driconf option radeonsi_enable_nir Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `ccbfe44e5f`)	2019-02-20 08:13:24 -08:00
Dylan Baker	2e7833ad91	Version: update to 19.0-rc5	2019-02-19 11:15:18 -08:00
Tapani Pälli	0a2e4b02ca	mesa: return NULL if we exceed MaxColorAttachments in get_fb_attachment This fixes invalid access to Attachment array which would occur if caller would exceed MaxColorAttachments. In practice this should not ever happen because DiscardFramebufferEXT specifies only GL_COLOR_ATTACHMENT0 to be valid and InvalidateFramebuffer will error out before but this should make coverity happy. v2: const, remove _EXT (Ian) CID: 1442559 Fixes: `0c42b5f3cb` "mesa: wire up InvalidateFramebuffer" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `9762a9f893`)	2019-02-19 07:08:42 -08:00
Rhys Perry	c7fc61d15b	radv: ensure export arguments are always float So that the signature is correct and consistent, the inputs to a export intrinsic should always be 32-bit floats. This and the previous commit fixes a large amount crashes from dEQP-VK.spirv_assembly.instruction.graphics.16bit_storage.input_output_int_* tests Fixes: `b722b29f10` ('radv: add support for 16bit input/output') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `0ca550e01a`)	2019-02-19 07:08:23 -08:00
Rhys Perry	1b093b567f	radv: bitcast 16-bit outputs to integers 16-bit outputs are stored as 16-bit floats in the outputs array, so they have to be bitcast. Fixes: `b722b29f10` ('radv: add support for 16bit input/output') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `64065aa504`)	2019-02-19 07:08:11 -08:00
Eric Anholt	d73e48b63f	v3d: Fix the check for "is the last thrsw inside control flow" The execute.file check used to be good enough, until I stopped setting up the execute mask for uniform ifs. No known tests fixed, noticed while doing a refactor. Fixes: `0805060573` ("v3d: Handle dynamically uniform IF statements with uniform control flow.") (cherry picked from commit `441294962c`)	2019-02-19 07:07:54 -08:00
Eric Anholt	ba24ca67f6	v3d: Use the early_fragment_tests flag for the shader's disable-EZ field. Apparently we need disable-EZ flagged, not just "does Z writes". Fixes dEQP-GLES31.functional.image_load_store.early_fragment_tests.no_early_fragment_tests_depth_fbo on 7278, even though it passed in simulation. Signed-off-by: Eric Anholt <eric@anholt.net> Fixes: `051a41d3d5` ("v3d: Add support for the early_fragment_tests flag.") (cherry picked from commit `cd5e0b2729`)	2019-02-19 07:07:38 -08:00
Samuel Pitoiset	110500cc8a	radv: fix writing the alpha channel of MRT0 when alpha coverage is enabled This version is better and safer. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `47616810ed`)	2019-02-19 07:07:22 -08:00
Samuel Pitoiset	0b9f6ebfbb	radv: write the alpha channel of MRT0 when alpha coverage is enabled Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109597 Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `0d8f096293`)	2019-02-19 07:07:13 -08:00
Kenneth Graunke	69ebf4569a	nir: Don't reassociate add/mul chains containing only constants The idea here is to reassociate a * (b * c) into (a * c) * b, when b is a non-constant value, but a and c are constants, allowing them to be combined. But nothing was enforcing that 'b' must be non-constant, which meant that running opt_algebraic in a loop would never terminate if the IR contained non-folded constant expressions like 256 * 0.5 * 2. Normally, we call constant folding in such a loop too, but IMO it's better for nir_opt_algebraic to be robust and not rely on that. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109581 Fixes: `32e266a9a5` i965: Compile fp64 funcs only if we do not have 64-bit hardware support Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `535251487b`)	2019-02-19 07:07:04 -08:00
Matt Turner	385b736238	intel/compiler/test: Add unit test for mismatched signedness comparison v2 (idr): Move adding the test to after adding the fix. Reordering the two commits prevents possible headaches for git-bisect with scripts that always do 'ninja check'. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109404 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `ac21dd4aee`)	2019-02-15 12:03:53 -08:00
Matt Turner	4cf1a40f9a	intel/compiler: Avoid propagating inequality cmods if types are different v2: Fix silly bug in logic. s/\|\|/&&/ All but one of the affected shaders is in an Unreal4 demo. The other is in Tomb Raider. All of the cases that Ian investigated appear to be sequences like the following if (int(uint(some_float)) < 0) /* other relations too */ ... At least in Tomb Raider, it's not obvious that this sequence came from the original shader. In some of the Unreal demos, the shader contains code like if (int(uint(textureLod(...))) > 0) ... which explicitly generates the offending sequence. All Gen6+ platforms had similar results (Skylake shown): total instructions in shared programs: 15437170 -> 15437187 (<.01%) instructions in affected programs: 4492 -> 4509 (0.38%) helped: 0 HURT: 17 HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.05% max: 0.73% x̄: 0.66% x̃: 0.73% 95% mean confidence interval for instructions value: 1.00 1.00 95% mean confidence interval for instructions %-change: 0.57% 0.75% Instructions are HURT. total cycles in shared programs: 383007996 -> 383007992 (<.01%) cycles in affected programs: 20542 -> 20538 (-0.02%) helped: 6 HURT: 7 helped stats (abs) min: 2 max: 6 x̄: 5.33 x̃: 6 helped stats (rel) min: 0.11% max: 0.36% x̄: 0.32% x̃: 0.36% HURT stats (abs) min: 4 max: 4 x̄: 4.00 x̃: 4 HURT stats (rel) min: 0.27% max: 0.27% x̄: 0.27% x̃: 0.27% 95% mean confidence interval for cycles value: -3.30 2.69 95% mean confidence interval for cycles %-change: -0.19% 0.19% Inconclusive result (value mean confidence interval includes 0). No changes on Iron Lake or GM45. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109404 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: nagrigoriadis@gmail.com Tested-by: Danylo Piliaiev <danylo.piliaiev@gmail.com> (cherry picked from commit `2dff9a66b6`)	2019-02-15 12:03:44 -08:00
Jason Ekstrand	81e053b757	intel/fs: Bail in optimize_extract_to_float if we have modifiers This fixes a bug in runscape where we were optimizing x >> 16 to an extract and then negating and converting to float. The NIR to fs pass was dropping the negate on the floor breaking a geometry shader and causing it to render nothing. Fixes: `1f862e923c` "i965/fs: Optimize float conversions of byte/word..." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109601 Tested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `367b0ede4d`)	2019-02-15 10:03:26 -08:00
Ilia Mirkin	f30fb27665	swr: set PIPE_CAP_MAX_VARYINGS correctly Unfortunately swr was missed in the original commit. The number of varyings should generally match up to what's reported as the shader caps for fragment inputs. Fixes: `6010d7b8e8` (gallium: add PIPE_CAP_MAX_VARYINGS) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Alok Hota <alok.hota@intel.com> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `8c859367df`)	2019-02-15 10:03:20 -08:00
Kenneth Graunke	1039285288	anv: Put MOCS in the correct location My patch to switch from struct-based MOCS to numeric MOCS accidentally divided all MOCS entries by 2 in the Vulkan driver. MOCS on Gen9+ is just an array index into a table. But in the hardware packets, the index starts at bit 1. So we need to shift it. Fixes: `0b44644ca6` (genxml: Consistently use a numeric "MOCS" field) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `39aee57523`)	2019-02-15 10:03:08 -08:00
Ian Romanick	59812ac38d	spirv: Add missing break Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Fixes: `c6465fec0c` ("spirv: add SpvCapabilityInt64Atomics") CID: 1442555 (cherry picked from commit `9a918050e0`)	2019-02-14 09:30:54 -08:00
Dylan Baker	c19ce6e5e2	meson: Add dependency on genxml to anvil Currently the Intel "anvil" driver races with the generation of genxml files, while i965 has an explicit dependency. This patch adds the same dependency to anvil. Fixes: `d1992255bb` ("meson: Add build Intel "anv" vulkan driver") Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `279060cd32`)	2019-02-14 09:30:44 -08:00
Samuel Pitoiset	eba57c29b0	radv: always export gl_SampleMask when the fragment shader uses it For some reasons, this breaks trees rendering in Project Cars. Fixes: `85010585cd` ("radv: only enable gl_SampleMask if MSAA is enabled too") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109401 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `334da034d8`)	2019-02-14 09:30:38 -08:00
Samuel Pitoiset	e304007d87	radv/winsys: fix BO list creation when RADV_DEBUG=allbos is set Fixes: `50fd253bd6` ("radv/winsys: Add priority handling during submit.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `5e18000d1b`)	2019-02-14 09:30:33 -08:00
Dylan Baker	b4419fdba5	get-pick-list: Add --pretty=medium to the arguments for Cc patches Because none of them have been picked up for 19.0 due to this bug being reintroduced. v2: - Fix fixes tags Fixes: `e6b3a3b201` ("bin/get-pick-list.sh: handle "typod" usecase.") Fixes: `fac10169bb` ("bin/get-pick-list.sh: prefix output with "[stable] "") Reviewed-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `aff52dd2c6`)	2019-02-13 14:14:35 -08:00
Karol Herbst	7ac15d9e42	nir/opt_if: don't mark progress if nothing changes if we have something like this: loop { ... if x { break; } else { continue; } } opt_if_loop_last_continue returns true marking progress allthough nothing changes. Fixes: `5921a19d4b` "nir: add if opt opt_if_loop_last_continue()" Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `7e08f22a72`)	2019-02-13 14:14:35 -08:00
Oscar Blumberg	6b48451110	radeonsi: Fix guardband computation for large render targets Stop using 12.12 quantization for viewports that are not contained in the lower 4k corner of the render target as the hardware needs to keep both absolute and relative coordinates representable. Signed-off-by: Marek Olšák <marek.olsak@amd.com> Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `3c540e0a74`)	2019-02-13 14:14:35 -08:00
Dylan Baker	838baab472	version: bump for 19.0-rc4	2019-02-13 09:11:02 -08:00
Juan A. Suarez Romero	d8534f931c	anv/cmd_buffer: check for NULL framebuffer This can happen when we record a VkCmdDraw in a secondary buffer that was created inheriting from the primary buffer, but with the framebuffer set to NULL in the VkCommandBufferInheritanceInfo. Vulkan 1.1.81 spec says that "the application must ensure (using scissor if neccesary) that all rendering is contained in the render area [...] [which] must be contained within the framebuffer dimesions". While this should be done by the application, commit `465e5a86` added the clamp to the framebuffer size, in case of application does not do it. But this requires to know the framebuffer dimensions. If we do not have a framebuffer at that moment, the best compromise we can do is to just apply the scissor as it is, and let the application to ensure the rendering is contained in the render area. v2: do not clamp to framebuffer if there isn't a framebuffer v3 (Jason): - clamp earlier in the conditional - clamp to render area if command buffer is primary v4: clamp also x and y to render area (Jason) v5: rename used variables (Jason) Fixes: `465e5a86` ("anv: Clamp scissors to the framebuffer boundary") CC: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `1ad26f9417`)	2019-02-12 14:19:52 -08:00
Samuel Pitoiset	1f33f3cf3a	radv: fix using LOAD_CONTEXT_REG with old GFX ME firmwares on GFX8 This fixes a critical issue. Cc: <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109575 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `1b8983c25b`)	2019-02-12 14:19:52 -08:00
Samuel Pitoiset	fbcd1ad42c	radv: fix compiler issues with GCC 9 "The C standard says that compound literals which occur inside of the body of a function have automatic storage duration associated with the enclosing block. Older GCC releases were putting such compound literals into the scope of the whole function, so their lifetime actually ended at the end of containing function. This has been fixed in GCC 9. Code that relied on this extended lifetime needs to be fixed, move the compound literals to whatever scope they need to accessible in." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109543 Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Gustaw Smolarczyk <wielkiegie@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `129a9f4937`)	2019-02-12 14:19:52 -08:00
Kenneth Graunke	9a5c8d2aab	st/mesa: Limit GL_MAX_[NATIVE_]PROGRAM_PARAMETERS_ARB to 2048 Piglit's vp-max-array test creates a vertex program containing a uniform array sized to the value of GL_MAX_NATIVE_PROGRAM_PARAMETERS_ARB. Mesa will then add additional state-var parameters for things like the MVP matrix. radeonsi currently exposes a value of 4096, derived from constant buffer upload size. This means the array will have 4096 elements, and the extra MVP state-vars would get a prog_src_register::Index of over 4096. Unfortunately, prog_src_register::Index is a signed 13-bit integer, so values beyond 4096 end up turning into negative numbers. Negative source indexes are only valid for relative addressing, so this ends up generating illegal IR. In prog_to_nir, this would cause an out of bounds array access. st_mesa_to_tgsi checks for a negative value, assumes it's bogus, and remaps it to parameter 0 in order to get something in-range. This isn't right - instead of reading the MVP matrix, it would read the first element of the vertex program's large array. But the test only checks that the program compiles, so we never noticed that it was broken. This patch limits the size of the program limits, with the understanding that we may need to generate additional state-vars internally. i965 has exposed 1024 for this limit for years, so I don't expect lowering it to 2048 will cause any practical problems for radeonsi or other drivers. Fixes vp-max-array with prog_to_nir.c. Cc: "19.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net> (cherry picked from commit `f45dd6d31b`)	2019-02-12 14:19:52 -08:00
Leo Liu	c55008e5a0	st/va/vp9: set max reference as default of VP9 reference number If there is no information about number of render targets Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `a0a52a0367`)	2019-02-12 14:19:52 -08:00
Leo Liu	ab585817e6	st/va: fix the incorrect max profiles report Add "PIPE_VIDEO_PROFILE_MAX" to enum, so it will make sure here will be correct when adding more profiles in the future. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109107 Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `21cdb828a3`)	2019-02-12 14:19:52 -08:00
Marek Olšák	75bec50c2a	winsys/amdgpu: don't drop manually added fence dependencies wow, it's hard to believe that fence and syncobjs dependencies were ignored. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (cherry picked from commit `ddfe209a0d`)	2019-02-12 14:19:52 -08:00
Marek Olšák	62b3bd8cd1	radeonsi: fix EXPLICIT_FLUSH for flush offsets > 0 Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (cherry picked from commit `61c678d4bc`)	2019-02-12 14:19:52 -08:00
Marek Olšák	fb3485bc92	gallium/u_threaded: fix EXPLICIT_FLUSH for flush offsets > 0 Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (cherry picked from commit `4522f01d4e`)	2019-02-12 14:19:52 -08:00
Ilia Mirkin	2a97a3a8e7	nvc0: we have 16k-sized framebuffers, fix default scissors For some reason we don't use view volume clipping by default, and use scissors instead. These scissors were set to an 8k max fb size, while the driver advertises 16k-sized framebuffers. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: <mesa-stable@lists.freedesktop.org> (cherry picked from commit `cc79a1483f`)	2019-02-12 14:19:52 -08:00
Karol Herbst	ab70eccc75	st/mesa: require RGBA2, RGB4, and RGBA4 to be renderable If the driver does not support rendering to these formats but does support texturing, we can end up in incompatibilities between textures and renderbuffers that are then copied to. Fixes KHR-GL45.copy_image.functional on nvc0 Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `cbd1ad6165`)	2019-02-12 14:19:52 -08:00
Karol Herbst	24bb2771b6	gallium: add PIPE_CAP_MAX_VARYINGS Some NVIDIA hardware can accept 128 fragment shader input components, but only have up to 124 varying-interpolated input components. We add a new cap to express this cleanly. For most drivers, this will have the same value as PIPE_SHADER_CAP_MAX_INPUTS for the fragment shader. Fixes KHR-GL45.limits.max_fragment_input_components Conflicts resolved by Dylan Signed-off-by: Karol Herbst <karolherbst@gmail.com> [imirkin: rebased, improved docs/commit message] Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `6010d7b8e8`)	2019-02-12 14:19:52 -08:00
Karol Herbst	7b5e0f8316	gm107/ir: add fp64 rsq Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `cce4955721`)	2019-02-12 14:19:52 -08:00
Karol Herbst	77102d0151	gm107/ir: add fp64 rcp Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `815a8e59c6`)	2019-02-12 14:19:52 -08:00
Karol Herbst	c96d433105	gk104/ir: Use the new rcp/rsq in library [imirkin: add a few more "long" prefixes to safen things up] Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `12669d2970`)	2019-02-12 14:19:52 -08:00
Boyan Ding	81810fa5db	gk110/ir: Use the new rcp/rsq in library v2: (Karol Herbst <kherbst@redhat.com> * fix Value setup for the builtins Signed-off-by: Boyan Ding <boyan.j.ding@gmail.com> [imirkin: track the fp64 flag when switching ops to calls] Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `656ad06051`)	2019-02-12 14:19:52 -08:00
Boyan Ding	c5b9774eb4	gk110/ir: Add rsq f64 implementation Signed-off-by: Boyan Ding <boyan.j.ding@gmail.com> Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `7937408052`)	2019-02-12 14:19:52 -08:00
Boyan Ding	a08aba86da	gk110/ir: Add rcp f64 implementation Signed-off-by: Boyan Ding <boyan.j.ding@gmail.com> Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `04593d9a73`)	2019-02-12 14:19:52 -08:00
Ilia Mirkin	d278b3c187	nvc0: stick zero values for the compute invocation counts Not quite perfect, but at least we don't end up with random values in the query buffer. Fixes KHR-GL45.pipeline_statistics_query_tests_ARB.functional_default_qo_values Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `6adb9b38bf`)	2019-02-12 14:19:52 -08:00
Ilia Mirkin	5a9b7bce9c	nv50,nvc0: use condition for occlusion queries when already complete For the NO_WAIT variants, we would jump into the ALWAYS case for both nested and inverted occlusion queries. However if the query had previously completed, the application could reasonably expect that the render condition would follow that result. To resolve this, we remove the nesting distinction which unnecessarily created an imbalance between the regular and inverted cases (since there's no "zero" condition mode). We also use the proper comparison if we know that the query has completed (which could happen as a result of an earlier get_query_result call). Fixes KHR-GL45.conditional_render_inverted.functional Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `e00799d3dc`)	2019-02-12 14:19:52 -08:00
Ilia Mirkin	b9e5e15f87	nvc0: fix 3d images on kepler Looks like SUBFM.3D and SUEAU are perfectly capable of dealing with 3d tiling, they just need the correct inputs. Supply them. We also have to deal with the case where a 2d "layer" of a 3d image is bound. In this case, we supply the z coordinate separately to the shader, which has to optionally treat every 2d case as if it could be a slice of a 3d texture. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `162352e671`)	2019-02-12 14:19:52 -08:00
Ilia Mirkin	f305135e0b	nvc0/ir: always use CG mode for loads from atomic-only buffers Atomic operations don't update the local cache, which means that we would have to issue CCTL operations in order to get the updated values. When we know that a buffer is primarily used for atomic operations, it's easier to just avoid the caching at that level entirely. The same issue persists for non-atomic buffers, which will have to be fixed separately. Fixes the failing dEQP-GLES31.functional.atomic_counter.* tests. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `4443b6ddf2`)	2019-02-12 14:19:52 -08:00
Ilia Mirkin	eb766a259e	nvc0: add support for handling indirect draws with attrib conversion The hardware does not natively support FIXED and DOUBLE formats. If those are used in an indirect draw, they have to be converted. Our conversion tries to be clever about only converting the data that's needed. However for indirect, that won't work. Given that DOUBLE or FIXED are highly unlikely to ever be used with indirect draws, read the indirect buffer on the CPU and issue draws directly. Fixes the failing dEQP-GLES31.functional.draw_indirect.random.* tests. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `399215eb7a`)	2019-02-12 14:19:52 -08:00
Bas Nieuwenhuizen	a1ae60e9a3	amd/common: Use correct writemask for shared memory stores. The check was for 1 bit being set, which is clearly not what we want. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `3c24fc64c7`)	2019-02-12 14:19:52 -08:00
Bas Nieuwenhuizen	37ade3a566	radv: Only look at pImmutableSamples if the descriptor has a sampler. Equivalent of ANV patch `c7f4a2867c` CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `39ab4e12f7`)	2019-02-12 14:19:52 -08:00
Bart Oldeman	92fa6d6959	gallium-xlib: query MIT-SHM before using it. When Mesa is compiled for gallium-xlib using e.g. ./configure --enable-glx=gallium-xlib --disable-dri --disable-gbm -disable-egl and is used by an X server (usually remotely via SSH X11 forwarding) that does not support MIT-SHM such as XMing or MobaXterm, OpenGL clients report error messages such as Xlib: extension "MIT-SHM" missing on display "localhost:11.0". ad infinitum. The reason is that the code in src/gallium/winsys/sw/xlib uses MIT-SHM without checking for its existence, unlike the code in src/glx/drisw_glx.c and src/mesa/drivers/x11/xm_api.c. I copied the same check using XQueryExtension, and tested with glxgears on MobaXterm. This issue was reported before here: https://lists.freedesktop.org/archives/mesa-users/2016-July/001183.html Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Cc: <mesa-stable@lists.freedesktop.org> (cherry picked from commit `a203eaa4f4`)	2019-02-12 14:19:52 -08:00
Ilia Mirkin	5e85df1cfd	nv50,nvc0: add explicit settings for recent caps Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `38f542783f`)	2019-02-12 14:19:52 -08:00
Marek Olšák	e9dc4e252f	meson: drop the xcb-xrandr version requirement autotools doesn't have any requirement. This fixes meson on Ubuntu 16.04. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (cherry picked from commit `1e85cfb91a`)	2019-02-12 14:19:52 -08:00
Dylan Baker	56a47e3421	Bump version for 19.0-rc3	2019-02-12 12:39:36 -08:00
Dylan Baker	ca36eb12fd	Revert "intel/compiler: More peephole select" This reverts commit `8fb8ebfbb0`.	2019-02-12 09:42:59 -08:00
Dylan Baker	9dd433dfa7	Revert "nir/opt_peephole_select: Don't peephole_select expensive math instructions" This reverts commit `378f996771`. This also remove the default true argument from the a2xx nir backend, which was introduced after this commit. There should be no change in functionality.	2019-02-12 09:42:16 -08:00
Dylan Baker	f59c77ef8c	Revert "intel/compiler: More peephole_select for pre-Gen6" This reverts commit `af07141b33`.	2019-02-11 16:26:01 -08:00
Dylan Baker	61c22ba94b	cherry-ignore: Add some patches	2019-02-11 16:24:42 -08:00
Jason Ekstrand	ad2b712a56	nir/deref: Rematerialize parents in rematerialize_derefs_in_use_blocks When nir_rematerialize_derefs_in_use_blocks_impl was first written, I attempted to optimize things a bit by not bothering to re-materialize the sources of deref instructions figuring that the final caller would take care of that. However, in the case of more complex deref chains where the first link or two lives in block A and then another link and the load/store_deref intrinsic live in block B it doesn't work. The code in rematerialize_deref_in_block looks at the tail of the chain, sees that it's already in block B and skips it, not realizing that part of the chain also lives in block A. The easy solution here is to just rematerialize deref sources of deref instructions as well. This may potentially lead to a few more deref instructions being created by the conditions required for that to actually happen are fairly unlikely and, thanks to the caching, it's all linear time regardless. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109603 Fixes: `7d1d1208c2` "nir: Add a small pass to rematerialize derefs per-block" Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> (cherry picked from commit `9e6a6ef0d4`)	2019-02-11 16:24:42 -08:00
Ian Romanick	07e299a0a0	nir: Silence zillions of unused parameter warnings in release builds Fixes: `cd56d79b59` "nir: check NIR_SKIP to skip passes by name" Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `78169870e4`)	2019-02-11 09:07:09 -08:00
Ilia Mirkin	36d99d9ad0	nvc0/ir: fix second tex argument after levelZero optimization We used to pre-set a bunch of extra arguments to a texture instruction in order to force the RA to allocate a register at the boundary of 4. However with the levelZero optimization, which removes a LOD argument when it's uniformly equal to zero, we undid that logic by removing an extra argument. As a result, we could end up with insufficient alignment on the second wide texture argument. Instead we switch to a different method of achieving the same result. The logic runs during the constraint analysis of the RA, and adds unset sources as necessary right before being merged into a wide argument. Fixes MISALIGNED_REG errors in Hitman when run with bindless textures enabled on a GK208. Fixes: `9145873b15` ("nvc0/ir: use levelZero flag when the lod is set to 0") Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `5de5beedf2`)	2019-02-07 09:51:39 -08:00
Kristian H. Kristensen	94f0908216	freedreno/a6xx: Emit blitter dst with OUT_RELOCW We're writing to the bo and the kernel needs to know for fd_bo_cpu_prep() to work. Fixes: `f93e431272` ("freedreno/a6xx: Enable blitter") Reviewed-by: Rob Clark <robdclark@gmail.com> Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> (cherry picked from commit `357ea7da51`)	2019-02-07 09:51:39 -08:00
Bas Nieuwenhuizen	f880c74717	amd/common: handle nir_deref_cast for shared memory from integers. Can happen e.g. after a phi. Fixes: `a2b5cc3c39` "radv: enable variable pointers" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `8d1718590b`)	2019-02-07 09:51:39 -08:00
Bas Nieuwenhuizen	6f36d3bbc0	amd/common: Handle nir_deref_type_ptr_as_array for shared memory. Fixes: `a2b5cc3c39` "radv: enable variable pointers" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `830fd0efc1`)	2019-02-07 09:51:39 -08:00
Bas Nieuwenhuizen	b4e8a3294c	amd/common: Add gep helper for pointer increment. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `e00d9a9a72`)	2019-02-07 09:51:39 -08:00
Bas Nieuwenhuizen	ef6809ba88	amd/common: Fix stores to derefs with unknown variable. Fixes: `a2b5cc3c39` "radv: enable variable pointers" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `dbdb44d575`)	2019-02-07 09:38:23 -08:00
Bas Nieuwenhuizen	7254d2f4a3	radv: Fix the shader info pass for not having the variable. For example with VK_EXT_buffer_device_address or VK_KHR_variable_pointers. Fixes: `a2b5cc3c39` "radv: enable variable pointers" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `00253ab2c4`)	2019-02-07 09:37:37 -08:00
Eric Engestrom	dbc43e3897	xvmc: fix string comparison Fixes: `6fca18696d` "g3dvl: Update XvMC unit tests." Cc: Younes Manton <younes.m@gmail.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `40b53a7203`)	2019-02-07 09:37:17 -08:00
Eric Engestrom	262fd16b99	xvmc: fix string comparison Fixes: `c7b65dcaff` "xvmc: Define some Xv attribs to allow users to specify color standard and procamp" Cc: Christian König <christian.koenig@amd.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> (cherry picked from commit `110a6e1839`)	2019-02-07 09:37:07 -08:00
Jonathan Marek	452f9b9984	freedreno: a2xx: fix fast clear Fixes: `912a9c8d` Signed-off-by: Jonathan Marek <jonathan@marek.ca> Cc: 19.0 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `3361305f57`)	2019-02-06 09:54:31 -08:00
Dylan Baker	131f12d49f	Version: Bump for rc2	2019-02-05 11:49:03 -08:00
Emil Velikov	f8f68c41a1	anv: wire up the state_pool_padding test Cc: Jason Ekstrand <jason@jlekstrand.net> Fixes: `927ba12b53` ("anv/tests: Adding test for the state_pool padding.") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com><Paste> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> (cherry picked from commit `8943eb8f03`)	2019-02-05 11:41:54 -08:00
Michel Dänzer	15e2fc16e9	loader/dri3: Use strlen instead of sizeof for creating VRR property atom sizeof counts the terminating null character as well, so that also contributed to the ID computed for the X11 atom. But the convention is for only the non-null characters to contribute to the atom ID. Fixes: `2e12fe425f` "loader/dri3: Enable adaptive_sync via _VARIABLE_REFRESH property" Reviewed-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net> (cherry picked from commit `c0a540f320`)	2019-02-05 11:41:48 -08:00
Marek Olšák	3f5099180d	radeonsi: fix crashing performance counters (division by zero) Fixes: `e2b9329f17` "radeonsi: move remaining perfcounter code into si_perfcounter.c" (cherry picked from commit `742d6cdb42`)	2019-02-05 09:05:51 -08:00
Danylo Piliaiev	9667d89fe6	anv: Fix VK_EXT_transform_feedback working with varyings packed in PSIZ Transform feedback did not set correct SO_DECL.ComponentMask for varyings packed in VARYING_SLOT_PSIZ: gl_Layer - VARYING_SLOT_LAYER in VARYING_SLOT_PSIZ.y gl_ViewportIndex - VARYING_SLOT_VIEWPORT in VARYING_SLOT_PSIZ.z gl_PointSize - VARYING_SLOT_PSIZ in VARYING_SLOT_PSIZ.w Fixes: `36ee2fd61c` "anv: Implement the basic form of VK_EXT_transform_feedback" Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `64d3b148fe`)	2019-02-04 09:16:37 -08:00
Jason Ekstrand	c6649ca94d	intel/fs: Do the grf127 hack on SIMD8 instructions in SIMD16 mode Previously, we only applied the fix to shaders with a dispatch mode of SIMD8 but the code it relies on for SIMD16 mode only applies to SIMD16 instructions. If you have a SIMD8 instruction in a SIMD16 shader, neither would trigger and the restriction could still be hit. Fixes: `232ed89802` "i965/fs: Register allocator shoudn't use grf127..." Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `b4f0d062cd`)	2019-02-04 09:16:21 -08:00
Neha Bhende	89f84f98e0	st/mesa: Fix topogun-1.06-orc-84k-resize.trace crash We need to initialize all fields in rs->prim explicitly while creating new rastpos stage. Fixes: `bac8534267` ("st/mesa: allow glDrawElements to work with GL_SELECT feedback") v2: Initializing all fields in rs->prim as per Ilia. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> (cherry picked from commit `69d736b17a`)	2019-02-01 09:19:29 -08:00
Ernestas Kulik	c824f8031c	v3d: Fix leak in resource setup error path Reported by Coverity: in the case of unsupported modifier request, the code does not jump to the “fail” label to destroy the acquired resource. CID: 1435704 Signed-off-by: Ernestas Kulik <ernestas.kulik@gmail.com> Fixes: `45bb8f2957` ("broadcom: Add V3D 3.3 gallium driver called "vc5", for BCM7268.") (cherry picked from commit `90458bef54`)	2019-01-31 11:12:29 -08:00
Eric Anholt	7fdb08375f	v3d: Fix image_load_store clamping of signed integer stores. This was copy-and-paste fail, that oddly showed up in the CTS's reinterprets of r32f, rgba8, and srgba8 to rgba8i, but not r32ui and r32i to rgba8i or reinterprets to other signed int formats. Fixes: `6281f26f06` ("v3d: Add support for shader_image_load_store.") (cherry picked from commit `ab4d5775b0`)	2019-01-31 11:09:28 -08:00
Eric Anholt	535cc4f1d5	mesa: Skip partial InvalidateFramebuffer of packed depth/stencil. One of the CTS cases tries to invalidate just stencil of packed depth/stencil, and we incorrectly lost the depth contents. Fixes dEQP-GLES3.functional.fbo.invalidate.whole.unbind_read_stencil Fixes: `0c42b5f3cb` ("mesa: wire up InvalidateFramebuffer") Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `db2ae51121`)	2019-01-31 11:09:05 -08:00
Rob Clark	7f91ae20b9	freedreno: more fixing release tarball Fixes: `aa0fed10d3` freedreno: move ir3 to common location Signed-off-by: Rob Clark <robdclark@gmail.com> (cherry picked from commit `39cfdf9930`)	2019-01-31 11:08:53 -08:00
Rob Clark	0a72505a9e	freedreno: fix release tarball Fixes: `b4476138d5` freedreno: move drm to common location Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Signed-off-by: Rob Clark <robdclark@gmail.com> (cherry picked from commit `e252656d14`)	2019-01-31 11:08:11 -08:00
Samuel Pitoiset	31d0079a20	radv/winsys: fix hash when adding internal buffers This fixes serious stuttering in Shadow Of The Tomb Raider. Fixes: `50fd253bd6` ("radv/winsys: Add priority handling during submit.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `9c762c01c8`)	2019-01-31 11:07:40 -08:00
Ernestas Kulik	4d1dd3b0cd	vc4: Fix leak in HW queries error path Reported by Coverity: in the case where there exist hardware and non-hardware queries, the code does not jump to err_free_query and leaks the query. CID: 1430194 Signed-off-by: Ernestas Kulik <ernestas.kulik@gmail.com> Fixes: `9ea90ffb98` ("broadcom/vc4: Add support for HW perfmon") (cherry picked from commit `f6e49d5ad0`)	2019-01-31 11:07:26 -08:00
Emil Velikov	45d1aa2f6c	vc4: Declare the last cpu pointer as being modified in NEON asm. Earlier commit addressed 7 of the 8 instances available. v2: Rebase patch back to master (by anholt) Cc: Carsten Haitzler (Rasterman) <raster@rasterman.com> Cc: Eric Anholt <eric@anholt.net> Fixes: `300d3ae8b1` ("vc4: Declare the cpu pointers as being modified in NEON asm.") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `385843ac3c`)	2019-01-31 10:59:58 -08:00
Dylan Baker	2fddad9e3f	VERSION: bump to 19.0.0-rc1	2019-01-30 14:10:12 -08:00
Dylan Baker	2b603ee4f1	android,autotools,i965: Fix location of float64_glsl.h Android.mk and autotools disagree about where generated files should go, which wasn't a problem until we wanted to build a dist tarball. This corrects the problme by changing the output and include paths to be the same on android and autotools (meson already has the correct include path). Fixes: `7d7b30835c` ("automake: Fix path to generated source")	2019-01-30 14:10:12 -08:00
Dylan Baker	e7f6a5d17f	automake: Add --enable-autotools to distcheck flags Fixes: `e68777c87c` ("autotools: Deprecate the use of autotools")	2019-01-30 09:45:14 -08:00
Dylan Baker	1f5f12687f	configure: Bump SWR LLVM requirement to 7 It is currently impossible to build a dist tarball that works when SWR requires LLVM 6. To generate the tarball we'd need to configure with LLVM 6, which is fine. But to build the dist check we need LLVM 7, as RadeonSI and RadV require that version. Unfortunately the headers genererated with LLVM 6 don't compile with LLVM 7, the API has changed between the two versions. I weighed a couple of options here. One would be to ship an unbootstrapped tarball generated with meson. This would fix the issue by not bootstrapping, so whatever version of LLVM used would work because the SWR headers would be generated at compile time. Unfortunately this would involve some heavy modifications to the infastructure used to upload the tarballs, and I've decided not to persue this.	2019-01-30 09:27:14 -08:00
Dylan Baker	90a7a9c973	automake: Add include dir for nir src directory Fixes: `6281f26f06` ("v3d: Add support for shader_image_load_store.") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-01-29 23:24:57 +00:00
Dylan Baker	82365595e9	automake: Add float64.glsl to dist tarball Fixes: `b63a1f8e40` ("glsl: Create file to contain software fp64 functions") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-01-29 23:24:57 +00:00
Dylan Baker	7d7b30835c	automake: Fix path to generated source Fixes: `b63a1f8e40` ("glsl: Create file to contain software fp64 functions") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-01-29 23:24:57 +00:00
Matt Turner	9de90caca8	nir: Optimize double-precision lower_round_even() Use the trick of adding and then subtracting 2**52 (52 is the number of explicit mantissa bits a double-precision floating-point value has) to implement round-to-even. Cuts the number of instructions on SKL of the piglit test fs-roundEven-double.shader_test from 109 to 21. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2019-01-29 15:02:23 -08:00
Marek Olšák	3e249b853e	ac: use the correct LLVM processor name on Raven2 Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2019-01-29 17:46:55 -05:00
Eric Anholt	f7769b5121	v3d: Fix the autotools build. Noticed while looking at the gitlab-CI MR.	2019-01-29 14:00:27 -08:00
Jonathan Marek	31a1348a66	freedreno: fix sysmem rendering being used when clear is used This batch->cleared value is only used to decide to use sysmem rendering or not, so it should include any buffers that are affected by a clear. This is required because the a2xx fast clear doesn't work with sysmem rendering. The a22x "normal" clear path doesn't work with sysmem either. Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-29 20:22:33 +00:00
Jonathan Marek	c93d77431f	freedreno: fix depth usage logic Depth can be used even when there is no restore/resolve of depth. This happens when the depth buffer is invalidated after rendering to avoid the resolve operation. Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-29 20:22:33 +00:00
Jonathan Marek	bcefa0f1cb	freedreno: fix invalidate logic Set dirty bits on invalidate to trigger invalidate logic in fd_draw_vbo. Also, resource_written for color needs to be after the invalidate logic. Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-29 20:22:32 +00:00
Jonathan Marek	786f9639d6	mesa/st: wire up DiscardFramebuffer Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-29 20:22:32 +00:00
Rob Clark	0c42b5f3cb	mesa: wire up InvalidateFramebuffer And before someone actually starts implementing DiscardFramebuffer() lets rework the interface to something that is actually usable. Signed-off-by: Rob Clark <robdclark@gmail.com> Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-29 20:22:32 +00:00
Jonathan Marek	e685566612	st/dri: invalidate_resource depth/stencil before flush_resource This allows freedreno to be aware of the depth invalidate when flushing batches on flush_resource. AFAIK, the only other driver which might care about this change is vc4, where I think it should help by allowing the depth invalidate to work with GALLIUM_HUD. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-29 20:22:32 +00:00
Mario Kleiner	820dfcea43	egl/wayland-drm: Only announce formats via wl_drm which the driver supports. Check if a pixel format is supported by the Wayland servers gpu driver before exposing it to the client via wl_drm, so we avoid reporting formats to the client which the server gpu can't handle. Restrict this reporting to the new color depth 30 formats for now, as the ARGB/XRGB8888 and RGB565 formats are probably supported by every gpu under the sun. Atm. this is mostly useful to allow proper PRIME renderoffload for depth 30 formats on the typical Intel iGPU + NVidia dGPU "NVidia Optimus" laptop combo. Tested on Intel, AMD, NVidia with single-gpu setup and on a Intel + NVidia Optimus setup. Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Daniel Stone <daniels@collabora.com>	2019-01-29 20:03:20 +00:00
Mario Kleiner	a34b0d68bb	egl/wayland: Allow client->server format conversion for PRIME offload. (v2) Support PRIME render offload between a Wayland server gpu and a Wayland client gpu with different channel ordering for their color formats, e.g., between Intel drivers which currently only support ARGB2101010 and XRGB2101010 import/display and nouveau which only supports ABGR2101010 rendering and display on nv-50 and later. In the wl_visuals table, we also store for each format an alternate sibling format which stores colors at the same precision, but with different channel ordering, e.g., ARGB2101010 <-> ABGR2101010. If a given client-gpu renderable format is not supported by the server for import, but the alternate format is supported by the server, expose the client-gpu renderable format as a valid EGLConfig to the client. At eglSwapBuffers time, during the blitImage() detiling blit from the client backbuffer to the linear buffer, the client format is converted to the server supported format. As we have to do a copy for PRIME anyway, this channel swizzling conversion comes essentially for free. Note that even if a server gpu in principle does support sampling from the clients native format, this conversion will be a performance advantage if it allows to convert to the servers preferred format for direct scanout, as the Wayland compositor may then be able to directly page-flip a fullscreen client wl_buffer onto the primary plane, or onto a hardware overlay plane, avoiding an extra data copy for desktop composition. Tested so far under Weston with: nouveau single-gpu, Intel single-gpu, AMD single-gpu, "Optimus" Intel server iGPU for display + NVidia client dGPU for rendering. v2: Implement minor review comments by Eric Engestrom: Add some comment and assert, and some style fixes for clarity. No functional change. Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Daniel Stone <daniels@collabora.com>	2019-01-29 20:03:20 +00:00
Jason Ekstrand	a920979d4f	intel/fs: Use split sends for surface writes on gen9+ Surface reads don't need them because they just have the one address payload. With surface writes, on the other hand, we can put the address and the data in the different halves and avoid building the payload all together. The decrease in register pressure and added freedom in register allocation resulting from this change reduces spilling enough to improve the performance of one customer benchmark by about 2x. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	014edff0d2	intel/fs: Add interference between SENDS sources Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	eab1c55590	intel/fs: Support SENDS in SHADER_OPCODE_SEND Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	cca199fd85	intel/disasm: Properly disassemble split sends Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	8babaa84e8	intel/eu: Add support for the SENDS[C] messages Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	d6a6e10390	intel/inst: Indent some code We're about to add some more if cases so let's have the giant re-indent in it's own patch to make review easier. Acked-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	d96969120d	intel/inst: Fix the ia16_addr_imm helpers These have clearly never seen any use.... On gen8, the bottom 4 bits are missing so we need to shift them off before we call set_bits and shift again when we get the bits. Found by inspection. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	e46fb33143	intel/disasm: Rework SEND decoding to use descriptors Instead of fetching the information out of the instruction directly, fetch the descriptor and then pluck the information out of the descriptor. The current scheme works ok for SEND but with SENDS, it all falls to pieces because the descriptor is completely shuffled around. This commit doesn't actually convert everything. One notable exception is URB messages which don't even use descriptors in emit_urb_WRITE yet. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	13a6fabc62	intel/eu: Add more message descriptor helpers We want to be able to extract data from descriptors as well as unify a bit of the descriptor construction. One of the unifications we do is to unify the read/write and dataport descriptors. On gen4-5, read/write are substantially different and the read descriptors change between gen4 and gen4.x. On gen6, they unified layouts between read, write, and dataport. Then, on gen8, they added one bit to the message type field but left it reserved MBZ for read/write messages. This commit chooses to treat that as if they expanded the field everywhere and just didn't have enough enum values for read/write to bother with the extra bit. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	c3aa436bfe	intel/eu/validate: SEND restrictions also apply to SENDC Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	fee6bd8d8e	intel/eu: Use GET_BITS in brw_inst_set_send_ex_desc It's a bit more readable Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	b284d222db	intel/fs: Use SHADER_OPCODE_SEND for varying UBO pulls on gen7+ Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	8514eba693	intel/fs: Use SHADER_OPCODE_SEND for texturing on gen7+ Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	f547cebbe0	intel/fs: Use a logical opcode for IMAGE_SIZE Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	d2d3e04501	intel/fs: Use SHADER_OPCODE_SEND for surface messages Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	7f1cf046cd	intel/fs: Add a generic SEND opcode Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	ba3c5300f9	intel/eu: Rework surface descriptor helpers This commit pulls the surface descriptor helpers out into brw_eu.h and makes them no longer depend on the codegen infrastructure. This should allow us to use them directly from the IR code instead of the generator. This change is unfortunately less mechanical than perhaps one would like but it should be fairly straightforward. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	5b17379631	intel/eu: Add has_simd4x2 bools to surface_write functions Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	2ce93b88c0	intel/fs: Take an explicit exec size in brw_surface_payload_size() Instead of magically falling back to SIMD8 for atomics and typed messages on Ivy Bridge, explicitly figure out the exec size and pass that into brw_surface_payload_size. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	cf42b0f9e2	intel/fs: Handle IMAGE_SIZE in size_read() and is_send_from_grf() Like all the other sends, it's just mlen * REG_SIZE. Fixes: `3cbc02e469` "intel: Use TXS for image_size when we have..." Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	009c0bd840	intel/defines: Explicitly cast to uint32_t in SET_FIELD and SET_BITS If you pass a bool in as the value to set, the C standard says that it gets converted to an int prior to shifting. If you try to set a bool to bit 31, this lands you in undefined behavior. It's better just to add the explicit cast and let the compiler delete it for us. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Jason Ekstrand	077b9557a4	intel/fs: Get rid of fs_inst::equals There are piles of fields that it doesn't check so using it is a lie. The only reason why it's not causing problem is because it has exactly one user which only uses it for MOV instructions (which aren't very interesting) and only on Sandy Bridge and earlier hardware. Just get rid of it and inline it in the one place that it's actually used. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-29 18:43:55 +00:00
Rob Clark	446a14bc0a	freedreno: minor cleanups Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-29 12:30:50 -05:00
Rob Clark	c3baa077bf	freedreno: stop frob'ing pipe_resource::nr_samples Previously we tried to normalize nr_samples to MAX2(1, nr_samples) to avoid having to deal with 0 vs 1 everywhere. But this causes problems in mesa/st, for example st_finalize_texture() will think there is a nr_samples mismatch and recreate the texture. Somehow this manifests as corrupt x11 font rendering on generations that do not support MSAA (but apparently works fine on a5xx and a6xx which do support MSAA.) Fixes: `cf0c7258ee` freedreno/a5xx: MSAA Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-29 12:30:50 -05:00
Rob Clark	1a6ddfe5ee	freedreno/a6xx: fix blitter nr_samples check nr_samples for non-MSAA case could be either zero or one. Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-29 12:22:08 -05:00
Rob Clark	9106a0fe33	freedreno/a5xx: fix blitter nr_samples check nr_samples for non-MSAA case could be either zero or one. Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-29 12:21:19 -05:00
Bas Nieuwenhuizen	69edc972fc	radv: Enable VK_EXT_memory_priority. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-29 15:56:56 +01:00
Bas Nieuwenhuizen	50fd253bd6	radv/winsys: Add priority handling during submit. Switched to the raw bo list api to avoid having to use 2 arrays for everything. This was introduced in libdrm 2.4.97 which we already depend upon. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-29 15:56:52 +01:00
Bas Nieuwenhuizen	ead54d4a42	radv/winsys: Set winsys bo priority on creation. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-29 15:56:41 +01:00
Samuel Pitoiset	3a8d6c0880	radv: re-enable fast depth clears for 16-bit surfaces on VI This has been disabled some months ago because it introduced rendering issues with Shadow Of Warrier II (DXVK). This game is no longer affected, I wonder if `824cfc1ee5` ("radv: rework the TC-compat HTILE hardware bug with COND_EXEC") fixed the problem. I checked The Forest on my Polaris, and it renders fine too. According to Phillip, this gives +5.5% with Rise Of The Tomb Raider and DXVK. This is because DXVK uses 16-bit depth surfaces while the native port from Feral uses 32-bit depth surfaces. Unfortunately, Shadow Of The Tomb Raider isn't affected because it clears each layer of a D16 array texture individually. So it doesn't hit the fast clear path. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-29 15:20:55 +01:00
Eric Anholt	932ed9c00b	vc4: Enable NEON asm on meson cross-builds. The core Mesa with_asm_arch and USE_ARM_ASM flags are disabled for meson cross-builds because of the need to run host binaries on the build system. vc4 doesn't need to do that, so skip with_asm_arch to enable NEON on my cross-builds. Fixes: `ebcb4c2156` ("meson: Enable VC4's NEON assembly support.")	2019-01-28 16:45:48 -08:00
Carsten Haitzler (Rasterman)	300d3ae8b1	vc4: Declare the cpu pointers as being modified in NEON asm. Otherwise, the compiler is free to reuse the register containing the input for another call and assume that the value hasn't been modified. Fixes crashes on texture upload/download with current gcc. We now have to have a temporary for the cpu2 value, since outputs must be lvalues. (commit message by anholt) Fixes: `4d30024238` ("vc4: Use NEON to speed up utile loads on Pi2.")	2019-01-28 16:45:45 -08:00
Carsten Haitzler (Rasterman)	522f688471	vc4: Use named parameters for the NEON inline asm. This makes the asm code more intelligible and clarifies the functional change in the next commit. (commit message and commit squashing by anholt)	2019-01-28 16:40:46 -08:00
Jonathan Marek	f6292c32cc	kmsro: Add freedreno renderonly support Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-28 18:25:27 -05:00
Jonathan Marek	7d458c0c69	freedreno: a2xx: add perfcntrs Based on a5xx perfcntrs implementation. Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-28 18:21:16 -05:00
Jonathan Marek	cccec0b457	freedreno: a2xx: minor solid_vertexbuf fixups The big thing here is the 0x60 offset for the mem2gmem copy which I missed in my last patch. Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-28 18:21:16 -05:00
Jonathan Marek	912a9c8d8c	freedreno: a2xx: clear fixes and fast clear path This fixes the depth/stencil clear on a20x, and adds a fast clear path. The fast clear path is only used for a20x, needs performance tests on a22x. Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-28 18:21:16 -05:00
Jonathan Marek	cb2322c7c0	freedreno: a2xx: a20x hw binning Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-28 18:21:16 -05:00
Jonathan Marek	501c6e70d4	freedreno: update a2xx registers Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-28 18:21:16 -05:00
Timothy Arceri	fb78a6cb72	glsl: use remap location when serialising uniform program resource data This allows us to avoid expensive string compares since we already have a map to the pointers. These compares were taking ~30 seconds for a single shader compile in Godot due to it using 64,000+ uniforms. Fixes: `c4cff5f402` ("glsl: add basic support for resource list to shader cache") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109229	2019-01-29 09:39:54 +11:00
Vinson Lee	be5b271ea7	meson: Fix typo. meson.build:166:21: ERROR: Unknown method "verson_compare" for a string. Fixes: `c1efa240c9` ("meson: Add warnings and errors when using ICC") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Cc: 18.3 <mesa-stable@lists.freedesktop.org>	2019-01-28 10:47:32 -08:00
Jonathan Marek	7c930d99ad	freedreno: a2xx: enable early-Z testing Enable earlyZ when alpha test is disabled. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-28 13:04:41 -05:00
Jonathan Marek	32b1d2d716	freedreno: a2xx: ir2 cleanup Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-28 13:04:41 -05:00
Rob Herring	41a0acd6a1	Switch imx to kmsro and remove the imx winsys The kmsro winsys is equivalent to the imx winsys, so we can switch to it and remove the imx one. Signed-off-by: Rob Herring <robh@kernel.org>	2019-01-28 11:50:08 -06:00
Rob Herring	827e0d6654	kmsro: Add etnaviv renderonly support Enable using etnaviv for KMS renderonly. This still needs KMS driver name mapping to kmsro to be used automatically. Acked-by: Eric Anholt <eric@anholt.net> Signed-off-by: Rob Herring <robh@kernel.org>	2019-01-28 11:45:43 -06:00
Eric Anholt	272b6cf58f	kmsro: Extend to include hx8357d. This allows vc4 to initialize on the Adafruit PiTFT 3.5" touchscreen with the hx8357d tinydrm driver v2: Whitespace fix noted by Eric Engestrom, update commit message for the driver being merged. v3: Rebase on Rob Herring's pipe-loader changes. Acked-by: Eric Engestrom <eric.engestrom@intel.com> (v1) Acked-by: Emil Velikov <emil.velikov@collabora.com> (v1)	2019-01-28 09:35:45 -08:00
Rob Herring	511e7b6f61	pipe-loader: Fallback to kmsro driver when no matching driver name found If we can't find a driver matching by name, then use the kmsro driver. This removes the need for needing a driver descriptor for every possible KMS driver. Signed-off-by: Rob Herring <robh@kernel.org> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-28 09:35:45 -08:00
Eric Anholt	ed65aeec78	pl111: Rename the pl111 driver to "kmsro". The vc4 driver can do prime sharing to many different KMS-only devices, such as the various tinydrm drivers for SPI-attached displays. Rename the driver away from "pl111" to represent what it will actually support: various sorts of KMS displays with the renderonly layer used to attach a GPU. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Acked-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-28 09:35:45 -08:00
Samuel Pitoiset	afeef3cacf	radv: set noalias/dereferenceable LLVM attributes based on param types Instead of using this useless array_params_mask variable. This should set these two attributes to streamout buffers too. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-28 16:30:38 +01:00
Samuel Pitoiset	320b058d32	radv: simplify allocating user SGPRS for descriptor sets Unnecesary to check the current stages if desc_set_used_mask is used. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-28 16:30:36 +01:00
Samuel Pitoiset	d1994ed229	radv: remove radv_userdata_info::indirect field Always false. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-28 16:30:33 +01:00
Gert Wollny	212c0c630a	mesa/main: Expose EXT_sRGB_write_control Use EXT_framebuffer_sRGB to expose EXT_sRGB_write_control on GLES. Remove the checks for desktion GL in the enable calls, since EXT_framebuffer_sRGB now also indicates support for switching the linear-sRGB color space conversion on GLES. Thanks to Ilia Mirkin for all the helpful discussions that helped to rework this series. v2: Fix alphabetical listing of extensions (Tapani Pälli) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1)	2019-01-28 12:18:40 +01:00
Gert Wollny	1013dfece1	mesa/main/version: Lower the requirements for GLES 3.0 GLES 3.0 does not actually require support for EXT_framebuffer_sRGB, it only needs support for sRGB attachments to framebuffers and framebuffer objects as defined in ARB_framebuffer_objects. v2: Clarify that ARB_framebuffer_objects is needed. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-28 12:18:40 +01:00
Gert Wollny	76c3f6fb3f	mesa/main: Use flag for EXT_sRGB instead of EXT_framebuffer_sRGB where possible All drivers that support EXT_framebuffer_sRGB also support EXT_sRGB, but in order to keep this commit minial, and not to break any drivers both flags are checked. v2: - Use only EXT_sRGB (Ilia Mirkin) - Move adding the flag EXT_sRGB to gl_extensions to a separate patch v3: use _mesa_has_EXT_framebuffer_sRGB instead of extension flag The _mesa_has function also checks for the correct versions and should be preferred over using the flags directly (Erik) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-28 12:18:40 +01:00
Gert Wollny	8f9dfb7d88	mesa/st: rework support for sRGB framebuffer attachements For GLES sRGB framebuffer attachemnt support is provided in two steps: sRGB attachments like described in EXT_sRGB (and GLES 3.0) that enable linear to sRGB color space transformation automatically, and the ability to switch formats of the render target surface between sRGB and linear that introduces full support for EXT_framebuffer_sRGB. Set the according flags to reflect these two levels of sRGB support. As a difference between desktopm GL and GLES, on desktop GL for a sRGB framebuffer attachment the linear-sRGB conversion is turned off by default, and for GLES it is turned on. This needs to be taken into account when initally creating a surface, i.e. on desktop GL creation of a sRGB surface is preferred, but on GLES sRGB surfaces are only created when explicitely requested. v2: - Use the new CAPS name Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-28 12:18:40 +01:00
Gert Wollny	385081cd17	i965: Set flag for EXT_sRGB Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: <Gurchetan Singh gurchetansingh@chromium.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-28 12:18:40 +01:00
Gert Wollny	7577c82fed	mesa:main: Add flag for EXT_sRGB to gl_extensions EXT_sRGB is an (incomplete) GLES extension that provides support for sRGB framebuffer attachments, hence it can be used to check for this support as an alternative to EXT_framebuffer_sRGB that provies the same functionality but also sRGB write control support. However, since EXT_sRGB is incomplete and superseted by GLES 3.0 it will not be exposed as an extension. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-28 12:18:40 +01:00
Gert Wollny	2845939d6a	virgl: Set sRGB write control CAP based on host capabilities v2: - Use the renamed CAPS - add assetions to make sure that mesa doesn't try to switch destination surface formats when it is not supported. (Ilia Mirkin) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2019-01-28 12:18:40 +01:00
Gert Wollny	8021f1875e	Gallium: Add new CAPS to indicate whether a driver can switch SRGB write Add a new cap that indicates whether the drivers supports enabling/disabling the conversion from linear space to sRGB for a framebuffer attachment. In Driver terms that this CAP indicates whether the driver can switcht between a linear and and a sRGB surface format for draw destinations witout changing the sourface itself. v2: rename CAP to DEST_SURFACE_SRGB_CONTROL to reflect its purpouse better (pointed out by Ilia Mirkin) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-28 12:18:40 +01:00
Neil Roberts	75b3719c4f	spirv: Don't use special semantics when counting vertex attribute size Under Vulkan, the double vertex attributes take up the same size regardless of whether they are vertex inputs or any other stage interface. Under OpenGL (ARB_gl_spirv), from GLSL 4.60 spec, section 4.3.9 Interface Blocks: "It is a compile-time error to have an input block in a vertex shader or an output block in a fragment shader. These uses are reserved for future use." So we also don't need to check if it is an vertex input or not, and use false in any case. v2: (changes made by Alejandro Piñeiro) * Update required after "spirv: Handle location decorations on block interface members" own updates (original patch was sent several months ago) * After Neil suggesting it, confirm that this change can be also done for OpenGL (ARB_gl_spirv). Expand commit message. v3: update after changing name of main method on a previous patch Signed-off-by: Neil Roberts <nroberts@igalia.com> Signed-off-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-28 11:42:46 +01:00
Neil Roberts	5c797f7354	glsl_types: Rename parameter of glsl_count_attribute_slots glsl_count_attribute_slots takes a parameter to specify whether the type is being used as a vertex input because on GL double attributes only take up one slot. Vulkan doesn’t make this distinction so this patch renames the argument to is_gl_vertex_input in order to make it more clear that it should always be false on Vulkan. v2: minor variable renaming (s/member/member_type) (Tapani) Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-28 11:42:46 +01:00
Neil Roberts	dfc3a7cb3c	spirv/nir: handle location decorations on block interface members Previously the code was taking any location decoration on the block and using that to calculate the member locations for all of the members. I think this was assuming that there would only be one location decoration for the entire block. According to the Vulkan spec it is possible to add location decorations to individual members: “If the structure type is a Block but without a Location, then each of its members must have a Location decoration. If it is a Block with a Location decoration, then its members are assigned consecutive locations in declaration order, starting from the first member which is initially the Block. Any member with its own Location decoration is assigned that location. Each remaining member is assigned the location after the immediately preceding member in declaration order.” This patch makes it instead keep track of which members have been assigned an explicit location. It also has a space to store the location for the struct as a whole. Once all the decorations have been processed it iterates over each member to fill in the missing locations using the rules described above. So, this commit is needed to get working a case like this, on both Vulkan and OpenGL using SPIR-V (ARB_gl_spirv): out block { layout(location = 2) vec4 c; layout(location = 3) vec4 d; layout(location = 0) vec4 a; layout(location = 1) vec4 b; } name; v2: (changes made by Alejandro Piñeiro) * Update after introducing struct member splitting (See commit `b0c643d`) * Update after only exposing interface_type for blocks, not to any struct * Update after last changes done for xfb support v3: use "assign" instead of "add" on the new method added (Tapani) Signed-off-by: Neil Roberts <nroberts@igalia.com> Signed-off-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-28 11:42:46 +01:00
Christian Gmeiner	34458c1cf6	etnaviv: add linear sampling support Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de>	2019-01-28 07:36:12 +01:00
Christian Gmeiner	42ca4dda2d	etnaviv: update headers from rnndb Update to etna_viv commit 4d2f857. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de>	2019-01-28 07:36:09 +01:00
Christian Gmeiner	5b4a155d2b	etnaviv: extend etna_resource with an addressing mode Defines how sampler (and pixel pipes) needs to access the data represented with a resource. The used default is mode is ETNA_ADDRESSING_MODE_TILED. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de>	2019-01-28 07:36:05 +01:00
Ilia Mirkin	d1d2bb8c07	nvc0: don't put text segment into bufctx The text segment is shared among multiple contexts, while each one has its own bufctx. So when reallocating the text segment, some contexts may end up with stale values in their bufctx's. Instead limit the exposure to the bufctx to within a single draw. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-01-27 21:47:09 -05:00
Timothy Arceri	0907ae35ad	radv/ac: fix some fp16 handling Fixes: `b722b29f10` ("radv: add support for 16bit input/output") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-28 10:41:48 +11:00
Eric Anholt	c496b60ed8	v3d: Create separate sampler states for the various blend formats. The sampler border color is encoded in the TMU's blending format (half floats, 32-bit floats, or integers) and must be clamped to the format's range unorm/snorm/int ranges by the driver. Additionally, the TMU doesn't know about how we're abusing the swizzle to support BGRA, A, and LA, so we have to pre-swizzle the border color for those. We don't really want to spend half a kb on sampler states in most cases, so skip generating the variants when the border color is unused or is 0,0,0,0.	2019-01-27 08:30:03 -08:00
Eric Anholt	5fe4250a2c	v3d: Move the sampler state to the long-lived state uploader. Samplers are small (8-24 bytes), so allocating 4k for them is a huge waste.	2019-01-27 08:30:03 -08:00
Eric Anholt	09472006ff	v3d: Use the symbolic names for wrap modes from the XML.	2019-01-27 08:30:03 -08:00
Eric Anholt	c51d125d18	v3d: Fix stencil sampling from a separate-stencil buffer. When the sampler view is in sample-stencil mode, we need to return uint stencil values. To do that, fill in the format table to return R8I, and have the sampler view point at the separate stencil buffer. Fixes dEQP-GLES31.functional.stencil_texturing.format.depth32f_stencil8_2d	2019-01-27 08:30:03 -08:00
Eric Anholt	8a0b0a8f37	v3d: Fix stencil sampling from packed depth/stencil. We need to pick the 8-bit unorm value out, not the depth component.	2019-01-27 08:30:03 -08:00
Eric Anholt	fcdbd441a2	v3d: Fix release-build warning about utile_h.	2019-01-27 08:30:03 -08:00
Eric Anholt	edb1fcd963	v3d: Flush blit jobs immediately after generating them. Fixes OOMs in the CTS's packed_pixels.varied_rectangle.* tests -- the series of texture uploads at the start before texturing occurred would end up all sitting around as cached jobs for reuse. By flushing immediately, peak active BO usage goes from 150M to 40M. We could maybe put some limits on how many jobs we keep around, but blits seem particularly unlikely to get reused for other drawing.	2019-01-27 08:30:03 -08:00
Eric Anholt	ac333ffa59	v3d: Fix BO stats accounting for imported buffers.	2019-01-27 08:30:03 -08:00
Eric Anholt	060575bea8	v3d: Drop maximum number of texture units down to 16. This is the GLES 3.2 minmax, and also what the closed source driver does. Avoids hitting OOMs in the CTS's dEQP-GLES3.functional.texture.units.all_units.only_cube.1.	2019-01-27 08:30:03 -08:00
Eric Anholt	3e743d8cd8	v3d: Avoid duplicating limits defines between gallium and v3d core. We don't want to pull the compiler into every include in the gallium driver, so just make a new little header to store the limits.	2019-01-27 08:30:03 -08:00
Eric Anholt	fe6a21c867	v3d: Fix overly-large vattr_sizes structs. We want one vector size per vector, not per component.	2019-01-27 08:30:03 -08:00
Eric Anholt	533b3f0541	v3d: Rename gallium-local limits defines from VC5 to V3D. The compiler has its limits under V3D_* (like most V3D stuff), so sync up with that.	2019-01-27 08:30:03 -08:00
Bas Nieuwenhuizen	b4870a15ae	radv: Remove unused variable. Trivial.	2019-01-27 13:51:35 +01:00
Niklas Haas	804cc44d09	radv: add device->instance extension dependencies From the vulkan spec 33.3 "Extension Dependencies": "Any device extension that has an instance extension dependency that is not enabled by vkCreateInstance is considered to be unsupported, hence it must not be returned by vkEnumerateDeviceExtensionProperties for any VkPhysicalDevice child of the instance." Therefore we need to check whether the instance-level extensions are actually enabled when deciding to support a device-level extension or not. Furthermore, we need to do this for all instance-level extensions of any (transitive) device-level extension dependency, due to the following paragraph: "If an extension is supported (as queried by vkEnumerateInstanceExtensionProperties or vkEnumerateDeviceExtensionProperties), then required extensions of that extension must also be supported for the same instance or physical device." Finally, because some of these vulkan extensions may be implicitly promoted to future vulkan core API versions, we can also satisfy the dependency if the vulkan API version is high enough. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-27 13:50:35 +01:00
Niklas Haas	d12dc39396	radv: correctly use vulkan 1.0 by default From the vulkan spec 3.2 "Instances": "Providing a NULL VkInstanceCreateInfo::pApplicationInfo or providing an apiVersion of 0 is equivalent to providing an apiVersion of VK_MAKE_VERSION(1,0,0)." Fixes: `ffa15861ef` "radv: UseEnumerateInstanceVersion for the default version." Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-27 12:49:28 +01:00
Niklas Haas	d9bd3b1cb8	glsl: fix block member alignment validation for vec3 Section 7.6.2.2 (Standard Uniform Block Layout) of the GL spec says: The base offset of the first member of a structure is taken from the aligned offset of the structure itself. The base offset of all other structure members is derived by taking the offset of the last basic machine unit consumed by the previous member and adding one. The current code does not reflect this last sentence - it effectively instead aligns up the next offset up to the alignment of the previous member. This causes an issue in exactly one case: layout(std140) uniform block { layout(offset=0) vec3 var1; layout(offset=12) float var2; }; As per section 7.6.2.1 (Uniform Buffer Object Storage) and elsewhere, a vec3 consumes 3 floats, i.e. 12 basic machine units. Therefore, `var1` in the example above consumes units 0-11, with 12 being the first available offset afterwards. However, before this commit, mesa incorrectly assumes `var2` must start at offset=16 when using explicit offsets, which results in a compile-time error. Without explicit offsets, the shaders actually work fine, indicating that mesa is already correctly aligning these fields internally. (Just not in the code that handles explicit buffer offset parsing) This patch should fix piglit tests: ssbo-explicit-offset-vec3.vert ubo-explicit-offset-vec3.vert Signed-off-by: Niklas Haas <git@haasn.xyz> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-01-27 03:00:03 -05:00
Jason Ekstrand	86e5f76d3d	spirv: Add support for SPV_EXT_physical_storage_buffer Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-26 13:41:50 -06:00
Jason Ekstrand	fb282a68bc	spirv: Implement OpConvertPtrToU and OpConvertUToPtr This only implements the actual opcodes and does not implement support for using them with specialization constants. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-26 13:41:50 -06:00
Jason Ekstrand	837ed2ba51	spirv: Handle OpTypeForwardPointer We handle forward declarations by creating the pointer type with it's storage type based on storage class and just waiting to fill out the actual deref type until we get the OpTypePointer. Because any composites using the forward declared type only care about the storage type (i.e. uint64_t, uvec2, etc.) when creating their glsl_type, this works fine and we can defer the actual deref_type as far as we need. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2019-01-26 13:41:50 -06:00
Jason Ekstrand	4602e705e4	spirv: Drop a bogus assert This was valid back when the only valid types of pointers were uint32 and uvec2. Now that we're allowing more variety, it could be just about anything so we'll just drop the assert. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2019-01-26 13:41:50 -06:00
Jason Ekstrand	9e34781aef	nir: Allow SSBOs and global to alias Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-26 13:41:50 -06:00
Jason Ekstrand	9839ce8bf9	nir/validate: Allow array derefs of vectors for nir_var_mem_global Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2019-01-26 13:39:18 -06:00
Jason Ekstrand	5f5503d498	nir/lower_io: Add support for nir_var_mem_global Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2019-01-26 13:39:18 -06:00
Jason Ekstrand	314d2c90c3	nir/lower_io: Add a 32 and 64-bit global address formats These are simple scalar addresses. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-26 13:39:18 -06:00
Jason Ekstrand	e461926ef2	nir: Add load/store/atomic global intrinsics These correspond roughly to reading/writing OpenCL global pointers. The idea is that they just take a bare address and load/store from it. Of course, exactly what this address means is driver-dependent. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2019-01-26 13:39:18 -06:00
Axel Davy	6380fedb60	st/nine: Enable debug info if NDEBUG is not set We want to have debug info as well if using meson's debugoptimized when ndebug is off. v2: use u_debug functions that do something even if DEBUG is not set. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2019-01-26 19:53:19 +01:00
Axel Davy	d7433c22e6	st/nine: Immediately upload user provided textures Fixes regression caused by `42d672fa6a` st/nine: Bind src not dst in nine_context_box_upload Before that patch, for user provided textures, when the texture was destroyed, the safety check for pending uploads, which according to the code "Following condition cannot happen currently", was flushing the queue and thus triggering the upload. After the patch, the texture destruction was delayed after the upload. However the user frees the texture buffer, as it thinks the texture released. Instead of reverting the faulty patch, this patch instead flushes the csmt queue right away after queuing the upload for this type of textures. This is more future-proof, as we may want to bind the surface for other reasons in the future. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Cc: 18.3 <mesa-stable@lists.freedesktop.org>	2019-01-26 19:53:00 +01:00
Matt Turner	a7d629a590	i965: Always compile fp64 funcs when needed Compilation of user-specified shaders with software fp64 works by compiling on demand an "fp64-funcs" shader implementing various fp64 operations and then linking it into the "user shader". In commit `64b8c86d37` Author: Timothy Arceri <tarceri@itsqueeze.com> Date: Thu Jan 17 17:16:29 2019 +1100 glsl: be much more aggressive when skipping shader compilation we changed the behavior of the shader cache to skip compilation earlier when we get a cache hit. After the aforementioned commit, compiling a user program using fp64 would store into the cache an entry for the fp64-funcs shader. Subsequent compilations of uncached user shaders using fp64 would fail in compile_fp64_funcs() after finding a cache entry for the fp64-funcs, but being unprepared to read from the cache. It's unclear to me how to retrieve the cached NIR of the fp64-funcs (if it even is cached), so just call _mesa_glsl_compile_shader() with force_recompile=true in order to ensure we generate the fp64-funcs successfully. Tested-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-26 10:33:22 -08:00
Matt Turner	18b467c066	intel/compiler: Add a file-level description of brw_eu_validate.c Acked-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2019-01-26 10:33:22 -08:00
Jonathan Marek	41ddf1d150	freedreno: add renderonly scanout This allows creating a fd_screen with a renderonly object which will be used to allocated scanout resources. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> [slight tweak to fix uninitialized 'prsc' in debug print] Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-26 10:47:21 -05:00
Rob Clark	cd79b5e0c2	freedreno/a2xx: fix unused variable warning Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-26 10:44:31 -05:00
Timothy Arceri	8e9ad592c3	tgsi: remove culldist semantic from docs The semantic was removed in `e6d9389366`. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-26 12:04:53 +11:00
Timothy Arceri	5d66f7103f	ac/nir_to_llvm: fix clamp shadow reference for more hardware Fixes the following piglit test on my VEGA and matches the behaviour in the tgsi backend. tests/spec/glsl-1.10/execution/samplers/glsl-fs-shadow2D-clamp-z.shader_test Fixes: `625dcbbc45` ("amd/common: pass address components individually to ac_build_image_intrinsic") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-26 12:03:24 +11:00
Eric Anholt	08f4a904b3	gallium: Make sure we return is_unorm/is_snorm for compressed formats. The util helpers were looking for a non-void channels in a non-mixed format and returning its snorm/unorm state. However, compressed formats don't have non-void channels, so they always returned false. V3D wants to use util_format_is_[su]norm for its border color clamping workarounds, so fix the functions to return the right answer for these. This now means that we ignore .is_mixed. I could retain the is_mixed check, but it doesn't seem like a useful feature -- the only code I could find that might care is freedreno's blit, which has some notes about how things are wonky in this area anyway. Reviewed-by: <Roland Scheidegger sroland@vmware.com>	2019-01-25 13:06:50 -08:00
Eric Anholt	104c7883e7	gallium: Fix comment about possible colorspaces. Two typos, and missing one of the colorspaces. Reviewed-by: <Roland Scheidegger sroland@vmware.com>	2019-01-25 13:06:47 -08:00
Eric Anholt	54abd2e084	gallium: Enable unit tests as actual meson unit tests. These tests don't need swrast, so we can always enable them when build_tests is set. Most of them run to successful completion quickly (.9s on my SKL). Reviewed-by: <Roland Scheidegger sroland@vmware.com>	2019-01-25 13:06:45 -08:00
Emil Velikov	3b6aaab7e9	mapi: print function declarations for shared glapi Earlier commit aimed to remove unneeded function declarations. Namely OpenGL entrypoints which are not applicable for OpenGLES* Although it did not consider the shared glapi which needs all, including hidden ones. Resulting in warning/errors like the following ../build/src/mapi/shared-glapi/glapi_mapi_tmp.h:26014:15: error: no previous prototype for ‘shared_dispatch_stub_1414’ [-Werror=missing-prototypes] This patch addressed that. Cc: Erik Faye-Lund <erik.faye-lund@collabora.com> Reported-by: Eric Anholt <eric@anholt.net> Fixes: `6148cce388` ("mapi: drop unneeded gl_dispatch_stub declarations") Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Acked-by: Eric Anholt <eric@anholt.net>	2019-01-25 13:04:04 -08:00
Rob Clark	4aa64940c6	freedreno: limit tiling to PIPE_BIND_SAMPLER_VIEW `1ce5d757d0` dropped this limit.. which is probably the right thing to do. But it results in an extra tiled->linear blit for glReadPixels() (ie. dEQP/piglit) which is hitting some intermittent corruption (looks like cache) on a6xx, causing a lot of spurious fails. Since we are getting close to 19.0 branchpoint, re-instate this limit for now, until the blitter problems are resolved. Fixes: `1ce5d757d0` freedreno: core buffer modifier support Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-25 10:20:05 -05:00
Samuel Pitoiset	378e2d2414	radv: fix computing number of user SGPRs for streamout buffers Streamout buffers are emitted like push constants. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-25 15:36:16 +01:00
Jose Fonseca	65b8d723fd	appveyor: Revert commits adding Cygwin support. This reverts commits `00ad77b9f6` and `5334dafee2`. This avoids Appveyor build breakage due to Cygwin, but more importantly, there are several problems with these patches, as highlighted to my recent mesa-dev mail. So better to revert for now, and pursue Cygwin support after these have been address.	2019-01-25 14:13:26 +00:00
Tapani Pälli	540939ecee	android: fix build issues with libmesa_anv_gen* libraries We need this include path to find nir/nir_xfb_info.h. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-01-25 15:21:06 +02:00
Andrii Simiklit	4759bb2fcf	intel/batch-decoder: fix a vb end address calculation According to the loop implementation (in 'ctx_print_buffer' function), which advances dword by dword over vertex buffer(vb), the vb size should be aligned by 4 bytes too. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109449 Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-25 15:12:30 +02:00
Andrii Simiklit	db39a44f10	intel/batch-decoder: fix vertex buffer size calculation for gen<8 It should be incremented by one according to how it is calculated by 'emit_vertex_buffer_state': "\#if GEN_GEN < 8 .BufferAccessType = step_rate ? INSTANCEDATA : VERTEXDATA, .InstanceDataStepRate = step_rate, \#if GEN_GEN >= 5 .EndAddress = ro_bo(bo, end_offset - 1), \#endif \#endif" Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109449 Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-25 15:12:07 +02:00
Eric Engestrom	69e9440367	meson/vdpau: add missing soversion This mirrors what autotools does in src/gallium/state_trackers/vdpau/Makefile.am and src/gallium/targets/vdpau/Makefile.am: VDPAU_MAJOR = 1 VDPAU_MINOR = 0 libvdpau_gallium_la_LDFLAGS = -version-number $(VDPAU_MAJOR):$(VDPAU_MINOR) Reported-by: Igor Gnatenko <i.gnatenko.brain@gmail.com> Fixes: `68076b8747` "meson: build gallium vdpau state tracker" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-01-25 12:10:00 +00:00
Eric Engestrom	9af77fcf98	anv: drop always-successful VkResult Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-25 09:45:27 +00:00
Rafael Antognolli	f2ece26601	anv/allocator: Avoid race condition in anv_block_pool_map. Accessing bo->map and then pool->center_bo_offset without a lock is racy. One way of avoiding such race condition is to store the bo->map + center_bo_offset into pool->map at the time the block pool is growing, which happens within a lock. v2: Only set pool->map if not using softpin (Jason). v3: Move things around and only update center_bo_offset if not using softpin too (Jason). Cc: Jason Ekstrand <jason@jlekstrand.net> Reported-by: Ian Romanick <idr@freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109442 Fixes: `fc3f588320` Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-24 17:39:40 -08:00
Dylan Baker	c1efa240c9	meson: Add warnings and errors when using ICC ICC tries to be helpful by not erroring when it sees something that it doesn't understand, which is completely the opposite of helpful. Meson 0.49.0 does much better at handling this by really trying to make ICC error, but there are some things in mesa that still get ignored until 0.49.1 v2: - Fix id check, which is 'intel' not 'icc' Cc: 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (v1)	2019-01-24 19:14:50 +00:00
Dylan Baker	7cb7f35bc7	meson: Fix compiler checks for SWR with ICC This is a bit fragile, as the way this "fixes" the check is to move the one that we know is correct before the one that is incorrectly reported as working. In meson 0.49.1 (which isn't out yet) this is fixed that the incorrect check is reported as a failure. Fixes: `e0b037d697` ("meson: Build SWR driver") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109129 Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2019-01-24 19:14:50 +00:00
Dylan Baker	3ba7ab8d2c	meson: fix swr KNL build There's a typo in one of the #defines that breaks compilation. Fixes: `e0b037d697` ("meson: Build SWR driver") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109023 Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-01-24 19:14:50 +00:00
Matt Turner	70a7ece035	gallivm: Return true from arch_rounding_available() if NEON is available LLVM uses the single instruction "FRINTI" to implement llvm.nearbyint. Fixes the rounding tests of lp_test_arit. Bug: https://bugs.gentoo.org/665570 Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2019-01-24 11:07:24 -08:00
Matt Turner	385ee7c3d0	gallium: Enable ASIMD/NEON on aarch64. NEON (now called ASIMD) is available on all aarch64 CPUs. Our code was missing an aarch64 path, leading to util_cpu_caps.has_neon always being false on aarch64. Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-24 11:07:24 -08:00
Dave Airlie	1f6b92b476	gallium: use put image shm2 path (v2) This fixes the drisw paths to use the new shm2 interface, so that we don't trigger the X server overflow checks when the x offset is non-zero. This just hides the versioning in drisw, and either passes the src_x or adds the offset fixup for the fallback path. Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-01-25 04:27:45 +10:00
Dave Airlie	00af91ca46	glx: add support for putimageshm2 path (v2) v2: pass x,0 in as the offset coords at glx level not earlier Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-01-25 04:27:45 +10:00
Dave Airlie	db83a2b40f	dri_interface: add put shm image2 (v2) This adds a new interface to the swrast interface to fix an shm put image bug. The current code adds the x,y src offsets into the offset parameters, however if the x offset is > 0, and the put image copies up to the height of the image, this can trigger an X server validation check to fail and the renderering to get BadMatch. This patch fixes it to pass the x offset coord in as a src x. We cannot pass the Y coordinate due to the horrible code mangling the image w/h vs stride in swrastXPutImage. v2: drop srcx,y from api Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-01-25 04:27:45 +10:00
Emil Velikov	281421e1bc	mapi: remove machinery handling CSV files We haven't have one in years, so just drop the code. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	8a0012692a	mapi: remove old, unused ES* generator code As of earlier commit, everyone has switched to the new script for the ES dispatch. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	a41214ca3e	mapi/es2api: remove no longer present entrypoints With the previous scripts API from the following was incorrectly exported. Drop them from the list, since they're no longer around. GL_EXT_blend_func_extended GL_EXT_texture_integer Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	05f8558b27	mapi/es*api: remove GL_EXT_multi_draw_arrays entrypoints Now we use the upstream XML file and a cleaner generator. Thus the symbols are no longer exported and we can drop them from this list. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	5661ce6c64	mapi/es*api: remove GL_OES_EGL_image entrypoints As some point in the past we fixed the scripts so, these are no longer exported. Drop them from the list. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	9f86f1da7c	Revert "mapi/new: sort by slot number" This reverts commit a1f5d9412cf7cacb3534635f6c2409fafbe6574e. We no longer needed to sort - it was meant only to ease compare against the old generated files.	2019-01-24 18:13:25 +00:00
Emil Velikov	3bf08292d2	scons: wire the new generator for es1 and es2 Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	0842bc879b	meson: wire the new generator for es1 and es2 v2: use ${foo})_py naming (Dylan) v3: use symbolic name for genCommon.py Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (v2)	2019-01-24 18:13:25 +00:00
Emil Velikov	656845301d	autotools: wire the new generator for es1 and es2 The output produced functionally identical, with the following changes: - A cosmetic: swapped ABI compatible types [ GLclampf -> GLfloat, etc ] - B cosmetic: renamed parameters [ zNear -> n, etc ] - C dropped extension entrypoints - invalid/incorrect To make things easier to validate, normalise both old/new headers run the sed patterns A, B and C to both sets. A s/\<GLclampf\>/GLfloat/g; s/\<GLclampx\>/GLfixed/g; s/\<GLvoid\>/void/g; B s/\ \* / */g; s/\<texture\>/target/g; s/\<plane\>/p/g; s/\<depth\>/d/g; s/\<modeAlpha\>/modeA/g; s/\<shader\>/program/g; s/\<obj\>/shaders/g; s/\<equation\>/eqn/g; s/\<param\>/data/g; s/\<params\>/data/g; s/\<buffers\>/buffer/g; s/\<src\>/mode/g; s/\<count\>/n/g; s/\<zNear\>/n/g; s/\<zFar\>/f/g; s/\<zfail\>/dpfail/g; s/\<zpass\>/dppass/g; s/\<buf\>/index/g; s/\<value\>/target/g; s/\<cap\>/target/g; s/\<maskNumber\>/index/g; s/\<srcRGB\>/sfactorRGB/g; s/\<dstRGB\>/dfactorRGB/g; s/\<srcAlpha\>/sfactorAlpha/g; s/\<dstAlpha\>/dfactorAlpha/g; s/\<primitiveMode\>/mode/g; s/\<primcount\>/instancecount/g; s/\<top\>/t/g; s/\<bottom\>/b/g; s/\<left\>/l/g; s/\<right\>/r/g; s/\<x\>/v0/g; s/\<y\>/v1/g; s/\<z\>/v2/g; s/\<w\>/v3/g; s/\<sfactor\>/mode/g; s/\<dfactor\>/dst/g; s/\<attribindex\>/bindingindex/g; s/\<internalFormat\>/internalformat/g; s/\<bufSize\>/bufsize/g; C glMultiDrawArraysEXT glMultiDrawElementsEXT glBindFragDataLocationEXT glGetTexParameterIivEXT glGetTexParameterIuivEXT glTexParameterIivEXT glTexParameterIuivEXT v2: - gl_dispatch_stub declarations are addressed with previous patch - the public_entries table is no longer generated Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	389bc2bc6e	mapi/new: remove duplicate GLvoid/void substitution We already do it a few lines above - drop the duplicate. Note that for consistency sake, we keep the substitution since the GL API is a mixed bad - some use GLvoid while others a normal void. We might want to merge this back in GLVND. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	5fa6c34949	mapi/new: fixup the GLDEBUGPROCKHR typedef to the non KHR one This way we can reuse the latter, which is already present in the headers that we use. Thus we can drop the manual typedef we generate. We might want to merge this back in GLVND. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	babec55f7e	mapi/new: don't print info we don't need for ES1/ES2 There is no need for the noop functions, the public_stubs and public_entries table or table size defines. Remove those. Pretty much all of this is applicable to GLVND, although it requires preparatory work. v2: - python style fixes (Dylan) - use "gldispatch" instead of not "glesv1" "glesv2" - remove the public_entries table/array (Erik) v3: - use if == "gldispatch", instead of "in" (Kyle) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (v2)	2019-01-24 18:13:25 +00:00
Emil Velikov	5b1bdce156	mapi/new: split out public_entries handling The only instance that requires the public_entries table is the dispatch library - split that into another function. We have to be careful with when undefining the guard, so split it out. We might want to merge this back in GLVND. Minor GLVND cleanup will be needed first. Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	313f977224	mapi/new: reinstate _NO_HIDDEN suffixes in the new generator Strictly speaking we can rework the rest of the code so we do not need those. That said, this will require a series on it's own so let's carry this local quirk for now. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	451805f810	mapi/new: use the static_data offsets in the new generator Otherwise the incorrect ones will be used, effectively breaking the ABI. Note: some entries in static_data.py list a suffixed API, while (for ES* at least) we expect the one w/o suffix. v2: - rework path handling (Dylan) - use else if chain (Erik) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	bba375c016	mapi/new: sort by slot number Makes it easier to compare the newly generated header against the old one. Will be reverted after the transition.	2019-01-24 18:13:25 +00:00
Emil Velikov	06eb3fe371	mapi/new: import mapi scripts from glvnd Currently we have over 20 scripts that generate the libGL* dispatch and various other functionality. More importantly we're using local XML files instead of the Khronos provides one(s). Resulting in an increasing complexity of writing, maintaining and bugfixing. One fairly annoying bug is handling of statically exported symbols. Today, if we enable a GL extension for GLES1/2, we add a special tag to the xml. Thus the ES dispatch gets generated, but also since we have no separate notion of GL/ES1/ES2 static functions it also gets exported statically. This commit adds step one towards clearing and simplifying our setup. It imports the mapi generator from GLVND. 012fe39 ("Remove a couple of duplicate typedefs.") v2: use local genCommon.py Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	cd0f11bac5	mapi: move genCommon.py to src/mapi/new The helper will also be used by the new Khronos gl.xml aware generator. v2: Move existing one, instead of duplicating it. v3: Correct genCommon.py references in meson [Erik] v4: Drop the file from the EGL EXTRA_DIST [Erik] Suggested-by: Kyle Brenneman <kbrenneman@nvidia.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	a08a793180	genCommon.py: Fix typo in _LIBRARY_FEATURE_NAMES. Port glvnd commit 37fc6caa4b8 ("Fix typo in _LIBRARY_FEATURE_NAMES.") from Michal Srb. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	cf317bf093	mapi: add all _glapi_table entrypoints to static_data.py Currently various parts of mesa use the glapi_table differently. Some use _glapi_get_proc_offset() to get the offset, while others directly reference the specific offset via _gloffset_Function. Add all static entries, to ensure things don't break as we flip to the upstream XML + new mapi generator. Note: the offsets are also used for the alias remap table, thus we need to ensure we honour the correct offsets range or it will break. Currently this is done via MAX_OFFSETS constant, although a better solution is in the works. v2: add FramebufferTexture2DMultisampleEXT v3: add MAX_OFFSETS guard Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (v1) Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	fe9f5c0e21	mapi: sort static entrypoints numerically A few of the entrypoints were incorrectly placed. Sort those to align with the rest of the list. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Emil Velikov	5a81e8d40e	Revert "mesa/main: remove ARB suffix from glGetnTexImage" This reverts commit `f1998e15ff`. This changes the ABI, such that glGetnTexImageARB entry-point from the GLAPI gets removed. Thus accessing many functions by offset (as we do) will result in getting the wrong one. Follow-up work will swap the by-offset handling, but for now revert this patch. Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:25 +00:00
Erik Faye-Lund	6148cce388	mapi: drop unneeded gl_dispatch_stub declarations These declarations are not used anywhere - be that generated code or otherwise. [Emil: format the hunk from Erik into a patch] Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-24 18:13:24 +00:00
Emil Velikov	ca152234e1	mesa: correctly use os.path.join in our python scripts With Windows in mind, using forward slash isn't the right thing to do. Even if it just works, we might want to fix it. As here, use __file__ instead of argv[0] and sys.path.insert over sys.path.append. With the path tweak being reportedly faster. Suggested-by: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-24 18:13:24 +00:00
Emil Velikov	9cc8e12505	freedreno: automake: ship ir3_nir_trig.py in the tarball Fixes: `aa0fed10d3` ("freedreno: move ir3 to common location") Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-24 18:13:24 +00:00
Eric Engestrom	8ed966b506	egl/glvnd: sync egl.xml from Khronos Fixes: `98984b7cdd` "egl: add glvnd entrypoints for EGL_MESA_query_driver" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-24 16:55:21 +00:00
Eric Engestrom	d2ca270511	travis: bump libdrm to 2.4.97 Fixes: `c02f761bdf` "winsys/amdgpu: use the new BO list API" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2019-01-24 14:50:33 +00:00
Veluri Mithun	85edfc04b8	egl: Implementation of egl dri2 drivers for MESA_query_driver Signed-off-by: Veluri Mithun <velurimithun38@gmail.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-24 14:37:52 +00:00
Eric Engestrom	98984b7cdd	egl: add glvnd entrypoints for EGL_MESA_query_driver Fixes: fbdd7bde29863935106c "egl: Implement EGL API for MESA_query_driver" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-24 14:37:47 +00:00
Veluri Mithun	6afce78128	egl: Implement EGL API for MESA_query_driver Signed-off-by: Veluri Mithun <velurimithun38@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-24 14:37:47 +00:00
Eric Engestrom	7d9274388b	egl: update headers from Khronos Cheating a tiny bit as these headers aren't in the Khronos repo yet, but I expect them to be within a couple days. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-24 14:37:44 +00:00
Eric Engestrom	381d0e753a	egl: finalize EGL_MESA_query_driver Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-24 14:37:36 +00:00
Matt Turner	e166003cb7	intel/compiler: Reset default flag register in brw_find_live_channel() emit_uniformize() emits SHADER_OPCODE_FIND_LIVE_CHANNEL with its flag_subreg set, so that the IR knows which flag is accessed. However the flag is only used on Gen7 in Align1 mode. To avoid setting unnecessary bits in the instruction words, get the information we need and reset the default flag register. This allows round-tripping through the assembler/disassembler. Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2019-01-23 22:48:29 -08:00
Kenneth Graunke	74c9c906f9	gallium: Add forgotten docs for PIPE_CAP_GLSL_TESS_LEVELS_AS_INPUTS. Thanks to Ilia for catching this.	2019-01-23 17:16:22 -08:00
Mark Janes	022800a058	Revert "Implement EGL API for MESA_query_driver" This reverts commit `ff621a5055`. with default warnings configuration, this commit generates: ../src/egl/main/eglapi.c:2654:1: error: no previous prototype for ‘eglGetDisplayDriverConfig’ [-Werror=missing-prototypes]	2019-01-23 16:29:13 -08:00
Mark Janes	9e9fa13c81	Revert "Implementation of egl dri2 drivers for MESA_query_driver" This reverts commit `2720f78ef2`.	2019-01-23 16:28:47 -08:00
Veluri Mithun	2720f78ef2	Implementation of egl dri2 drivers for MESA_query_driver Signed-off-by: Veluri Mithun <velurimithun38@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-23 22:29:14 +00:00
Veluri Mithun	ff621a5055	Implement EGL API for MESA_query_driver Signed-off-by: Veluri Mithun <velurimithun38@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2019-01-23 22:29:14 +00:00
Veluri Mithun	499869908b	Add extension doc for MESA_query_driver Signed-off-by: Veluri Mithun <velurimithun38@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2019-01-23 22:29:14 +00:00
Sergii Romantsov	cfca5cd958	nir: Length of boolean vtn_value now is 1 During conversion type-length was lost due to math. v2 (Jason Ekstrand): - Use a size/offset of 4 bytes Fixes: `44227453ec` (nir: Switch to using 1-bit Booleans for almost everything) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109353 Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Tested-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-23 15:43:06 -06:00
Marek Olšák	42aea4f1a7	st/mesa: fix PRIMITIVES_GENERATED query after the "pipeline stat single" changes When this functionality was added, the PRIMITIVES_GENERATED query was accidentally omitted. This causes issues for drivers that support transform feedback." Fixes: `d644698b44` ("gallium: Add the ability to query a single pipeline statistics counter") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-23 14:32:57 -05:00
Marek Olšák	c89e8470e5	st/mesa: purge framebuffers when unbinding a context This fixes pipe_surface "leaks". Cc: 18.3 <mesa-stable@lists.freedesktop.org> Acked-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-01-23 14:32:57 -05:00
Erik Faye-Lund	5c17c01815	docs: add note about sending merge-requests from forks Sending MRs from the main Mesa repository increase clutter in the repository, and decrease visibility of project-wide branches. So it's better if MRs are sent from forks instead. Let's add a note about this, in case its not obvious to everyone. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-23 18:14:06 +01:00
Rob Clark	5a4af871e3	freedreno: set modifier when exporting buffer Fixes an assert we start hitting with kms/gbm: #0 0x0000007fbf3d6e3c in raise () from /lib64/libc.so.6 #1 0x0000007fbf3c4a68 in abort () from /lib64/libc.so.6 #2 0x0000007fbf3d04e8 in __assert_fail_base () from /lib64/libc.so.6 #3 0x0000007fbf3d0550 in __assert_fail () from /lib64/libc.so.6 #4 0x0000007fbf5a73c4 in gbm_dri_bo_create (gbm=0x5820f0, width=2160, height=1440, format=875713112, usage=0, modifiers=0x695e00, count=1) at ../src/gbm/backends/dri/gbm_dri.c:1150 #5 0x0000007fbf5a49c4 in gbm_bo_create_with_modifiers (gbm=0x5820f0, width=2160, height=1440, format=875713112, modifiers=0x695e00, count=1) at ../src/gbm/main/gbm.c:491 #6 0x0000007fbbac3d64 in get_back_bo (dri2_surf=0x6f4cc0) at ../src/egl/drivers/dri2/platform_drm.c:258 #7 0x0000007fbbac4318 in dri2_drm_image_get_buffers (driDrawable=0x704490, format=4098, stamp=0x6fc730, loaderPrivate=0x6f4cc0, buffer_mask=1, buffers=0x7fffffe210) at ../src/egl/drivers/dri2/platform_drm.c:409 #8 0x0000007fbf5a5318 in image_get_buffers (driDrawable=0x704490, format=4098, stamp=0x6fc730, loaderPrivate=0x70e150, buffer_mask=1, buffers=0x7fffffe210) at ../src/gbm/backends/dri/gbm_dri.c:135 #9 0x0000007fbe4308c4 in dri_image_drawable_get_buffers (drawable=0x6fc730, images=0x7fffffe210, statts=0x6f2660, statts_count=1) at ../src/gallium/state_trackers/dri/dri2.c:339 #10 0x0000007fbe430c44 in dri2_allocate_textures (ctx=0x614b30, drawable=0x6fc730, statts=0x6f2660, statts_count=1) at ../src/gallium/state_trackers/dri/dri2.c:466 #11 0x0000007fbe435580 in dri_st_framebuffer_validate (stctx=0x714160, stfbi=0x6fc730, statts=0x6f2660, count=1, out=0x7fffffe3b8) at ../src/gallium/state_trackers/dri/dri_drawable.c:85 #12 0x0000007fbe7b2c84 in st_framebuffer_validate (stfb=0x6f2190, st=0x714160) at ../src/mesa/state_tracker/st_manager.c:222 #13 0x0000007fbe7b4884 in st_api_make_current (stapi=0x7fbf0430d8 <st_gl_api>, stctxi=0x714160, stdrawi=0x6fc730, streadi=0x6fc730) at ../src/mesa/state_tracker/st_manager.c:1074 #14 0x0000007fbe434f44 in dri_make_current (cPriv=0x703c20, driDrawPriv=0x704490, driReadPriv=0x704490) at ../src/gallium/state_trackers/dri/dri_context.c:301 #15 0x0000007fbe42c910 in driBindContext (pcp=0x703c20, pdp=0x704490, prp=0x704490) at ../src/mesa/drivers/dri/common/dri_util.c:579 #16 0x0000007fbbabab40 in dri2_make_current (drv=0x69d170, disp=0x69c6e0, dsurf=0x6f4cc0, rsurf=0x6f4cc0, ctx=0x70cb40) at ../src/egl/drivers/dri2/egl_dri2.c:1456 #17 0x0000007fbbaa8ef4 in eglMakeCurrent (dpy=0x69c6e0, draw=0x6f4cc0, read=0x6f4cc0, ctx=0x70cb40) at ../src/egl/main/eglapi.c:862 #18 0x0000007fbf5736ac in InternalMakeCurrentVendor (dpy=dpy@entry=0x614fb0, draw=draw@entry=0x6f4cc0, read=read@entry=0x6f4cc0, context=context@entry=0x70cb40, apiState=apiState@entry=0x6fc940, vendor=0x6975f0) at libegl.c:861 #19 0x0000007fbf573764 in InternalMakeCurrentDispatch (dpy=0x614fb0, draw=0x6f4cc0, read=0x6f4cc0, context=0x70cb40, vendor=0x6975f0) at libegl.c:630 #20 0x0000000000403640 in init_egl (egl=0x5805a8 <gl>, gbm=0x580528 <gbm>, samples=0) at ../common.c:263 #21 0x0000000000403c1c in init_cube_smooth (gbm=0x580528 <gbm>, samples=0) at ../cube-smooth.c:225 #22 0x0000000000408618 in main (argc=1, argv=0x7fffffe8d8) at ../kmscube.c:145 Fixes: `1ce5d757d0` freedreno: core buffer modifier support Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-23 10:21:00 -05:00
Samuel Pitoiset	963c044c55	radv: always pass the GFX9 fence data to si_cs_emit_cache_flush() Remove two useless checks. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-23 11:31:14 +01:00
Samuel Pitoiset	5f0b17d581	radv: compute the GFX9 fence VA at allocation time Instead of doing every time we emit cache flushes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-23 11:31:12 +01:00
Samuel Pitoiset	e7ac792400	radv: only allocate the GFX9 fence and EOP BOs for the gfx queue It's invalid to emit a ZPASS_DONE event on the compute queue, and the fence BO is unused on the compute queue (ie. we don't flush CB or DB caches). This saves some space in the upload BO. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-23 11:31:09 +01:00
Samuel Pitoiset	bd098884f1	radv: remove old_fence parameter from si_cs_emit_write_event_eop() This parameter is actually useless as the immediate value can always be zero. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-23 11:31:07 +01:00
Samuel Pitoiset	698afa177e	radv: improve gathering of load_push_constants with dynamic bindings For example, if a pipeline has two stages VS and FS. And if only the fragment stage needs dynamic bindings, we shouldn't allocate an extra user SGPR for the vertex stage. Of course, if the vertex stage loads constants, it needs an user SGPR. This should reduce the number of SET_SH_REG packets that are emitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-23 09:43:53 +01:00
Caio Marcelo de Oliveira Filho	e0485a1dd7	gallium: Add PIPE_CAP_GLSL_TESS_LEVELS_AS_INPUTS In the Intel backend, it makes the most sense to treat gl_TessLevelInner and gl_TessLevelOuter as ordinary shader inputs. For Radeon, it makes more sense to treat them as system values which get special handling. We already have a compiler option for this, but the Iris driver will need a capability bit so we can set it appropriately. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-01-23 00:35:56 -08:00
Ilia Mirkin	8e26d534be	nv50,nvc0: mark textures dirty on fb update We may have to flush the cache if there are any textures presently bound that refer to the outgoing framebuffer. This is only checked at validation time. Fixes a number of dEQP-GLES3.functional.fbo.color.repeated_clear.sample.* tests, which would bind a texture, then clear it while the binding was in effect, and then render to a different texture. This seems legal under the "no feedback loops" rule. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2019-01-22 23:16:01 -05:00
Timothy Arceri	678ef2a4a5	ac/nir_to_llvm: fix interpolateAt* for structs This fixes the arb_gpu_shader5 interpolateAt* tests that contain structs. Acked-by: Marek Olšák <marek.olsak@amd.com>	2019-01-23 10:41:37 +11:00
Timothy Arceri	559e5b0408	ac/nir_to_llvm: add bindless support for uniform handles Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-23 10:41:37 +11:00
Timothy Arceri	f0ed59076f	radeonsi/nir: add missing piece for bindless image support This fixes some piglit tests and is was TGSI does. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-23 10:41:37 +11:00
Rob Clark	1ce5d757d0	freedreno: core buffer modifier support Split out of a patch from Fritz Koenig to decouple from a6xx UBWC enablement, and added fd_resource_create_with_modifiers().	2019-01-22 16:33:27 -05:00
Rob Clark	c56fe4118a	loader: fix the no-modifiers case Normally modifiers take precendence over use flags, as they are more explicit. But if the driver supports modifiers, but the xserver does not, then we should fallback to the old mechanism of allocating a buffer using 'use' flags. Fixes: `069fdd5f9f` Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2019-01-22 16:33:27 -05:00
Fritz Koenig	7c4b9510d1	freedreno: add query for dmabuf modifiers	2019-01-22 16:33:27 -05:00
Fritz Koenig	ddbe6171e6	freedreno: drm_fourcc.h header include Add Qualcomm modifier for UBWC	2019-01-22 16:33:27 -05:00
Brian Paul	956c219c8f	svga: add new gallium formats to the format conversion table Fixes a static assertion which broke the build. Fixes: `3ee240890` "gallium: add SINT formats to have exact counterparts to SNORM formats" Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Neha Bhende<bhenden@vmware.com>	2019-01-22 12:58:04 -07:00
Marek Olšák	d85917deaf	radeonsi: rename rfence -> sfence Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 13:34:03 -05:00
Marek Olšák	260ff57647	radeonsi: rename rbo, rbuffer to buf or buffer Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 13:34:01 -05:00
Marek Olšák	63b91f25bc	radeonsi: rename rsrc -> ssrc, rdst -> sdst Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 13:33:04 -05:00
Marek Olšák	4666f36c04	radeonsi: rename rquery -> squery Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 13:32:59 -05:00
Marek Olšák	501ff90a95	radeonsi: rename r600_resource -> si_resource Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 13:32:18 -05:00
Lionel Landwerlin	a75b12ce66	vulkan: make generated enum to strings helpers available from c++ Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-22 18:20:53 +00:00
Marek Olšák	1cfbed7587	radeonsi: remove r600 from comments Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 12:26:45 -05:00
Marek Olšák	e0a6399eb4	winsys/amdgpu: rename rfence, rsrc, rdst -> afence, asrc, adst Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 12:26:45 -05:00
Marek Olšák	2792ec2cdd	radeonsi: rename rview -> sview Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 12:26:45 -05:00
Marek Olšák	96610f625d	radeonsi: rename rscreen -> sscreen Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 12:25:57 -05:00
Marek Olšák	86e25ed5a3	radeonsi: disable render cond & pipeline stats for internal compute dispatches	2019-01-22 12:24:35 -05:00
Sonny Jiang	1b25d340b7	radeonsi: use compute for resource_copy_region when possible v2: marek: fix snorm8 blits Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2019-01-22 12:24:35 -05:00
Jiang, Sonny	8daf5bb209	radeonsi: add compute_last_block to configure the partial block fields	2019-01-22 12:22:46 -05:00
Marek Olšák	b443465fb9	gallium/util: add util_format_snorm8_to_sint8 (from radeonsi)	2019-01-22 12:21:43 -05:00
Marek Olšák	3ee240890c	gallium: add SINT formats to have exact counterparts to SNORM formats for radeonsi	2019-01-22 12:21:43 -05:00
Marek Olšák	4d5f8f39f3	radeonsi: move PKT3_WRITE_DATA generation into a helper function Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 12:14:26 -05:00
Marek Olšák	c252273f98	radeonsi: don't use WRITE_DATA.DST_SEL == MEM_GRBM on >= CIK Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 12:14:26 -05:00
Marek Olšák	a545415eb9	radeonsi: fix the top-of-pipe fence on SI SI doesn't have MEM. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 12:14:26 -05:00
Marek Olšák	e402961e1d	radeonsi: correct WRITE_DATA.DST_SEL definitions Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 12:14:26 -05:00
Marek Olšák	c605738113	radeonsi: compile clear and copy buffer compute shaders on demand same as all other shaders	2019-01-22 11:59:27 -05:00
Marek Olšák	f139589069	radeonsi: remove redundant call to emit_cache_flush in compute clear/copy launch_grid calls it.	2019-01-22 11:59:27 -05:00
Marek Olšák	e3d283eaca	radeonsi: use buffer_store_format_x & xy	2019-01-22 11:59:27 -05:00
Marek Olšák	4c4c8bb1f0	radeonsi: fix rendering to tiny viewports where the viewport center is > 8K This fixes an assertion failure with GL CTS when cts-runner is used. (not a specific test) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108877 Cc: 18.3 <mesa-stable@lists.freedesktop.org>	2019-01-22 11:59:27 -05:00
Marek Olšák	caa2dcd730	radeonsi: fix a u_blitter crash after a shader with FBFETCH This fixes an assertion failure with GL CTS when cts-runner is used. (not a specific test) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108877 Cc: 18.3 <mesa-stable@lists.freedesktop.org>	2019-01-22 11:59:27 -05:00
Marek Olšák	c02f761bdf	winsys/amdgpu: use the new BO list API	2019-01-22 11:59:27 -05:00
Jason Ekstrand	ac0f8a6ea0	anv: Implement transform feedback queries Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-22 10:42:57 -06:00
Jason Ekstrand	7f4d9bb7b8	genxml: Add SO_PRIM_STORAGE_NEEDED and SO_NUM_PRIMS_WRITTEN Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-22 10:42:57 -06:00
Jason Ekstrand	673f33c77d	anv: Implement CmdBegin/EndQueryIndexed Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-22 10:42:57 -06:00
Jason Ekstrand	2be89cbd82	anv: Implement vkCmdDrawIndirectByteCountEXT Annoyingly, this requires that we implement integer division on the command streamer. Fortunately, we're only ever dividing by constants so we can use the mulh+add+shift trick and it's not as bad as it sounds. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-22 10:42:56 -06:00
Jason Ekstrand	36ee2fd61c	anv: Implement the basic form of VK_EXT_transform_feedback Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-22 10:42:56 -06:00
Jason Ekstrand	39925d60ec	anv: Add pipeline cache support for xfb_info Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-22 10:42:56 -06:00
Jason Ekstrand	e3bd49eaa7	anv: Add but do not enable VK_EXT_transform_feedback Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-22 10:42:56 -06:00
Alejandro Piñeiro	6b50b0a4a8	nir/xfb: distinguish array of structs vs array of blocks Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-22 10:42:56 -06:00
Jason Ekstrand	ac704e777c	nir/xfb: Properly handle arrays of blocks Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-22 10:42:56 -06:00
Alejandro Piñeiro	5649a0a6e8	nir/xfb: don't assert when xfb_buffer/stride is present but not xfb_offset In order to allow nir_gather_xfb_info to be used on OpenGL, specifically ARB_gl_spirv. So, from OpenGL 4.6 spec, section 11.1.2.1, "Output Variables": "outputs specifying both an XfbBuffer and an Offset are captured, while outputs not specifying both of these are not captured. Values are captured each time the shader writes to such a decorated object." This implies that are captured if both are present, and not if one of those are lacking. Technically, it doesn't explicitly point that having just one or the other is a mistake. In some cases, glslang is adding some extra XfbBuffer without XfbOffset around, and mentioning that technically that is not a bug (see issue#1526) And for the case of Vulkan, as the same glslang issue mentions, it is not clear if that should be a mistake or not. But even if it is a mistake, it is not really needed to be checked on the driver, and we can let the validation layers to check that. v2: simplify explicit_xfb_buffer and explicit_offset checks (Jason). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-22 10:42:56 -06:00
Jason Ekstrand	4f99ac9144	nir/xfb: Fix offset accounting for dvec3/4 Before, we were double-counting the component slots when we had a dvec3 or dvec4. Instead, just add them in once and manually offset the recorded output offset. Fixes: `19064b8c` "nir: Add a pass for gathering transform feedback info" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-22 10:42:56 -06:00
Jason Ekstrand	96fa23bca5	nir: Preserve offsets in lower_io_to_scalar_early Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-22 10:42:56 -06:00
Samuel Pitoiset	b2bbd978d0	nir: fix lowering arrays to elements for XFB outputs If we have a transform feedback output like: float[2] x2_out (VARYING_SLOT_VAR1.x, 0, 0) which is lowered by nir_lower_io_arrays_to_elements to, float x2_out (VARYING_SLOT_VAR1.x, 0, 0) float x2_out@5 (VARYING_SLOT_VAR2.x, 0, 0) We have to update the destination offset to avoid overwriting the same value. v2 (Jason Ekstrand): - Compute the correct offsets for arrays of vectors and/or doubles Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-22 10:42:56 -06:00
Samuel Pitoiset	9f4e0aa7c1	nir: do not remove varyings used for transform feedback When a xfb buffer is explicitely declared on a varying variable, we shouldn't remove it at link time. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-22 10:42:56 -06:00
Jason Ekstrand	9c14440e81	spirv: Only set interface_type on blocks Instead of setting interface_type to whatever the per-vertex type is, we only set it on blocks. This allows later passes to tell the difference between variables that are in blocks and those that aren't. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-22 10:42:56 -06:00
Jason Ekstrand	da29594636	spirv: Only split blocks Instead of splitting every per-vertex struct, just split the ones that are actually blocks. The reason for the split is so that we have separate variables for separate locations, qualifiers, and builtin decorations. The vulkan spec only allows these on members of blocks. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-22 10:42:56 -06:00
Jason Ekstrand	662cfb121b	spirv: Initialize struct member offsets to -1 This is the "no offset specified" value. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-22 10:42:56 -06:00
Jason Ekstrand	b4eae8444e	anv: Always emit at least one vertex element This seems to make the simulator happier. The early return wasn't really protecting anything and the code that follows will happily initialize the dummy element to STORE_0 and emit it. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-22 10:42:56 -06:00
Eric Engestrom	610f956fde	configure: EGL requirements only apply if EGL is built Issue was hit with this configuration: --disable-{egl,gbm} --with-platform=drm Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Fixes: `3208fd2e46` ("configure: move platform handling further up") Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-22 16:12:40 +00:00
Jonathan Marek	fc4f6b2f12	freedreno: a2xx: add partial lower_scalar pass for ir2 Some instructions can only be scalar on a2xx, lower these only Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-22 14:45:03 +00:00
Jonathan Marek	9f614c74b7	freedreno: a2xx: add ir2 copy propagation Two cases: * replacing srcs which refer to MOV instructions * replacing MOVs used to write to exports Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-22 14:45:03 +00:00
Jonathan Marek	c7dbf0b280	freedreno: a2xx: insert scalar MOV to allow 2 source scalar If we want to use a scalar instruction with two sources, both sources have to be in the same register. This covers a common case by inserting a scalar MOV into a previous instruction with only a vector alu instruction. A better method would be to have the sources end up in the same register in the first place, but when one source is a constant this is the only way. Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-22 14:45:03 +00:00
Jonathan Marek	67610a0323	freedreno: a2xx: NIR backend This patch replaces the a2xx TGSI compiler with a NIR compiler. It also adds several new features: -gl_FrontFacing, gl_FragCoord, gl_PointCoord, gl_PointSize -control flow (including loops) -texture related features (LOD/bias, cubemaps) -filling scalar ALU slot when possible Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2019-01-22 14:45:03 +00:00
Tapani Pälli	da3ca69afa	nir: cleanup glsl_get_struct_field_offset, glsl_get_explicit_stride Take away const qualifier from return type of these functions as -Wignored-qualifiers points out it is ignored for these cases. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 13:09:15 +02:00
Eric Engestrom	41a0c00392	travis: fix autotools build after --enable-autotools switch addition Fixes: `e68777c87c` "autotools: Deprecate the use of autotools" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 10:29:19 +00:00
Jason Ekstrand	27af1cc2a6	spirv: Update the JSON and headers from Khronos master This corresponds to commit 79b6681aadcb53c27d1052e on GitHub. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 18:55:05 -06:00
Jason Ekstrand	ca8c6c9781	nir: Mark deref UBO and SSBO access as non-scalar Fixes: `63b9aa2e25` "spirv: Add support for using derefs for..." Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 18:41:47 -06:00
Karol Herbst	5ee0adfb6e	nir/spirv: handle ContractionOff execution mode Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 20:36:41 +01:00
Rob Clark	fa737042ad	nir/vtn: add caps for some cl related capabilities vtn supports these, so don't squalk if user is happy with enabling these. v2: add new members sorted Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 20:36:41 +01:00
Karol Herbst	ce08e5f39c	vtn: handle SpvExecutionModelKernel Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 20:36:41 +01:00
Karol Herbst	8bb46de08b	mesa: add MESA_SHADER_KERNEL used for CL kernels Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 20:36:41 +01:00
Jason Ekstrand	2aa78e46e9	anv/pipeline: Add a pdevice helper variable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-21 11:57:00 -06:00
Jason Ekstrand	344171b9ee	relnotes: Add newly added Vulkan extensions Both the Intel and RADV people have been really bad about adding things to the release notes. We should start actually paying attention. Acked-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-21 11:46:06 -06:00
Jason Ekstrand	c7f4a2867c	anv: Only parse pImmutableSamplers if the descriptor has samplers Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-21 11:45:58 -06:00
Rhys Perry	f0ba826054	radv: prevent dirtying of dynamic state when it does not change DXVK often sets dynamic state without actually changing it. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 14:37:53 +00:00
Rhys Perry	e4c6423c5e	radv: avoid context rolls when binding graphics pipelines It's common in some applications to bind a new graphics pipeline without ending up changing any context registers. This has a pipline have two command buffers: one for setting context registers and one for everything else. The context register command buffer is only emitted if it differs from the previous pipeline's. v2: ensure late scissor emission is done when radv_emit_rbplus_state() is called v2: make use of cmd_buffer->state.workaround_scissor_bug v3: rename "workaround_scissor_bug" to "context_roll_without_scissor_emitted" Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 14:37:53 +00:00
Rhys Perry	5564a797f2	radv: add missed situations for scissor bug workaround v2: rename "workaround_scissor_bug" to "context_roll_without_scissor_emitted" Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 14:37:53 +00:00
Rhys Perry	5d1a29071a	radv: pass radv_draw_info to radv_emit_draw_registers() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 14:37:53 +00:00
Jonathan Marek	5886c5d092	freedreno: a2xx: sysmem rendering Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-21 09:22:34 -05:00
Jonathan Marek	bec6e4b054	freedreno: a2xx: fix non-zero texture base offsets Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-21 09:22:27 -05:00
Jonathan Marek	02ab85afd8	freedreno: a2xx: fix VERTEX_REUSE/DEALLOC on a20x On a20x, set VGT_VERTEX_REUSE_BLOCK_CNTL to 2 and don't change it. Small rearrangement on a220 to reduce the size of draw commands. Only set DEALLOC_CNTL on a20x because the correct a220 value is not known. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-21 09:22:22 -05:00
Jonathan Marek	0286a11b7e	freedreno: a2xx: fix gmem2mem viewport Fixes cases where previous viewport values might case gmem2mem to fail. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-21 09:22:16 -05:00
Jonathan Marek	64b12520a2	freedreno: a2xx: cleanup REG_A2XX_PA_CL_VTE_CNTL Doesn't change much, but reduces the size of fd2_emit_state gmem2mem does not need to change the value: no Z clipping on resolve mem2gmem now needs to restore the common value after rendering Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-21 09:22:10 -05:00
Jonathan Marek	6ef7700ac6	freedreno: a2xx: cleanup init_shader_const Only 3 vertices are used so we can drop the data for vertex 4 It doesn't make sense to have 1.1 for some coordinates, use 1.0 instead Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-21 09:21:51 -05:00
Karol Herbst	0a793c78a3	nir: add bit_size parameter to system values with multiple allowed bit sizes v2: add assert to verify we have at least one valid bit_size v3: fix use of load_front_face in nir_lower_two_sided_color and tgsi_to_nir Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 00:17:18 +01:00
Karol Herbst	4125211e9c	nir: add legal bit_sizes to intrinsics With OpenCL some system values match the address bits, but in GLSL we also have some system values being 64 bit like subgroup masks. With this it is possible to adjust the builder functions so that depending on the bit_sizes the correct bit_size is used or an additional argument is added in case of multiple possible values. v2: validate dest bit_size v3: generate hex values in python code remove useless imports rename and move bit_sizes v4: add 1 to legal bit_sizes for front_face Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 00:16:51 +01:00
Karol Herbst	27bd07e230	nir/validate: allow to check against a bitmask of bit_sizes Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 00:16:51 +01:00
Karol Herbst	b9fec2b38c	nir: replace more nir_load_system_value calls with builder functions Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-21 00:16:51 +01:00
Karol Herbst	987744be98	glsl/lower_output_reads: set invariant and precise flags on temporaries fixes a couple of deqp tests (on nvc0 and potential other drivers): dEQP-GLES3.functional.shaders.invariance.highp.common_subexpression_1 dEQP-GLES3.functional.shaders.invariance.highp.common_subexpression_2 dEQP-GLES3.functional.shaders.invariance.highp.common_subexpression_3 dEQP-GLES3.functional.shaders.invariance.mediump.common_subexpression_1 dEQP-GLES3.functional.shaders.invariance.mediump.common_subexpression_2 dEQP-GLES3.functional.shaders.invariance.mediump.common_subexpression_3 dEQP-GLES3.functional.shaders.invariance.lowp.common_subexpression_1 dEQP-GLES3.functional.shaders.invariance.lowp.common_subexpression_2 dEQP-GLES3.functional.shaders.invariance.lowp.common_subexpression_3 CC: <mesa-stable@lists.freedesktop.org> Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-01-21 00:16:50 +01:00
Rhys Kidd	8002eaab6c	nv50,nvc0: add missing CAPs for unsupported features Signed-off-by: Rhys Kidd <rhyskidd@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-01-20 13:51:01 -05:00
Karol Herbst	acdad24585	nir/spirv: handle SpvStorageClassCrossWorkgroup v2: rename nir_var_global to nir_var_mem_global Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-19 20:01:42 +01:00
Karol Herbst	36a76b7192	nir: rename nir_var_shared to nir_var_mem_shared Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-19 20:01:41 +01:00
Karol Herbst	6fefd69724	nir: rename nir_var_ssbo to nir_var_mem_ssbo Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-19 20:01:41 +01:00
Karol Herbst	3afc1e068f	nir: rename nir_var_ubo to nir_var_mem_ubo Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-19 20:01:41 +01:00
Karol Herbst	9b24028426	nir: rename nir_var_function to nir_var_function_temp Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-19 20:01:41 +01:00
Karol Herbst	e5daef9587	nir: rename nir_var_private to nir_var_shader_temp Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-19 20:01:41 +01:00
Lionel Landwerlin	ad99c1670a	intel/genxml: add missing MI_PREDICATE compare operations Doesn't save us a great deal of lines but at least they get decoded in aubinators. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-01-19 15:47:36 +00:00
Lionel Landwerlin	79514cc5fb	anv: document cache flushes & invalidations A little bit of explanation regarding how vkCmdPipelineBarrier() works. v2: Avoid referring to data port cache when it's actually sampler caches (Jason) Complete explanation for indirect draws (Jason) v3: s/samplers/sampler/ (Jason) s/UBOs/data port/ Add documentation for VK_ACCESS_CONDITIONAL_RENDERING_READ_BIT_EXT (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com> (v1) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (v2)	2019-01-19 15:45:41 +00:00
Lionel Landwerlin	3c4c18341a	anv: narrow flushing of the render target to buffer writes In commit `9a7b319903` ("anv/query: flush render target before copying results") we tracked all the render target writes to apply a flushes in the vkCopyQueryResults(). But we can narrow this down to only when we write a buffer (which is the only input of vkCopyQueryResults). v2: Drop newer render target write flags introduce by `1952fd8d2c` ("anv: Implement VK_EXT_conditional_rendering for gen 7.5+") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (v1)	2019-01-19 15:45:41 +00:00
Timothy Arceri	6ca652faf3	glsl: be much more aggressive when skipping shader compilation Currently we only add a cache key for a shader once it is linked. However games like Team Fortress 2 compile a whole bunch of shaders which are never actually linked. These compiled shaders can take up a bunch of memory. This patch changes things so that we add the key for the shader to the cache as soon as it is compiled. This means on a warm cache we can avoid the wasted memory from these shaders. Worst case scenario is we need to compile the shaders at link time but this can happen anyway if the shader has been evicted from the cache. Reduces memory use in Team Fortress 2 from 1.3GB -> 770MB on a warm cache from start up to the game menu. V2: only add key to cache when compilation is successful. Acked-by: Marek Olšák <marek.olsak@amd.com>	2019-01-19 13:12:25 +11:00
Francisco Jerez	c84ec70b3a	intel/fs: Promote execution type to 32-bit when any half-float conversion is needed. The docs are fairly incomplete and inconsistent about it, but this seems to be the reason why half-float destinations are required to be DWORD-aligned on BDW+ projects. This way the regioning lowering pass will make sure that the destination components of W to HF and HF to W conversions are aligned like the corresponding conversion operation with 32-bit execution data type. Tested-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-18 16:09:39 -08:00
Timothy Arceri	9e669ed22b	ac/nir_to_llvm: fix interpolateAt* for arrays This builds on the recent interpolate fix by Rhys `ee8488ea3b`. This fixes the arb_gpu_shader5 interpolateAt* tests that contain arrays. Fixes: `ee8488ea3b` ("ac/nir,radv,radeonsi/nir: use correct indices for interpolation intrinsics") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-19 10:59:38 +11:00
Timothy Arceri	860a9e4849	Revert "glsl: be much more aggressive when skipping shader compilation" This reverts commit `64b8c86d37`. Reverting for now as it was causing some segfaults.	2019-01-19 10:45:07 +11:00
Kristian H. Kristensen	5486c9d526	freedreno/a6xx: Turn on texture tiling by default The color swap isn't available for tiled formats and it's not needed either. We pick one channel order and use for all non-linear formats. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-18 14:27:15 -08:00
Kristian H. Kristensen	60c6778dda	freedreno: Synchronize batch and flush for staging resource Staging blit downloads would wait on the src resource instead of the staging resource and didn't make sure to submit the blit batch first. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-18 14:27:12 -08:00
Timothy Arceri	64b8c86d37	glsl: be much more aggressive when skipping shader compilation Currently we only add a cache key for a shader once it is linked. However games like Team Fortress 2 compile a whole bunch of shaders which are never actually linked. These compiled shaders can take up a bunch of memory. This patch changes things so that we add the key for the shader to the cache as soon as it is compiled. This means on a warm cache we can avoid the wasted memory from these shaders. Worst case scenario is we need to compile the shaders at link time but this can happen anyway if the shader has been evicted from the cache. Reduces memory use in Team Fortress 2 from 1.3GB -> 770MB on a warm cache from start up to the game menu. Acked-by: Marek Olšák <marek.olsak@amd.com>	2019-01-19 08:24:47 +11:00
Timothy Arceri	c9d7b0f184	glsl: don't skip GLSL IR opts on first-time compiles This basically reverts `c2bc0aa7b1`. By running the opts we reduce memory using in Team Fortress 2 from 1.5GB -> 1.3GB from start-up to game menu. This will likely increase Deus Ex start up times as per commit `c2bc0aa7b1`. However currently 32bit games like Team Fortress 2 can run out of memory on low memory systems, so that seems more important. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-19 08:24:43 +11:00
Caio Marcelo de Oliveira Filho	cd56d79b59	nir: check NIR_SKIP to skip passes by name Passes' function names, separated by comma, listed in NIR_SKIP environment variable will be skipped in debug mode. The mechanism is hooked into the _PASS macro, like NIR_PRINT. The extra macro NIR_SKIP is available as a developer convenience, to skip at pointer other than the passes entry points. v2: Fix typo in NIR_SKIP macro. (Bas) Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-18 12:31:49 -08:00
Danylo Piliaiev	1952fd8d2c	anv: Implement VK_EXT_conditional_rendering for gen 7.5+ Conditional rendering affects next functions: - vkCmdDraw, vkCmdDrawIndexed, vkCmdDrawIndirect, vkCmdDrawIndexedIndirect - vkCmdDrawIndirectCountKHR, vkCmdDrawIndexedIndirectCountKHR - vkCmdDispatch, vkCmdDispatchIndirect, vkCmdDispatchBase - vkCmdClearAttachments Value from conditional buffer is cached into designated register, MI_PREDICATE is emitted every time conditional rendering is enabled and command requires it. v2: by Jason Ekstrand - Use vk_find_struct_const instead of manually looping - Move draw count loading to prepare function - Zero the top 32-bits of MI_ALU_REG15 v3: Apply pipeline flush before accessing conditional buffer (The issue was found by Samuel Iglesias) v4: - Remove support of Haswell due to possible hardware bug - Made TMP_REG_PREDICATE and TMP_REG_DRAW_COUNT defines to define registers in one place. v5: thanks to Jason Ekstrand and Lionel Landwerlin - Workaround the fact that MI_PREDICATE_RESULT is not accessible on Haswell by manually calculating MI_PREDICATE_RESULT and re-emitting MI_PREDICATE when necessary. v6: suggested by Lionel Landwerlin - Instead of calculating the result of predicate once - re-emit MI_PREDICATE to make it easier to investigate error states. v7: suggested by Jason - Make anv_pipe_invalidate_bits_for_access_flag add CS_STALL if VK_ACCESS_CONDITIONAL_RENDERING_READ_BIT is set. v8: suggested by Lionel - Precompute conditional predicate's result to support secondary command buffers. - Make prepare_for_draw_count_predicate more readable. Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-18 18:31:44 +00:00
Danylo Piliaiev	ed6e2bf263	anv: Implement VK_KHR_draw_indirect_count for gen 7+ v2: by Jason Ekstrand - Move out of the draw loop population of registers which aren't changed in it. - Remove dependency on ALU registers. - Clarify usage of PIPE_CONTROL - Without usage of ALU registers patch works for gen7+ v3: set pending_pipe_bits \|= ANV_PIPE_RENDER_TARGET_WRITES Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-18 18:31:44 +00:00
Dylan Baker	9e989b860a	bin/meson-cmd-extract: Also handle cross and native files Native file support in command line serialization isn't present in meson 0.49, but will be for 0.49.1 and 0.50 Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-01-18 09:37:01 -08:00
Jason Ekstrand	b54df1b6df	anv: Re-sort the extensions list I like to keep things in good order so that you can find them. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-18 10:32:23 -06:00
Jason Ekstrand	eb32dad07c	intel/fs: Don't touch accumulator destination while applying regioning alignment rule In some shaders, you can end up with a stride in the source of a SHADER_OPCODE_MULH. One way this can happen is if the MULH is acting on the top bits of a 64-bit value due to 64-bit integer lowering. In this case, the compiler will produce something like this: mul(8) acc0<1>UD g5<8,4,2>UD 0x0004UW { align1 1Q }; mach(8) g6<1>UD g5<8,4,2>UD 0x00000004UD { align1 1Q AccWrEnable }; The new region fixup pass looks at the MUL and sees a strided source and unstrided destination and determines that the sequence is illegal. It then attempts to fix the illegal stride by replacing the destination of the MUL with a temporary and emitting a MOV into the accumulator: mul(8) g9<2>UD g5<8,4,2>UD 0x0004UW { align1 1Q }; mov(8) acc0<1>UD g9<8,4,2>UD { align1 1Q }; mach(8) g6<1>UD g5<8,4,2>UD 0x00000004UD { align1 1Q AccWrEnable }; Unfortunately, this new sequence isn't correct because MOV accesses the accumulator with a different precision to MUL and, instead of filling the bottom 32 bits with the source and zeroing the top 32 bits, it leaves the top 32 (or maybe 31) bits alone and full of garbage. When the MACH comes along and tries to complete the multiplication, the result is correct in the bottom 32 bits (which we throw away) and garbage in the top 32 bits which are actually returned by MACH. This commit does two things: First, it adds an assert to ensure that we don't try to rewrite accumulator destinations of MUL instructions so we can avoid this precision issue. Second, it modifies required_dst_byte_stride to require a tightly packed stride so that we fix up the sources instead and the actual code which gets emitted is this: mov(8) g9<1>UD g5<8,4,2>UD { align1 1Q }; mul(8) acc0<1>UD g9<8,8,1>UD 0x0004UW { align1 1Q }; mach(8) g6<1>UD g5<8,4,2>UD 0x00000004UD { align1 1Q AccWrEnable }; Fixes: `efa4e4bc5f` "intel/fs: Introduce regioning lowering pass" Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2019-01-18 10:18:52 -06:00
Jason Ekstrand	0a7ac6d543	intel/eu: Stop overriding exec sizes in send_indirect_message For a long time, we based exec sizes on destination register widths. We've not been doing that since `1ca3a94427` but a few remnants accidentally remained. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2019-01-18 10:18:52 -06:00
Samuel Pitoiset	f682ed11c3	radv: initialize the per-queue descriptor BO only once Totally useless to write the descriptors inside the loop. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-18 13:26:32 +01:00
Samuel Pitoiset	72d9745a40	radv: do not write unused descriptors to the per-queue BO Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-18 13:26:30 +01:00
Samuel Pitoiset	8c164ea8f5	radv: reduce size of the per-queue descriptor BO Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-18 13:26:28 +01:00
Samuel Pitoiset	83cc87ead4	radv: drop unused code related to 16 sample locations The driver only supports up to 8 sample locations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-18 13:26:24 +01:00
Karol Herbst	80dae7022e	gm107/ir: disable TEXS for tex with derivAll set fixes deqp tests: dEQP-GLES3.functional.shaders.texture_functions.texturegrad.samplercube_fixed_vertex dEQP-GLES3.functional.shaders.texture_functions.texturegrad.samplercube_float_vertex dEQP-GLES3.functional.shaders.texture_functions.texturegrad.isamplercube_vertex dEQP-GLES3.functional.shaders.texture_functions.texturegrad.usamplercube_vertex dEQP-GLES3.functional.shaders.texture_functions.texturegrad.sampler3d_fixed_vertex dEQP-GLES3.functional.shaders.texture_functions.texturegrad.sampler3d_float_vertex dEQP-GLES3.functional.shaders.texture_functions.texturegrad.isampler3d_vertex dEQP-GLES3.functional.shaders.texture_functions.texturegrad.usampler3d_vertex dEQP-GLES3.functional.shaders.texture_functions.texturegrad.sampler2dshadow_vertex dEQP-GLES3.functional.shaders.texture_functions.textureprojgrad.sampler3d_fixed_vertex dEQP-GLES3.functional.shaders.texture_functions.textureprojgrad.sampler3d_float_vertex dEQP-GLES3.functional.shaders.texture_functions.textureprojgrad.isampler3d_vertex dEQP-GLES3.functional.shaders.texture_functions.textureprojgrad.usampler3d_vertex dEQP-GLES3.functional.shaders.texture_functions.textureprojgrad.sampler2dshadow_vertex Fixes: `f821e80213` "gm107/ir: use scalar tex instructions where possible" Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-01-18 03:27:51 +01:00
Karol Herbst	30b5c9eda2	nv50/ir: disable tryCollapseChainedMULs in ConstantFolding for precise instructions fixes dEQP-GLES2.functional.shaders.invariance.mediump.loop_3 CC: <mesa-stable@lists.freedesktop.org> Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-01-18 02:03:30 +01:00
Bas Nieuwenhuizen	8424cd8fbd	nir: Account for atomics in copy propagation. Otherwise writes get propagated across atomics if no barrier is used. Without barrier writes should still be visible in the same invocation, so an atomic has to be considered a write. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Fixes: `b3c6146925` "nir: Copy propagation between blocks" Fixes: `62332d139c` "nir: Add a local variable-based copy propagation pass"	2019-01-18 00:55:35 +01:00
Rafael Antognolli	927ba12b53	anv/tests: Adding test for the state_pool padding. Add a test that checks that we can use the extra space allocated for padding while allocating larger anv_states. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:26 -08:00
Rafael Antognolli	731c4adcf9	anv/allocator: Add support for non-userptr. If softpin is supported, create new BOs for the required size and add the respective BO maps. The other main change of this commit is that anv_block_pool_map() now returns the map for the BO that the given offset is part of. So there's no block_pool->map access anymore (when softpin is used. v3: - set fd to -1 on softpin case (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:24 -08:00
Rafael Antognolli	643248b66a	anv: Remove state flush. We have all the state buffers snooped, so we don't need to clflush everything anymore. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:22 -08:00
Rafael Antognolli	5d61c74f3d	anv/allocator: Enable snooping on block pool and anv_bo_pool BOs. We are not going to use userptr for anv block pool BOs anymore. However, so far we have been relying on the fact that userptr BOs are snooped on non-llc platforms. Let's make sure that the block pool BOs are still snooped, and we can also remove the clflush'ing that we do on all state buffers. And since we plan to remove the flushes, set the anv_bo_pool BOs to cached (snooped on non-LLC platforms) too. For LLC platforms, they are all cached by default, so this becomes a no-op. v5: - Add snooping to anv_bo_pool BOs too (Jason). - Remove anv_gem_set_domain. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:20 -08:00
Rafael Antognolli	dfc9ab2ccd	anv/allocator: Add padding information. It's possible that we still have some space left in the block pool, but we try to allocate a state larger than that state. This means such state would start somewhere within the range of the old block_pool, and end after that range, within the range of the new size. That's fine when we use userptr, since the memory in the block pool is CPU mapped continuously. However, by the end of this series, we will have the block_pool split into different BOs, with different CPU mapping ranges that are not necessarily continuous. So we must avoid such case of a given state being part of two different BOs in the block pool. This commit solves the issue by detecting that we are growing the block_pool even though we are not at the end of the range. If that happens, we don't use the space left at the end of the old size, and consider it as "padding" that can't be used in the allocation. We update the size requested from the block pool to take the padding into account, and return the offset after the padding, which happens to be at the start of the new address range. Additionally, we return the amount of padding we used, so the caller knows that this happens and can return that padding back into a list of free states, that can be reused later. This way we hopefully don't waste any space, but also avoid having a state split between two different BOs. v3: - Calculate offset + padding at anv_block_pool_alloc_new (Jason). v4: - Remove extra "leftover". Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:19 -08:00
Rafael Antognolli	7ed0898a8d	anv/allocator: Rework chunk return to the state pool. This commit tries to rework the code that split and returns chunks back to the state pool, while still keeping the same logic. The original code would get a chunk larger than we need and split it into pool->block_size. Then it would return all but the first one, and would split that first one into alloc_size chunks. Then it would keep the first one (for the allocation), and return the others back to the pool. The new anv_state_pool_return_chunk() function will take a chunk (with the alloc_size part removed), and a small_size hint. It then splits that chunk into pool->block_size'd chunks, and if there's some space still left, split that into small_size chunks. small_size in this case is the same size as alloc_size. The idea is to keep the same logic, but make it in a way we can reuse it to return other chunks to the pool when we are growing the buffer. v2: - Include Jason's suggestions to the algorithm that returns chunks. - Update comments. v3: - Disallow returning 0 blocks (Jason). - fix min_size in the loop (Jason). - remove temporary variables (Jason) v4: - return_chunk() should never return blocks larger than pool->block_size. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:17 -08:00
Rafael Antognolli	6a1f4c96cc	anv: Remove some asserts. They won't be true anymore once we add support for multiple BOs with non-userptr. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:14 -08:00
Rafael Antognolli	f39dad7e4e	anv: Validate the list of BOs from the block pool. We now have multiple BOs in the block pool, but sometimes we still reference only the first one in some instructions, and use relative offsets in others. So we must be sure to add all the BOs from the block pool to the validation list when submitting commands. v2: - Don't add block pool BOs to the dependency list right before execbuf (Jason) - Call anv_execbuf_add_bo() to each BO in the block pools (Jason) - Use anv_execbuf_add_bo_set() to add surface state dependencies to execbuf. v3: - Add comment to the non-softpin case (Jason). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:10 -08:00
Rafael Antognolli	11a5d4620b	anv: Split code to add BO dependencies to execbuf. This part of the anv_execbuf_add_bo() code is totally independent of the BO being added. Let's split it out, so we can reuse it later. v3: rename to anv_execbuf_add_bo_set (Jason). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:08 -08:00
Rafael Antognolli	f874604f45	anv/allocator: Add support for a list of BOs in block pool. So far we use only one BO (the last one created) in the block pool. When we switch to not use the userptr API, we will need multiple BOs. So add code now to store multiple BOs in the block pool. This has several implications, the main one being that we can't use pool->map as before. For that reason we update the getter to find which BO a given offset is part of, and return the respective map. v3: - Simplify anv_block_pool_map (Jason). - Use fixed size array for anv_bo's (Jason) v4: - Respect the order (item, container) in anv_block_pool_foreach_bo (Jason). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:04 -08:00
Rafael Antognolli	e3dc56d731	anv: Update usage of block_pool->bo. Change block_pool->bo to be a pointer, and update its usage everywhere. This makes it simpler to switch it later to a list of BOs. v3: - Use a static "bos" field in the struct, instead of malloc'ing it. This will be later changed to a fixed length array of BOs. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:02 -08:00
Rafael Antognolli	fc3f588320	anv/allocator: Remove pool->map. After switching to using anv_state_table, there are very few places left still using pool->map directly. We want to avoid that because it won't be always the right map once we split it into multiple BOs. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:08:00 -08:00
Rafael Antognolli	54e21e145e	anv/allocator: Rename anv_free_list2 to anv_free_list. Now that we removed the original anv_free_list, we can now use its name. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:07:58 -08:00
Rafael Antognolli	234c9d8a40	anv/allocator: Remove anv_free_list. The next commit already renames anv_free_list2 -> anv_free_list since the old one is gone. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:07:56 -08:00
Rafael Antognolli	e2179aceaf	anv/allocator: Use anv_state_table on back_alloc too. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:07:52 -08:00
Rafael Antognolli	d18267fb48	anv/allocator: Use anv_state_table on anv_state_pool_alloc. Use anv_state_pool_return_blocks() to return blocks to the pool, instead of manually pushing them. v3: - return blocks from the end of the chunk (Jason). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:07:50 -08:00
Rafael Antognolli	6a1dcfe73d	anv/allocator: Add helper to push states back to the state table. The use of anv_state_table_add() combined with anv_state_table_push(), specially when adding a bunch of states to the table, is very verbose. So we add this helper that makes things easier to digest. We also already add the anv_state_table member in this commit, so things can compile properly, even though it's not used. v2: assert that the states are always aligned to their size (Jason) v3: Add "table" member to anv_state_pool in this commit. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:07:47 -08:00
Rafael Antognolli	e8b6e0a5ba	anv/allocator: Add getter for anv_block_pool. We will need the anv_block_pool_map to find the map relative to some BO that is not at the start of the block pool. v2: just return a pointer instead of a struct (Jason) v4: Update comment (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:07:43 -08:00
Rafael Antognolli	6a2d5ae305	anv/allocator: Add anv_state_table. Add a structure to hold anv_states. This table will initially be used to recycle anv_states, instead of relying on a linked list implemented in GPU memory. Later it could be used so that all anv_states just point to the content of this struct, instead of making copies of anv_states everywhere. One has to call anv_state_table_add(), which returns an index for the state in the table, and then get a pointer to such index, and finally fill in the rest of the struct. TODO: 1) There's a lot of common code between this table backing store memory and the anv_block_pool buffer, due to how we grow it. I think it's possible to refactory this and reuse code on both places. 2) Add unit tests. v3: - Rename state table memfd (Jason) - Return VK_ERROR_OUT_OF_HOST_MEMORY on more places (Jason) - anv_state_table_grow returns VkResult (Jason) - Rename variables to be more informative (Jason) - Return errors on state table grow. - Rename anv_state_table_push/pop to anv_free_list_push2/pop2 This will be renamed again to remove the trailing "2" later. v4: - Remove exit(-1) from anv_state_table (Jason). - Use uint32_t "next" field in anv_free_entry (Jason). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:07:34 -08:00
Rafael Antognolli	27478ce00e	anv/tests: Fix block_pool_no_free test. There were 2 problems with this test. First it was comparing highest, which was -1, with an uint32_t. So the current value would never be higher than that, and the assert would always be false. It just never reached this point because of the next problem. It was always looking for the highest value of each thread and storing it in thread_max. So a test case like this wouldn't work: [Thread]: [Blocks] [0]: [0, 32, 64, 96] [1]: [128, 160, 192, 224] [2]: [256, 288, 320, 352] Not only that would skip values and iterate only over thread number 2, instead of walking through all of them, but thread_max was also initialized to -1. And then compared to unsigned blocks[i][next[i]. We fix that by getting the smallest value of each thread, and checking if it is lower than thread_min, which is initialized to INT32_MAX. And then we end up walking through all the blocks of all threads. We also change "blocks" to be int32_t instead of uint32_t, since in some places (alloc_blocks) it was already referenced as int32_t, and that fixes the comparison to -1. v2: - keep highest initialized to -1, and change blocks to be int32_t. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 15:05:58 -08:00
Lionel Landwerlin	4149d41f2e	anv: fix invalid binding table index computation The ++ operator strikes again. Fixes: `f92c5bc8f3` ("anv/device: fix maximum number of images supported") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-17 11:49:10 -08:00
Eric Engestrom	c4c5c90255	docs: explain how to see what meson options exist Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-01-17 17:05:41 +00:00
Emil Velikov	406623f5b1	docs: update calendar, add news item and link release notes for 18.3.2 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-17 11:37:41 +00:00
Emil Velikov	9d58641bf2	docs: add sha256 checksums for 18.3.2 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `8320a07221`)	2019-01-17 11:32:20 +00:00
Emil Velikov	2dad014496	docs: add release notes for 18.3.2 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `95a3b709c0`)	2019-01-17 11:32:19 +00:00
Iago Toral Quiroga	f92c5bc8f3	anv/device: fix maximum number of images supported We had defined MAX_IMAGES as 8, which we used to size the array for image push constant data. The comment there stated that this was for gen8, but anv_nir_apply_pipeline_layout runs for all gens and writes that array, asserting that we don't exceed that number of images, which imposes a limit of MAX_IMAGES on all gens. Furthermore, despite this, we are exposing up to 64 images per shader stage on all gens, gen8 included. This patch lowers the number of images we expose in gen8 to 8 and keeps 64 images for gen9+ while making sure that only pre-SKL gens use push constant space to handle images. v2: - <= instead of < in the assert (Eric, Lionel) - Change the way the assertion is written (Eric) v3: - Revert the way the assertion is written to the form it had in v1, the version in v2 was not equivalent and was incorrect. (Lionel) v4: - gen9+ doesn't need push constants for images at all (Jason) Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (v3)	2019-01-17 07:59:00 +01:00
Tapani Pälli	a311aa631d	anv: do not advertise AHW support if extension not enabled Fixes following failing vk-gl-cts cases on Linux desktop: dEQP-VK.api.external.memory.android_hardware_buffer.suballocated.buffer.info dEQP-VK.api.external.memory.android_hardware_buffer.suballocated.image.info dEQP-VK.api.external.memory.android_hardware_buffer.dedicated.image.info dEQP-VK.api.external.memory.android_hardware_buffer.dedicated.buffer.info Fixes: `517103abf1` "anv/android: add ahardwarebuffer external memory properties" Reported-by: Juan A. Suarez <jasuarez@igalia.com> Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2019-01-17 07:22:02 +02:00
Eric Anholt	99ef66c325	vc4: Don't leak the GPU fd for renderonly usage. Noticed while debugging V3D -- the ro->gpu_fd was freshly opened in ro setup, and it needs to stay open until screen close (since it may be used by renderonly) and should be the same one used by the vc4 screen. Fixes: `7029ec05e2` ("gallium: Add renderonly-based support for pl111+vc4.")	2019-01-16 16:28:41 -08:00
Eric Anholt	0605726776	v3d: Don't leak the GPU fd for renderonly usage. The CTS was running out of fds, because of the ro->gpu_fd never being closed. ro->gpu_fd should match the screen (in case the caller of v3d_drm_screen_create_renderonly() has a scanout_for_resource() that uses gpu_fd) and the screen is expected to close its fd at the end, fixing the resource leak. Fixes: `e113b21cb7` ("v3d: Add renderonly support.")	2019-01-16 16:28:41 -08:00
Eric Anholt	59527a36e9	v3d: Restructure RO allocations using resource_from_handle. I had bugs in the old path where I was laying out as tiled (so we'd render tiled) but then only allocating space in the shared object for linear rendering. The resource_from_handle makes it so the same layout choices are made in both the import and export scanout cases. Also, fixes a leak of the fd that was tripping up the CTS. Now that we're checking PIPE_BIND_SHARED to choose to use RO, the DRM_FORMAT_MOD_LINEAR check wasn't needed any more. Fixes visual corruption and MMU faults in X in renderonly mode. Fixes: `bd09bb1629` ("v3d: SHARED but not necessarily SCANOUT buffers on RO must be linear.")	2019-01-16 16:28:41 -08:00
Eric Anholt	d70eb2302b	v3d: If the modifier is not known on BO import, default to linear for RO. Part of fixing DRI3 rendering with RO on X11. Fixes: `e113b21cb7` ("v3d: Add renderonly support.")	2019-01-16 16:28:41 -08:00
Timothy Arceri	cb527d2c4c	ac/nir_to_llvm: add support for structs to get_sampler_desc() Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-17 10:35:36 +11:00
Timothy Arceri	b12316cc92	ac/nir_to_llvm: fix regression in bindless support This wasn't ported over when deref support was implemented. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-17 10:35:36 +11:00
Timothy Arceri	e106e0f2dd	radeonsi/nir: get correct type for images inside structs Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-17 10:35:36 +11:00
Timothy Arceri	292887ac0d	ac/nir_to_llvm: fix type handling in image code The current code only strips off arrays and cannot find the type for images that are struct members. Instead of trying to get the image type from the variable, we just get it directly from the deref instruction. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-17 10:35:36 +11:00
Rhys Perry	8a52e4cc4f	radv: use dithered alpha-to-coverage This matches the behaviour of AMDVLK and hides banding. It is also seems to be allowed by the Vulkan spec. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-16 20:49:23 +00:00
Alok Hota	187a6506a3	swr/rast: Store cached files in multiple subdirs This improves cache filesystem performance, especially during CI tests Also updated jitcache magic number due to codegen parameter changes Removed 2 `if constexpr` to prevent C++17 requirement	2019-01-16 13:53:30 -06:00
Alok Hota	bb98be61f4	swr/rast: New execution engine per JIT Fixes relocation errors with LLVM 7.0.0	2019-01-16 13:53:30 -06:00
Alok Hota	b135db5d58	swr/rast: Scope MEM_CLIENT enum for mem usages Avoids confusion with other defaulted integer parameters - fixed some unspecified usages - removed unnecessary includes - removed unecessary protected access specifier in buckets framework	2019-01-16 13:53:30 -06:00
Alok Hota	c722ad7379	swr/rast: Unaligned and translations in gathers - added graphics address translation in odd gathers - added support for unaligned gathers in fetch shader - changed how 2+ GB offsets are handled to make them compatible with unaligned offsets	2019-01-16 13:53:30 -06:00
Alok Hota	9459863dfa	swr/rast: partial support for Tiled Resources - updated sample from TRTT surfaces correctly - implemented mapped status return for TRTT surfaces - implemented per-sample instruction minLod clamp - updated bilinear filter weight calculation to be closer to D3D specs - implemented "ReducedTexcoordRange" operation from D3D specs to avoid loss of precision on high-value normalized coordinates	2019-01-16 13:53:30 -06:00
Alok Hota	9cacf9d877	swr/rast: Add annotator to interleave isa text To make debugging simpler	2019-01-16 13:53:30 -06:00
Alok Hota	c9fa2ee343	swr/rast: Use gfxptr_t value in JitGatherVertices Use gfxptr_t type value for stream pointer uses in gather and similar calls	2019-01-16 13:53:30 -06:00
Gert Wollny	e68777c87c	autotools: Deprecate the use of autotools Since Meson will eventually be the only build system deprecate autotools now. It can still be used by invoking configure with the flag --enable-autotools NAKed-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Kristian H. Kristensen <hoegsberg@chromium.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Gert Wollny <gw.fossdev@gmail.com>	2019-01-16 09:52:42 -08:00
Dylan Baker	431e9abaab	meson: allow building dri driver without window system if osmesa is classic This was already enabled for gallium based osmesa with gallium drivers in `9d10581897`, so do the same for classic driver with classic osmesa. Fixes: `cbbd5bb889` ("meson: build classic osmesa") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-01-16 17:49:51 +00:00
Bruce Cherniak	ed7673afd2	gallium/swr: Fix multi-context sync fence deadlock. Various recreation scenarios lead to API thread getting stuck in swr_fence_finish(). This is a multi-context issue, whereby one context overwrites the fence read-value with a previous sync's lesser value. The fence sync value is supposed to be always increasing. In swr_fence_cb(), only update the "read" value if the new value is greater. (This may seem like we're not waiting on the other context to finish, but had we needed for it to finish there would have been a wait prior to submitting a new sync.) cc: mesa-stable@lists.freedesktop.org	2019-01-16 09:26:36 -06:00
Samuel Pitoiset	d5d7b5e950	ac/nir: don't trash L1 caches for store operations with writeonly memory Ported from RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-16 13:57:22 +01:00
Kenneth Graunke	5b51d754d0	st/mesa: Optionally override RGB/RGBX dst alpha blend factors Intel's blending hardware does not properly return 1.0 for destination alpha for RGBX formats; it requires the factors to be overridden to either zero or one. Broadcom vc4 and v3d also could use this override. While overriding these factors is safe in general, Nouveau and Radeon would prefer not to. Their blending hardware already returns correct values for RGB/RGBX formats, and would like to avoid the resulting per-buffer blending and independent blend factors (rgb != a) since it can cause additional overhead. I considered simply handling this in the driver, but it's not as nice. pipe_blend_state doesn't have any format information, so we'd need the hardware blend state to depend on both pipe_blend_state and pipe_framebuffer_state. Furthermore, Intel GPUs don't have a native RGBX_SNORM format, so I avoid exposing one, which makes Gallium fall back to RGBA_SNORM. The pipe_surfaces we get in the driver have an RGBA format, making it impossible to tell that there shouldn't be an alpha channel. One could argue that st not handling it in that case is a bug. To work around this, we'd have to expose RGBX pipe formats, mapped to RGBA hardware formats, and add format swizzling special cases. All doable, but it ends up being more code than I'd like. st_atom_blend already has access to the right information and it's trivial to accomplish there, so we just add a cap bit and do that. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-15 20:53:44 -08:00
Marek Olšák	11735d6c9c	winsys/amdgpu: fix whitespace	2019-01-15 19:10:16 -05:00
Pierre Moreau	0b736f7fd4	meson: Fix with_gallium_icd to with_opencl_icd `with_gallium_icd` is never used throughout the different Meson build files, whereas `with_opencl_icd` tracks whether or not `gallium-opencl` was set to "icd". Fixes: `42ea0631f1` ("meson: build clover") Signed-off-by: Pierre Moreau <pierre.morrow@free.fr> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-01-15 13:06:50 -08:00
Kenneth Graunke	d644698b44	gallium: Add the ability to query a single pipeline statistics counter Gallium historically has treated pipeline statistics queries as a single query, PIPE_QUERY_PIPELINE_STATISTICS, which returns a block of 11 values. This was originally patterned after the D3D1x API. Much later, Brian introduced an OpenGL extension that exposed these counters - but it exposes 11 separate queries, each of which returns a single value. Today, st/mesa simply queries all 11 values, and returns a single value. While pipeline statistics counters aren't typically performance critical, this is still not a great fit. A D3D1x->GL translator might request all 11 counters by creating 11 separate GL queries...which Gallium would map to reads of all 11 values each time, resulting in a total 121 counter reads. That's not ideal. This patch adds a new cap, PIPE_CAP_QUERY_PIPELINE_STATISTICS_SINGLE, and corresponding query type PIPE_QUERY_PIPELINE_STATISTICS_SINGLE. When calling create_query(), q->index should be set to one of the PIPE_STAT_QUERY_* enums to select a counter. Unlike the block query, this returns the value in pipe_query_result::u64 (as it's a single value) instead of the pipe_query_data_pipeline_statistics group. We update st/mesa to expose ARB_pipeline_statistics_query if either capability is set, preferring the new SINGLE variant when available. Thanks to Roland, Ilia, and Marek for helping me sort this out. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-15 11:43:04 -08:00
Kenneth Graunke	f967273fb4	st/mesa: Rearrange PIPE_QUERY_PIPELINE_STATISTICS result fetching. This just changes the order of the switch statements, so we only look at target if the query type is PIPE_QUERY_PIPELINE_STATISTICS. The next commit will introduce a new SINGLE query type which can be used for the same GL query types, and it won't want this processing. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-15 11:43:04 -08:00
Kenneth Graunke	e760be08b4	st/mesa: Make an enum for pipeline statistics query result indices. Gallium handles pipeline statistics queries as a single query (PIPE_QUERY_PIPELINE_STATISTICS) which returns a struct with 11 values. Sometimes it's useful to refer to each of those values individually, rather than as a group. To avoid hardcoding numbers, we define a new enum for each value. Here, the name and enum value correspond to the index in the struct pipe_query_data_pipeline_statistics result. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-15 11:43:04 -08:00
Dylan Baker	4a131a1330	meson: Add a script to extract the cmd line used for meson Upstream I'm persuing a more comprehensive solution, but this should prove a suitable stop-gap measure in the meantime. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109325 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Eric Engestrom <eric@engestrom.ch> Acked-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-15 17:38:47 +00:00
Samuel Pitoiset	7bef192018	radv: add support for VK_EXT_memory_budget A simple Vulkan extension that allows apps to query size and usage of all exposed memory heaps. The different usage values are not really accurate because they are per drm-fd, but they should be close enough. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-15 11:18:37 +01:00
Samuel Pitoiset	9784400a6b	radv: add two small helpers for getting VRAM and visible VRAM sizes Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-15 11:18:35 +01:00
Samuel Pitoiset	a6e5ce5130	radv: remove unnecessary returns in GetPhysicalDevice*Properties() These functions return nothing. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-15 11:18:17 +01:00
Bas Nieuwenhuizen	568e7a2998	radv: Set partial_vs_wave for pipelines with just GS, not tess. Looking at -pro we need to enable it for pipelines with just a GS too. This seems to reduce the hangs from https://bugs.freedesktop.org/show_bug.cgi?id=109242 on a RX 550 to the point where I can't reproduce, after the false start with the wd_switch_on_eop patch due to flakiness. (but people are reporting it does not fix the issue completely for them on polaris 11) CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-15 10:22:30 +01:00
Marek Olšák	5183e794af	radeonsi: also apply the GS hang workaround to draws without tessellation ported from AMDVLK. Cc: 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-14 18:55:58 -05:00
Eric Anholt	bd09bb1629	v3d: SHARED but not necessarily SCANOUT buffers on RO must be linear. We don't have a way to talk to RO about modifiers it can do yet, so assume the minimum.	2019-01-14 15:40:55 -08:00
Eric Anholt	f72820c851	v3d: Add support for CS barrier() intrinsics.	2019-01-14 15:40:55 -08:00
Eric Anholt	9b45b06d7c	v3d: Add support for CS shared variable load/store/atomics. CS shared variables are handled effectively as SSBO access to a temporary buffer that will be allocated at CS dispatch time.	2019-01-14 15:40:55 -08:00
Eric Anholt	01d913cf90	v3d: Add support for CS workgroup/invocation id intrinsics. We get a payload for the ivec3 workgroup and an int local invocation index, and we use the core lowering to turn into the global invocation id and the local invocation id ivec3s.	2019-01-14 15:40:55 -08:00
Eric Anholt	6281f26f06	v3d: Add support for shader_image_load_store. This is only exposed on V3D 4.1+, because we didn't have the TMU write operations for images on 3.3 (To do GLES 3.1 there, you have to lower it to SSBO load/stores, which is a problem to solve later).	2019-01-14 15:40:55 -08:00
Eric Anholt	5932c2f0b9	v3d: Add SSBO/atomic counters support. So far I assume that all the buffers get written. If they weren't, you'd probably be using UBOs instead.	2019-01-14 15:40:55 -08:00
Eric Anholt	6c8edcb89c	v3d: Drop the GLSL version level. This was an arbitrary "we support lots of stuff" value when I started the driver. However, at 400 we expose OES_gpu_shader5, which claims support for dynamically indexing samplers, which the driver doesn't do yet.	2019-01-14 13:18:02 -08:00
Eric Anholt	1a63227ea0	v3d: Add support for matrix inputs to the FS. We've been relying on linking splitting up our varying matrices into separate vectors, but with SSO that doesn't happen. Supporting matrix inputs isn't too hard, though.	2019-01-14 13:18:02 -08:00
Eric Anholt	49b7e26fac	v3d: Add an isr to the simulator to catch GMP violations. Otherwise, the simulator raises the GMP interrupt and waits for it to be handled, and v3d ends up spinning in v3d_hw_tick(). Aborting right when violation happens gives us a chance to look at the backtrace of whatever thread triggered the violation.	2019-01-14 13:18:02 -08:00
Eric Anholt	3790ee07e6	v3d: Fix txf_ms 2D_ARRAY array index. We need to pass the array index through our coordinate transform unchanged. Fixes dEQP-GLES31.functional.texture.multisample.samples_1.*_2d_array	2019-01-14 13:18:02 -08:00
Eric Anholt	619a28b845	v3d: Add support for GL_ARB_framebuffer_no_attachments. Fixes dEQP-GLES31.functional.state_query.integer.max_framebuffer_height_getboolean when GLES3 is enabled.	2019-01-14 13:18:02 -08:00
Eric Anholt	051a41d3d5	v3d: Add support for the early_fragment_tests flag. If this flag hasn't been set by the shader and it has some visible side effects, then we need to disable EZ.	2019-01-14 13:18:02 -08:00
Eric Anholt	b417a9f7b2	v3d: Add support for flushing dirty TMU data at job end. This will be needed for SSBOs and image_load_store.	2019-01-14 13:18:02 -08:00
Samuel Pitoiset	ad6ceb2872	ac: add missing 16-bit types to glsl_base_to_llvm_type() Fix crashes with dEQP-VK.spirv_assembly.instruction.compute.workgroup_memory.*16 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-14 21:18:23 +01:00
Bas Nieuwenhuizen	76b12fa564	radv: Only use 32 KiB per threadgroup on Stoney. Causes hangs on some machines. What works for dEQP-VK.tessellation.shader_input_output.barrier: - running num_patches = 6 (which limits LDS to 32 KiB) - running num_patches = 8, and artificially cutting LDS size at 32 KiB. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-14 19:58:27 +00:00
Marek Olšák	76df5e8f52	st/dri: fix dri2_format_table for argb1555 and rgb565 The bug caused that rgb565 framebuffers used argb1555. Fixes: `433ca3127a` Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-14 14:54:19 -05:00
Jason Ekstrand	2d2737dcfe	nir: Add a bool to float32 lowering pass From @jekstrand's nir-1-bit-bool branch, with improved ior/inot lowering. ior: fmax instead of fadd allows removing the fsat. inot: seq(x, 0) can be better than fsub(1, x). On a2xx, it works better with the scalar instruction set. Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-01-14 19:27:06 +00:00
Caio Marcelo de Oliveira Filho	09c3ff01df	src/intel: use new hash table and set creation helpers Replace calls to create hash tables and sets that use _mesa_hash_pointer/_mesa_key_pointer_equal with the helpers _mesa_pointer_hash_table_create() and _mesa_pointer_set_create(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Eric Engestrom <eric@engestrom.ch>	2019-01-14 10:49:33 -08:00
Caio Marcelo de Oliveira Filho	9fdded0cc3	src/compiler: use new hash table and set creation helpers Replace calls to create hash tables and sets that use _mesa_hash_pointer/_mesa_key_pointer_equal with the helpers _mesa_pointer_hash_table_create() and _mesa_pointer_set_create(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Eric Engestrom <eric@engestrom.ch>	2019-01-14 10:49:28 -08:00
Caio Marcelo de Oliveira Filho	ee23e8b17c	util: Helper to create sets and hashes with pointer keys These combinations are common enough and deserve a shortcut. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Eric Engestrom <eric@engestrom.ch>	2019-01-14 10:49:21 -08:00
Samuel Pitoiset	929df7afaf	ac/nir: set cache policy when loading/storing buffer images This was missing. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-14 17:59:51 +01:00
Samuel Pitoiset	af2a85df74	ac/nir: add get_cache_policy() helper and use it Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-14 17:59:49 +01:00
Jason Ekstrand	5e4f9ea363	anv: Implement VK_KHR_depth_stencil_resolve	2019-01-14 10:16:52 -06:00
Jason Ekstrand	9f44088468	anv: Move resolve_subpass to genX_cmd_buffer.c We may have to do transitions around certain kinds of resolves so it helps to have it genX code.	2019-01-14 10:16:52 -06:00
Jason Ekstrand	930b17161f	anv/blorp: Refactor MSAA resolves into an exportable helper function This function is modeled after the aux_op functions except that it has a lot more parameters because it deals with two images as well as source and destination regions.	2019-01-14 10:16:52 -06:00
Jason Ekstrand	c92c449361	anv: Rename has_resolve to has_color_resolve	2019-01-14 10:16:52 -06:00
Jason Ekstrand	4bd976e3b8	intel/blorp: Add two more filter modes	2019-01-14 10:16:52 -06:00
Andres Gomez	3ec9ab80b8	bin/get-pick-list.sh: fix redirection in sh "&>" is bash specific. Fixes: `e0dbfc9953` ("bin/get-pick-list.sh: warn when commit lists invalid sha") Cc: Juan A. Suarez <jasuarez@igalia.com> Cc: Eric Engestrom <eric.engestrom@intel.com> Cc: Dylan Baker <dylan@pnwbakers.com> Cc: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2019-01-14 17:40:15 +02:00
Andres Gomez	716ed41a36	bin/get-pick-list.sh: fix the oneline printing "--summary" will also print extended header information such as creations, renames and mode changes. Let's just use "--no-patch", which suppresses the diff output. v2: Use "--no-patch" instead of the "-s" abbreviation (Eric). Fixes: `559c32d241` ("bin/get-pick-list.sh: simplify git oneline printing") Cc: Juan A. Suarez <jasuarez@igalia.com> Cc: Eric Engestrom <eric.engestrom@intel.com> Cc: Dylan Baker <dylan@pnwbakers.com> Cc: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2019-01-14 17:36:56 +02:00
Michel Dänzer	1a20b56798	amd/common: Restore v4i32 suffix for llvm.SI.load.const intrinsic It was accidentally dropped in commit `e4803ab7d2` "amd/common: use llvm.amdgcn.s.buffer.load for LLVM 8.0", breaking the universe with LLVM 7. Trivial.	2019-01-14 12:52:52 +01:00
Nicolai Hähnle	7fbd48fdc0	amd/common/vi+: enable SMEM loads with GLC=1 Only on LLVM 8.0+, which supports the new intrinsic. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-14 08:30:15 +01:00
Nicolai Hähnle	e4803ab7d2	amd/common: use llvm.amdgcn.s.buffer.load for LLVM 8.0 llvm.SI.load.const is deprecated. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-14 08:30:12 +01:00
Iago Toral Quiroga	1c1ae6376c	anv/pipeline_cache: free NIR shader cache Fixes: `f6aa9f7185` 'anv/pipeline_cache: Add support for caching NIR' Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-14 07:59:27 +01:00
Danylo Piliaiev	0862929bf6	glsl: Fix copying function's out to temp if dereferenced by array Function's out variable could be an array dereferenced by an array: func(v[w[i]]); or something more complicated. Copy index in any case. Fixes: `76c27e47b9` ("glsl: Copy function out to temp if we don't directly ref a variable") Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-14 12:04:07 +11:00
Kenneth Graunke	04c2f12ab2	i965: Drop mark_surface_used mechanism. The original idea was that the backend compiler could eliminate surfaces, so we would have it mark which ones are actually used, then shrink the binding table accordingly. Unfortunately, it's a pretty blunt mechanism - it can only prune things from the end, not the middle - since we decide the layout before we even start the backend compiler, and only limit the size. It also basically gives up if it sees indirect array access. Besides, we do the vast majority of our surface elimination in NIR anyway, not the backend - and I don't see that trend changing any time soon. Vulkan abandoned this plan a long time ago, and I don't use it in Iris, but it's still been kicking around in i965. I hacked shader-db to print the binding table size in bytes, and observed no changes with this patch. So, this code appears to do nothing useful. Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-13 09:35:32 -08:00
Eric Engestrom	bdf6a5c1d2	egl: fix python lib deprecation warning DeprecationWarning: the imp module is deprecated in favour of importlib Instead of complicated logic, just import the file directly. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-01-13 13:59:08 +00:00
Jason Ekstrand	b938d5fbef	spirv: Emit switch conditions on-the-fly Instead of emitting all of the conditions for the cases of a switch statement up-front, emit them on-the-fly as we emit the code for each case. The original justification for this was that we were going to have to build a default case anyway which would need them all. However, we can just trust CSE to clean up the mess in that case. Emitting each condition right before the if statement that uses it reduces register pressure and, in one customer benchmark, reduces spilling and improves performance by about 2x. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-12 17:55:49 -06:00
Jason Ekstrand	821b6861ec	nir/gcm: Support deref instructions Even though no one's been brave enough to ever use this pass, I like to keep it functionally working. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-12 17:55:49 -06:00
Jason Ekstrand	24c8108ea6	intel/nir: Call nir_opt_deref in brw_nir_optimize It's an optimization so we should probably be calling it in the optimization loop. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-12 17:55:49 -06:00
Jason Ekstrand	e57e26121a	spirv: Contain the GLSLang issue #179 workaround to old GLSLang Instead of applying the workaround universally, detect semi-old GLSLang via the generator ID and only enable the workaround on old GLSLang. This isn't nearly as precise as one would like it to be because the first GLSLang generator id version bump was on October 7, 2017 which is about 1.5 years after the bug was fixed. However, it at least lets us disable it for non-GLSLang and for more modern versions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-12 17:55:49 -06:00
Jason Ekstrand	b57c1ec421	spirv: Whack sampler/image pointers to uniform A long time in a galaxy far far away, there was a GLSLang bug with how it handled samplers passed in as function parameters. (The bug can be found here: https://github.com/KhronosGroup/glslang/issues/179.) Unfortunately, that version was shipped in several apps and has been causing heartburn for our SPIR-V parser ever since. Recent changes to NIR uncovered a moderately old bug in how we work around this issue. In particular, we ended up with a deref_cast from uniform to local which is not a no-op cast so nir_opt_deref wasn't getting rid of the cast. The only reason why it worked before was because someone just happened to call nir_fixup_deref_modes which "fixed" the cast (that shouldn't be happening) and then a later round of copy-prop would get rid of it. The fact that the deref_cast survived that long without causing trouble for other parts of NIR is a bit surprising. Just whacking the mode of the pointer seems to fix it fairly unobtrusively. Currently, only apps with this bug will have a local variable containing an image or sampler. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109304 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-12 17:55:49 -06:00
Kenneth Graunke	2b876bc922	st/nir: Lower TES gl_PatchVerticesIn to a constant if linked with a TCS. If the TCS and TES are linked together, we can simply replace the TES's gl_PatchVerticesIn system value with a constant, possibly allowing extra optimization or letting the driver avoid uploading a special value. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-01-11 13:07:54 -08:00
Jonathan Marek	3d182601bb	glsl/nir: keep bool types when native_integers=false With the new handling of bool types, the conversion to float in glsl_to_nir should not apply to bool types anymore. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-11 19:16:11 +00:00
Jonathan Marek	b27ad17115	glsl/nir: ftrunc for native_integers=false float to int cast out_type in the default cast case is always GLSL_TYPE_FLOAT, so we get a mov otherwise. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-11 19:16:11 +00:00
Jonathan Marek	d3b47e073e	glsl/nir: int constants as float for native_integers=false All alu instructions emitted with native_integers=false expect float (or bool in some cases) constants, so this change is necessary. This will cause changes with some intrinsics which had integer sources, such as nir_intrinsic_load_uniform. Apparently it might cause issues with some opt passes, but perhaps those don't apply in OpenGL ES 2.0 cases? Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-11 19:16:11 +00:00
Jason Ekstrand	1ede463b6e	intel/peephole_ffma: Fix swizzle propagation The num_components value passed into get_mul_for_src is used to only compose the parts of the swizzle that we know will be used so we don't compose invalid swizzle components. However, we had a bug where we passed the number of components of the add all the way through. For the given source, we need the number of components read from that source. In the case where we have a narrow add, say 2 components, that is sourced from a chain of wider instructions, we may not compose all the swizzles. All we really need to do is pass through the right number of components at each level. Fixes: `2231cf0ba3` "nir: Fix output swizzle in get_mul_for_src" Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-01-11 10:44:08 -06:00
Kenneth Graunke	ae683ed3bc	nir: Allow a non-existent sampler deref in nir_lower_samplers_as_deref GL_ARB_gl_spirv does not provide a sampler deref for e.g. texelFetch(), so we can't assume that both are present and identical. Simply lower each if it is present. Fixes regressions in GL_ARB_gl_spirv tests since I switched everyone to using this pass. Thanks to Alejandro Piñeiro for catching these. Fixes: `f003859f97` nir: Make gl_nir_lower_samplers use gl_nir_lower_samplers_as_deref Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-11 07:54:32 -08:00
Eric Engestrom	e12b0b5c6d	travis: avoid using unset llvm-config Fixes the following errors: usage: which [-as] program ... /Users/travis/.travis/job_stages: line 110: --version: command not found ... caused by the use of an undefined $LLVM_CONFIG Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-11 14:38:35 +00:00
Eric Engestrom	c8ae891035	egl: remove unused include Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-11 14:37:47 +00:00
Eric Engestrom	d75fbff667	egl: add missing includes Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Emil Velikov <emil.velikov@collabora.com>	2019-01-11 14:37:47 +00:00
Iago Toral Quiroga	4b1e436bc9	anv/pipeline_cache: fix incorrect guards for NIR cache Fixes: `f6aa9f7185` 'anv/pipeline_cache: Add support for caching NIR' Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-11 12:45:18 +01:00
Kenneth Graunke	ad9832d17b	blorp: Pass the batch to lookup/upload_shader instead of context This will allow drivers to pin shader buffers if necessary. i965 and anv do not need to do this today, but iris will. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-10 20:52:04 -08:00
Kenneth Graunke	084a1cdbb7	blorp: Add blorp_get_surface_address to the driver interface. Currently, BLORP expects drivers to provide two functions for dealing with buffers: blorp_emit_reloc and blorp_surface_reloc. Both record a relocation and combine the BO address and offset into a full 64-bit address. Traditionally, blorp_surface_reloc has written that combined address to an implicitly-known buffer where surface states are stored. (In contrast, blorp_emit_reloc returns the value.) The upcoming Iris driver stores surface states in multiple buffers, which makes it impossible for blorp_surface_reloc to write the combined address - it only takes an offset, not the actual buffer to write to. This commit adds a third function, blorp_get_surface_address, which combines and returns an address, which is then passed to ISL's surface state fill functions. Softpin-only drivers can return a real address here and skip writing it in blorp_surface_reloc. Relocation-based drivers are have options. They can simply return 0 from the new function, and continue writing the address from blorp_surface_reloc. Or, they can return a presumed address from blorp_get_surface_address, and have other relocation processing write the real value later. For now, i965 and anv simply return 0. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-10 20:51:53 -08:00
Ilia Mirkin	2165636e9c	docs: fix gallium screen cap docs Make sure that the next line starts with spaces so that bullets are maintained throughout, add `` around a few more special tokens, and fix SAMPLE_COUNT_TEXTURE -> SAMPLE_COUNT. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-01-10 21:44:09 -05:00
Danylo Piliaiev	a2db6b4254	glsl: Make invariant outputs in ES fragment shader not to cause error In all GLSL ES versions output variables in fragment shader are allowed to be invariant. From Section 4.6.1 ("The Invariant Qualifier") GLSL ES 1.00 spec: "Only the following variables may be declared as invariant: ... - Built-in special variables output from the fragment shader." From Section 4.6.1 ("The Invariant Qualifier") GLSL ES 3.00 spec: "Only variables output from a shader can be candidates for invariance." Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107842	2019-01-11 13:01:11 +11:00
Jason Ekstrand	eb4b1477dc	anv/pipeline: Cache the pre-lowered NIR This adds a second level of caching for the pre-lowered NIR that's only based off of the shader module, entrypoint and specialization constants. This is enough for spirv_to_nir as well as our first round of lowering and optimization. Caching at this level should allow for faster shader recompiles due to state changes. The NIR caching does not get serialized to disk via either the VkPipelineCache serialization mechanism or the transparent on-disk cache. We could but it's usually not that expensive to fall back to SPIR-V for the odd cache miss especially if it only happens once for several misses and it simplifies the cache. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-10 19:15:27 -06:00
Jason Ekstrand	f6aa9f7185	anv/pipeline_cache: Add support for caching NIR Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-10 19:15:27 -06:00
Jason Ekstrand	8dfda5ebbe	anv/pipeline: Hash shader modules and spec constants separately The stuff hashed by anv_pipeline_hash_shader is exactly the inputs to anv_shader_compile_to_nir so it can be used for NIR caching. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-10 19:15:27 -06:00
Jason Ekstrand	b90e55a5d5	compiler/types: Serialize/deserialize subpass input types correctly They have glsl_sampler_dim enum values of 8 and 9 which don't work when you & them with 0x7. Fortunately, we have plenty of bits. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-10 19:15:27 -06:00
Jason Ekstrand	73ddfbeb85	anv/pipeline: Move wpos and input attachment lowering to lower_nir This lets us make anv_pipeline_compile_to_nir take a device instead of a pipeline. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-10 19:15:27 -06:00
Matt Turner	32e266a9a5	i965: Compile fp64 funcs only if we do not have 64-bit hardware support Brown bag fix...	2019-01-10 15:22:17 -08:00
Jason Ekstrand	8ea8727a87	anv/pipeline: Constant fold after apply_pipeline_layout Thanks to the new NIR load_descriptor intrinsic added by the UBO/SSBO lowering series, we weren't getting UBO pushing because the UBO range detection pass couldn't see the constants it needed. This fixes that problem with a quick round of constant folding. Because we're folding we no longer need to go out of our way to generate constants when we lower the vulkan_resource_index intrinsic and we can make it a bit simpler. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-01-10 20:34:00 +00:00
Rob Clark	031e94dc72	freedreno/a6xx: fix 3d+tiled layout The last round of fixing 3d layer+level layout skipped the tiled case, since tiled texture support was not in place yet. This finishes the job. Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-10 14:21:39 -05:00
Rob Clark	c92c18c70c	freedreno/a6xx: move tile_mode to sampler-view CSO This is known when the CSO is created, so no need to patch it in later. Also, it seems like smaller textures where the first level is small enough to be linear, it seems like we should set linear tile mode. See: dEQP-GLES3.functional.texture.format.unsized.rgb_unsigned_byte_3d_pot Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-10 14:21:39 -05:00
Rob Clark	eb625d30b7	freedreno/a6xx: separate stencil restore/resolve fixes Previously we'd use format/etc from the primary (z32) buffer for the stencil (s8), due to confusion about rsc vs psurf. Rework this to drop extra arg and push down handling of separate stencil case (and make sure we take the fmt from the right place). This doesn't completely fix separate-stencil, but at least it avoids the GPU scribbling over random other cmdstream buffers and causing a bunch of bogus fails in dEQP. Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-10 14:21:39 -05:00
Rob Clark	04aff7e42b	freedreno: make cmdstream bo's read-only to GPU If nothing else, this will make problems with cmdstream getting blit over with pixels easier to track down (ie. faults when it first happens rather than strange failures later from corrupted cmdstream when a stateobj is later reused). (NOTE this somewhat depends on the kernel supporting the flag, and the iommu implementation. But the worst case is just that the cmdstream ends up writeable as before.) Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-10 14:21:39 -05:00
Guido Günther	286de96af8	etnaviv: fix typo in cflush_all description Signed-off-by: Guido Günther <guido.gunther@puri.sm> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-01-10 18:46:10 +01:00
Eric Engestrom	53fbde4df3	radv: remove a few more unnecessary KHR suffixes Cc: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (v1)	2019-01-10 16:53:44 +00:00
Rhys Perry	0210243923	nir: fix copy-paste error in nir_lower_constant_initializers Fixes: `393b59e077` ('nir: Rework nir_lower_constant_initializers() to handle functions') Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-10 10:51:52 -06:00
Andres Gomez	6c3164cd08	docs: complete the calendar and release schedule documentation As suggested by Emil Velikov. Cc: Dylan Baker <dylan.c.baker@intel.com> Cc: Juan A. Suarez <jasuarez@igalia.com> Cc: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Acked-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-01-10 15:53:02 +02:00
Andres Gomez	428164d87f	glsl/linker: specify proper direction in location aliasing error The check for location aliasing was always asuming output variables but this validation is also called for input variables. Fixes: `e2abb75b0e` ("glsl/linker: validate explicit locations for SSO programs") Cc: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-10 15:51:57 +02:00
Andres Gomez	e2e03f84f9	editorconfig: Add max_line_length property The property is supported by the most of the editors, but not all: https://github.com/editorconfig/editorconfig/wiki/EditorConfig-Properties#max_line_length Cc: Eric Engestrom <eric@engestrom.ch> Cc: Eric Anholt <eric@anholt.net> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-01-10 15:50:34 +02:00
Tapani Pälli	864cc419eb	intel/isl: move tiled_memcpy static libs from i965 to isl Patch moves intel_tiled_memcpy[_sse41] libraries to isl, renames some functions and types and makes the required build system changes for meson, automake and Android. No functional changes are introduced. v2: code cleanups, move isl_get_memcpy_type to i965 (Jason) v3: move isl_mem_copy_fn to priv header, cleanups (Jason, Dylan) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-10 08:02:30 +02:00
Matt Turner	406f603b34	i965: Enable 64-bit GLSL extensions Now that we have software implementations of ARB_gpu_shader_int64 and ARB_gpu_shader_fp64 we can unconditionally enable these extensions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:41 -08:00
Matt Turner	613ac3aaa2	i965: Compile fp64 software routines and lower double-ops Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:41 -08:00
Matt Turner	18b4e87370	intel/compiler: Heap-allocate temporary storage Shaders containing software implementations of double-precision operations can be very large such that we cannot stack-allocate an array of grf_count*16. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:41 -08:00
Matt Turner	622d429128	intel/compiler: Expand size of the 'nr' field Shaders containing software implementations of double-precision operations can be very large such that we have more the 2^16 virtual registers during optimization. Move the 'nr' field to the union containing the immediate storage and expand it to 32-bits. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:41 -08:00
Matt Turner	7e4e9da90d	intel/compiler: Prevent warnings in the following patch The next patch replaces an unsigned bitfield with a plain unsigned, which triggers gcc to begin warning on signed/unsigned comparisons. Keeping this patch separate from the actual move allows bisectablity and generates no additional warnings temporarily. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:41 -08:00
Matt Turner	2b801b6668	intel/compiler: Rearrange code to avoid future problems A follow on commit will move nr to the same union as the immediate data, so we should assert these invariants before we overwrite the nr field. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:41 -08:00
Matt Turner	3b967e1724	intel/compiler: Avoid false positive assertions A follow on patch will move the 'nr' field to the union containing the immediate field, so prepare by checking that we're only testing these assertions if the .file is correct. The assertions with != ARF were kind of silly to begin with because the <128 check is specifically only for things in the GRF. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:41 -08:00
Matt Turner	8534742404	intel/compiler: Split 64-bit MOV-indirects if needed Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:40 -08:00
Matt Turner	e76772af6c	intel/compiler: Lower 64-bit MOV/SEL operations	2019-01-09 16:42:40 -08:00
Matt Turner	2623653126	nir: Unset metadata debug bit if no progress made NIR metadata validation verifies that the debug bit was unset (by a call to nir_metadata_preserve) if a NIR optimization pass made progress on the shader. With the expectation that the NIR shader consists of only a single main function, it has been safe to call nir_metadata_preserve() iff progress was made. However, most optimization passes calculate progress per-function and then return the union of those calculations. In the case that an optimization pass makes progress only on a subset of the functions in the shader metadata validation will detect the debug bit is still set on any unchanged functions resulting in a failed assertion. This patch offers a quick solution (short of a larger scale refactoring which I do not wish to undertake as part of this series) that simply unsets the debug bit on unchanged functions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-09 16:42:40 -08:00
Matt Turner	e633fae5cb	nir: Add lowering support for 64-bit operations to software Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-09 16:42:40 -08:00
Matt Turner	fe2cbcf3ee	nir: Create nir_builder in nir_lower_doubles_impl() We're going to use it more in a future patch, and this avoids a lot of gross code. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-09 16:42:40 -08:00
Matt Turner	ecb115eb3f	nir: Add and set info::uses_64bit Will be used to communicate that a shader uses 64-bit operations to the concerned lowering passes. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-09 16:42:40 -08:00
Matt Turner	41f3e9e5f5	nir: Implement lowering of 64-bit shift operations Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-09 16:42:40 -08:00
Matt Turner	62d55f1281	nir: Wire up int64 lowering functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-09 16:42:40 -08:00
Jason Ekstrand	adab27e741	nir: Add some more int64 lowering helpers [mattst88]: Found in an old branch of Jason's. Jason implemented: inot, iand, ior, iadd, isub, ineg, iabs, compare, imin, imax, umin, umax Matt implemented: ixor, bcsel, b2i, i2b, i2i8, i2i16, i2i32, i2i64, u2u8, u2u16, u2u32, u2u64, and fixed ilt Reviewed-by: Elie Tournier <tournier.elie@gmail.com>	2019-01-09 16:42:40 -08:00
Matt Turner	dde73e646f	nir: Tag entrypoint for easy recognition by nir_shader_get_entrypoint() We're going to have multiple functions, so nir_shader_get_entrypoint() needs to do something a little smarter. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-09 16:42:40 -08:00
Matt Turner	393b59e077	nir: Rework nir_lower_constant_initializers() to handle functions Previously it assumed that only a single function (the entrypoint) existed and attempted to lower constant initializers of shader outputs for each function, for instance.	2019-01-09 16:42:40 -08:00
Sagar Ghuge	f998ce4111	glsl: Add "built-in" functions to do fp32_to_int64(fp32) Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-01-09 16:42:40 -08:00
Sagar Ghuge	2632c12477	glsl: Add "built-in" functions to do fp32_to_uint64(fp32) Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-01-09 16:42:40 -08:00
Sagar Ghuge	876a4b85fe	glsl: Add "built-in" functions to do fp64_to_int64(fp64) Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-01-09 16:42:40 -08:00
Sagar Ghuge	21e9bb2b3f	glsl: Add utility function to round and pack int64_t value Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-01-09 16:42:40 -08:00
Sagar Ghuge	5a674fd789	glsl: Add "built-in" functions to do fp64_to_uint64(fp64) Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-01-09 16:42:40 -08:00
Sagar Ghuge	5a87441807	glsl: Add utility function to round and pack uint64_t value Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-01-09 16:42:40 -08:00
Sagar Ghuge	c9d333a6b7	glsl: Add "built-in" functions to do int64_to_fp32(int64_t) Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-01-09 16:42:40 -08:00
Sagar Ghuge	d5cf6e92b4	glsl: Add "built-in" functions to do uint64_to_fp32(uint64_t) Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-01-09 16:42:40 -08:00
Sagar Ghuge	b830efb191	glsl: Add "built-in" functions to do int64_to_fp64(int64_t) Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-01-09 16:42:40 -08:00
Sagar Ghuge	7c5b982b89	glsl: Add "built-in" functions to do uint64_to_fp64(uint64_t) Reviewed-by: Elie Tournier <tournier.elie@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-01-09 16:42:40 -08:00
Matt Turner	15757bc80b	glsl: Add "built-in" functions to convert bool to double And vice versa. Reviewed-by: Elie Tournier <tournier.elie@gmail.com>	2019-01-09 16:42:40 -08:00
Matt Turner	e213f3871f	glsl: Add "built-in" functions to do ffract(fp64) Reviewed-by: Elie Tournier <tournier.elie@gmail.com>	2019-01-09 16:42:40 -08:00
Matt Turner	5c9a659f50	glsl: Add "built-in" function to do ffloor(fp64) Reviewed-by: Elie Tournier <tournier.elie@gmail.com>	2019-01-09 16:42:40 -08:00
Matt Turner	83762afa66	glsl: Add "built-in" functions to do fmin/fmax(fp64) Reviewed-by: Elie Tournier <tournier.elie@gmail.com>	2019-01-09 16:42:40 -08:00
Matt Turner	92ac2169fb	glsl: Add "built-in" functions to do ffma(fp64) Definitely not actually a fused-multiply add. Reviewed-by: Elie Tournier <tournier.elie@gmail.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	3db81b5d9f	glsl: Add "built-in" functions to do round(fp64) Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	48891ab441	glsl: Add "built-in" functions to do trunc(fp64) v2: use mix. Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	2119094b1d	glsl: Add "built-in" functions to do sqrt(fp64) Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	cad58fc5e7	glsl: Add "built-in" functions to do fp32_to_fp64(fp32) Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	407bd1bbf9	glsl: Add "built-in" functions to do fp64_to_fp32(fp64) Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	f499942b31	glsl: Add "built-in" functions to do int_to_fp64(int) v2: use mix Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	773190f281	glsl: Add "built-in" functions to do fp64_to_int(fp64) v2: use mix Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	cbf090b809	glsl: Add "built-in" functions to do uint_to_fp64(uint) Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	a3551ee61f	glsl: Add "built-in" functions to do fp64_to_uint(fp64) Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	4a93401546	glsl: Add "built-in" functions to do mul(fp64, fp64) v2: use mix Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	f111d72596	glsl: Add "built-in" functions to do add(fp64, fp64) v2: use mix and findMSB to optimise. v3: [Sagar] Fix zFrac0 == 0u case in __normalizeRoundAndPackFloat64 Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	c036fc97a2	glsl: Add "built-in" functions to do lt(fp64, fp64) Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	3e4d5ea7b8	glsl: Add utility function to extract 64-bit sign Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:40 -08:00
Elie Tournier	ec6e823a99	glsl: Add "built-in" functions to do eq/ne(fp64, fp64)	2019-01-09 16:42:40 -08:00
Elie Tournier	c802cdde9d	glsl: Add "built-in" function to do sign(fp64) v2: use mix. Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	eac66f0248	glsl: Add "built-in" functions to do neg(fp64) v2: use mix. Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Elie Tournier	0428951b9d	glsl: Add "built-in" function to do abs(fp64) Signed-off-by: Elie Tournier <elie.tournier@collabora.com>	2019-01-09 16:42:40 -08:00
Matt Turner	b63a1f8e40	glsl: Create file to contain software fp64 functions The following patches will add implementations of various double-precision operations to this file. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:40 -08:00
Ian Romanick	412472da5c	glsl: Add utility to convert text files to C strings Will be used to convert the .glsl source file containing software fp64 routines to a .h file that can be included while building the compiler. This commit contains two squashed together: the first from Ian adding the utility (with the existing title), and the second from Dylan making the code both python2 and python3 compatible. This is somewhat modeled after the xxd utility that comes with Vim. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> xxd.py: Make python2 and 3 compatible This makes use of unicode_literals, so that undecorated strings are considered text (python2 unicode, python3 str) and not bytes in python2 and text in python3. It makes use of io.open, which provides python2 with python3's open behavior (it's an alias in python3), in particular support for the 't' and 'b' option. Finally, it decorates all of the string literals with the 'b' prefix, so that python interprets them as bytes. I've removed the stdin and stdout options, as python2 always requires these to be bytes, but python3 always treats them as text (there is a way to get at the underlying bytes buffer, but that's even more complexity), and makes the input files required arguments. In the meson we use the '@INPUT@' shorthand instead of listing each input, as meson will expand that to [prog_python, '@INPUT0@', @INPUT1@, ..., @OUTPUT@, ...]	2019-01-09 16:42:40 -08:00
Timothy Arceri	76c27e47b9	glsl: Copy function out to temp if we don't directly ref a variable Otherwise we can end up with IR that looks like this: ( (declare (temporary ) vec4 f@8) (assign (xyzw) (var_ref f@8) (var_ref f) ) (call f16 ((swiz y (var_ref f@8) ))) (assign (xyzw) (var_ref f) (var_ref f@8) ) )) When we really need: (declare (temporary ) float inout_tmp) (assign (x) (var_ref inout_tmp) (swiz y (var_ref f) )) (call f16 ((var_ref inout_tmp) )) (assign (y) (var_ref f) (swiz y (swiz xxxx (var_ref inout_tmp) ))) (declare (temporary ) void void_var) The GLSL IR function inlining code seemed to produce correct code even without this but we need the correct IR for GLSL IR -> NIR to be able to understand whats going on. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:40 -08:00
Matt Turner	63f6d7afd6	glsl: Add function support to glsl_to_nir Based on a patch from Tim Arceri, but I had to substantially rewrite it as a result of the NIR derefs rework. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-09 16:42:40 -08:00
Francisco Jerez	230a8a541d	intel/fs: Remove FS_OPCODE_UNPACK_HALF_2x16_SPLIT opcodes. These are broken on a future platform, but it turns out we don't need to fix them, since they're just type-converting moves with strided source. Kill them. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-09 12:03:09 -08:00
Francisco Jerez	cbea91eb57	intel/fs: Remove nasty open-coded CHV/BXT 64-bit workarounds. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-09 12:03:09 -08:00
Francisco Jerez	2c99c7a56c	intel/fs: Remove existing lower_conversions pass. It's redundant with the functionality provided by lower_regioning now. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-09 12:03:09 -08:00
Francisco Jerez	efa4e4bc5f	intel/fs: Introduce regioning lowering pass. This legalization pass is meant to handle situations where the source or destination regioning controls of an instruction are unsupported by the hardware and need to be lowered away into separate instructions. This should be more reliable and future-proof than the current approach of handling CHV/BXT restrictions manually all over the visitor. The same mechanism is leveraged to lower unsupported type conversions easily, which obsoletes the lower_conversions pass. v2: Give conditional modifiers the same treatment as predicates for SEL instructions in lower_dst_modifiers() (Iago). Special-case a couple of other instructions with inconsistent conditional mod semantics in lower_dst_modifiers() (Curro). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-09 12:03:09 -08:00
Francisco Jerez	b94519971a	intel/fs: Constify fs_inst::can_do_source_mods(). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-09 12:03:09 -08:00
Francisco Jerez	c301f447ea	intel/fs: Respect CHV/BXT regioning restrictions in copy propagation pass. Currently the visitor attempts to enforce the regioning restrictions that apply to double-precision instructions on CHV/BXT at NIR-to-i965 translation time. It is possible though for the copy propagation pass to violate this restriction if a strided move is propagated into one of the affected instructions. I've only reproduced this issue on a future platform but it could affect CHV/BXT too under the right conditions. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-09 12:03:08 -08:00
Francisco Jerez	464e79144f	intel/eu/gen7: Fix brw_MOV() with DF destination and strided source. I triggered this bug while prototyping code for a future platform on IVB. Could be a problem today though if a strided move is copy-propagated into a type-converting move with DF destination. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-09 12:03:08 -08:00
Francisco Jerez	bc781a0323	intel/fs: Fix bug in lower_simd_width while splitting an instruction which was already split. This seems to be a problem in combination with the lower_regioning pass introduced by a future commit, which can modify a SIMD-split instruction causing its execution size to become illegal again. A subsequent call to lower_simd_width() would hit this bug on a future platform. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-09 12:03:08 -08:00
Francisco Jerez	812ede088f	intel/fs: Implement quad swizzles on ICL+. Align16 is no longer a thing, so a new implementation is provided using Align1 instead. Not all possible swizzles can be represented as a single Align1 region, but some fast paths are provided for frequently used swizzles that can be represented efficiently in Align1 mode. Fixes ~90 subgroup quad swap Vulkan CTS tests. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-09 12:03:08 -08:00
Francisco Jerez	c5f9c0009d	intel/fs: Handle source modifiers in lower_integer_multiplication(). lower_integer_multiplication() implements 32x32-bit multiplication on some platforms by bit-casting one of the 32-bit sources into two 16-bit unsigned integer portions. This can give incorrect results if the original instruction specified a source modifier. Fix it by emitting an additional MOV instruction implementing the source modifiers where necessary. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2019-01-09 12:03:08 -08:00
Andrii Simiklit	0206ffc28d	anv/pipeline: remove unnecessary null-pointer check Looks like it is impossible that 'last' variable is a null because at least the get_vs_prog_data shouldn't return a null pointer. So this check is unnecessary starts from commit: `99d497c5b6` "anv/pipeline: Replace get_fs_input_map with ..." This small issue is found by cppcheck. Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-09 12:29:12 -06:00
Indrajit Das	d2c170eb35	st/va: Return correct status from vlVaQuerySurfaceStatus This ensures that during encoding, applications can get the correct status of the surface before submitting more operations on the same. Reviewed-by: Leo Liu <leo.liu@amd.com> Signed-off-by: Indrajit Das <indrajit-kumar.das@amd.com>	2019-01-09 11:34:22 -05:00
Roland Scheidegger	0c226d40ef	Revert "llvmpipe: Always return some fence in flush (v2)" This reverts commit `f6a6da8131`. With this commit we see massive amounts of asserts triggering in lp_fence_wait(), assert(f->issued), for instance with libgl_xlib state tracker and piglit. Not entirely sure if the assert could just be removed.	2019-01-09 17:28:53 +01:00
Marek Olšák	e986c1ca1d	st/mesa: don't leak pipe_surface if pipe_context is not current We have found some pipe_surface leaks internally. This is the same code as surface_destroy in radeonsi. Ideally, surface_destroy would be in pipe_screen. Cc: 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2019-01-09 11:08:44 -05:00
Marek Olšák	fd82a1d1d6	st/mesa: don't reference pipe_surface locally in PBO code Reviewed-by: Brian Paul <brianp@vmware.com>	2019-01-09 11:08:44 -05:00
Marek Olšák	5da442338b	st/mesa: unify window-system renderbuffer initialization Reviewed-by: Brian Paul <brianp@vmware.com>	2019-01-09 11:08:44 -05:00
Mario Kleiner	5e30e54e05	radeonsi: Fix use of 1- or 2- component GL_DOUBLE vbo's. With Mesa 18.1, commit `be973ed21f`, si_llvm_load_input_vs() changed the number of source 32-bit wide dword components used for fetching vertex attributes into the vertex shader from a constant 4 to a variable num_channels number, depending on input data format, with some special case handling for input data formats like 64-Bit doubles. In the case of a GL_DOUBLE input data format with one or two components though, e.g, submitted via ... a) glTexCoordPointer(1, GL_DOUBLE, 0, buffer); b) glTexCoordPointer(2, GL_DOUBLE, 0, buffer); ... the input format would be SI_FIX_FETCH_RG_64_FLOAT, but no special case handling was implemented for that case, so in the default path the number of 32-bit dwords would be set to the number of float input components derived from info->input_usage_mask. This ends with corrupted input to the vertex shader, because fetching a 64-bit double from the vbo requires fetching two 32-bit dwords instead of 1, and fetching a two double input requires 4 dword fetches instead of 2, so in these cases the vertex shader receives incomplete/truncated input data: a) float v = gl_MultiTexCoord0.x; -> v.x is corrupted. b) vec2 v = gl_MultiTexCoord0.xy; -> v.x is assigned correctly, but v.y is corrupted. This happens with the standard TGSI IR compiled shaders. Under NIR with R600_DEBUG=nir, we got correct behavior because the current radeonsi nir code always assigns info->input_usage_mask = TGSI_WRITEMASK_XYZW, thereby always fetches 4 dwords regardless of what the shader actually needs. Fix this by properly assigning 2 or 4 dword fetches for one or two component GL_DOUBLE input. Fixes: `be973ed21f` ("radeonsi: load the right number of components for VS inputs and TBOs") Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com> Cc: mesa-stable@lists.freedesktop.org Cc: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2019-01-09 11:08:44 -05:00
Rhys Perry	ee8488ea3b	ac/nir,radv,radeonsi/nir: use correct indices for interpolation intrinsics Fixes artifacts in World of Warcraft when Multi-sample Alpha-Test is enabled with DXVK. It also fixes artifacts with Fallout 4's god rays with DXVK. Various piglit interpolateAt*() tests under NIR are also fixed. v2: formatting fix update commit message to include Fallout 4 and the Fixes tag Fixes: `f4e499ec79` ('radv: add initial non-conformant radv vulkan driver') Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106595 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>	2019-01-09 14:57:07 +00:00
Samuel Pitoiset	b8c4f523b4	radv: skip draws with instance_count == 0 Loosely based on RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-09 14:22:38 +01:00
Samuel Pitoiset	a2b5cc3c39	radv: enable variable pointers The Vulkan spec 1.1.97 says: "variablePointers specifies whether the implementation supports the SPIR-V VariablePointers capability. When this feature is not enabled, shader modules must not declare the VariablePointers capability." As the SPIR-V feature is enabled, we should turn on the extension feature as well. All dEQP-VK.spirv_assembly.instruction.compute.variable_pointers.* pass with the khronos internal repo. Note that a bunch of them fails with the public repo, but it's expected as they violate the specification. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-09 12:32:18 +01:00
Samuel Pitoiset	d58b11e709	radv: get rid of bunch of KHR suffixes Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Eric Engestrom <eric.engestrom@intel.com> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-09 12:26:48 +01:00
Maya Rashish	a2ddb710fd	radeon: fix printf format specifier. From glibc printf(3): Z A nonstandard synonym for z that predates the appearance of z. Do not use in new code. Z may not exist on non-glibc systems. Prefer the standard symbol. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-01-09 14:15:06 +11:00
Tomasz Figa	f6a6da8131	llvmpipe: Always return some fence in flush (v2) If there is no last fence, due to no rendering happening yet, just create a new signaled fence and return it, to match the expectations of the EGL sync fence API. Fixes random "Could not create sync fence 0x3003" assertion failures from Skia on Android, coming from the following code: https://android.googlesource.com/platform/frameworks/base/+/master/libs/hwui/pipeline/skia/SkiaOpenGLPipeline.cpp#427 Reproducible especially with thread count >= 4. One could make the driver always keep the reference to the last fence, but: - the driver seems to explicitly destroy the fence whenever a rendering pass completes and changing that would require a significant functional change to the code. (Specifically, in lp_scene_end_rasterization().) - it still wouldn't solve the problem of an EGL sync fence being created and waited on without any rendering happening at all, which is also likely to happen with Android code pointed to in the commit. Therefore, the simple approach of always creating a fence is taken, similarly to other drivers, such as radeonsi. Tested with piglit llvmpipe suite with no regressions and following tests fixed: egl_khr_fence_sync conformance eglclientwaitsynckhr_flag_sync_flush eglclientwaitsynckhr_nonzero_timeout eglclientwaitsynckhr_zero_timeout eglcreatesynckhr_default_attributes eglgetsyncattribkhr_invalid_attrib eglgetsyncattribkhr_sync_status v2: - remove the useless lp_fence_reference() dance (Nicolai), - explain why creating the dummy fence is the right approach. Signed-off-by: Tomasz Figa <tfiga@chromium.org>	2019-01-09 02:06:13 +01:00
Eric Anholt	700aeaf9c8	glsl: Fix buffer overflow with an atomic buffer binding out of range. The binding is checked against the limits later in the function, so we need to make sure we don't overflow before the check here. Fixes this valgrind warning (and sometimes segfault): ==1460== Invalid write of size 4 ==1460== at 0x74C98DD: ast_declarator_list::hir(exec_list, _mesa_glsl_parse_state) (ast_to_hir.cpp:4943) ==1460== by 0x74C054F: _mesa_ast_to_hir(exec_list, _mesa_glsl_parse_state) (ast_to_hir.cpp:159) ==1460== by 0x7435C12: _mesa_glsl_compile_shader (glsl_parser_extras.cpp:2130) in dEQP-GLES31.functional.debug.negative_coverage.get_error.compute. exceed_atomic_counters_limit Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-01-08 15:44:58 -08:00
Eric Anholt	211b826790	nir: Make nir_deref_instr_build/get_const_offset actually use size_align. I think this was copy-and-paste mistake -- nir_opt_large_constants was passing in glsl_get_natural_size_align_bytes() given brw_nir.c's arguments to the opt pass. I wanted to reuse this function for handling constant offsets of arrays of images in V3D. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-01-08 15:40:53 -08:00
Danylo Piliaiev	9f29d90327	glsl/linker: Fix unmatched TCS outputs being reduced to local variable Always match TCS outputs since they are shared by all invocations within the patch and should not be converted to local variables. This is one of the issues found in Downward. Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104297	2019-01-09 10:31:13 +11:00
Eric Anholt	db3b6b6bca	v3d: Enable GL_ARB_texture_gather on V3D 4.x. This is part of GLES 3.1, and with the NIR lowering we're now passing the GLES31 testcases.	2019-01-08 13:03:44 -08:00
Eric Anholt	6051c11d17	nir: Add nir_lower_tex support for Broadcom's swizzled TG4 results. V3D returns the texels in a different order in the resulting vec4 from what GLSL wants, so we need to put in a swizzle. Fixes dEQP-GLES31.functional.texture.gather.basic.2d.rgba8.base_level.level_1 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-08 13:03:41 -08:00
Bas Nieuwenhuizen	3fcec4a550	freedreno: Move register constant files to src/freedreno. This way they can be shared. Build tested with meson, but not too sure on the autotools stuff though. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Rob Clark <robdclark@gmail.com>	2019-01-08 21:46:14 +01:00
Caio Marcelo de Oliveira Filho	baabfb1959	nir: fix warning in nir_lower_io.c Initialize the variable with NULL. Fixes the following In file included from ../src/compiler/nir/nir_lower_io.c:34: ../src/compiler/nir/nir_lower_io.c: In function ‘nir_lower_explicit_io’: ../src/compiler/nir/nir.h:668:11: warning: ‘addr’ may be used uninitialized in this function [-Wmaybe-uninitialized] return src; ^~~ ../src/compiler/nir/nir_lower_io.c:735:17: note: ‘addr’ was declared here nir_ssa_def *addr; ^~~~ v2: Avoid using a 'default' case so we get help from the compiler when new deref types are added. (Lionel) Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-08 12:29:56 -08:00
Chia-I Wu	3cb65cf8aa	freedreno/drm: sync uapi again "pad" was missing in Mesa's msm_drm.h. sizeof(drm_msm_gem_info) remains the same, but now the compiler initializes the field to zero. Buffer allocation results in EINVAL without this for me. Cc: Rob Clark <robdclark@gmail.com> Cc: Kristian Høgsberg <hoegsberg@gmail.com> Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com>	2019-01-08 19:55:28 +00:00
Chia-I Wu	6eeb1fe491	meson: fix EGL/X11 build without GLX dep_xcb and others were not set under this configuration. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-01-08 10:58:48 -08:00
Eric Engestrom	b38a48a569	wsi: drop unneeded KHR suffix Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-08 18:48:03 +00:00
Eric Engestrom	4f5a526789	anv: drop unneeded KHR suffix Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-08 18:47:56 +00:00
Karol Herbst	d0c6ef2793	nir: rename global/local to private/function memory the naming is a bit confusing no matter how you look at it. Within SPIR-V "global" memory is memory accessible from all threads. glsl "global" memory normally refers to shader thread private memory declared at global scope. As we already use "shared" for memory shared across all thrads of a work group the solution where everybody could be happy with is to rename "global" to "private" and use "global" later for memory usually stored within system accessible memory (be it VRAM or system RAM if keeping SVM in mind). glsl "local" memory is memory only accessible within a function, while SPIR-V "local" memory is memory accessible within the same workgroup. v2: rename local to function as well v3: rename vtn_variable_mode_local as well Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-08 18:51:46 +01:00
Dylan Baker	401dca1c73	autotools: Remove tegra vdpau driver This has never functioned and probably wont ever function, due to the way gallium media state trackers are architected and the tegra video decoder is architected. Cc: Thierry Reding <thierry.reding@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Fixes: `1755f608f5` ("tegra: Initial support")	2019-01-08 09:42:56 -08:00
Pierre Moreau	ba55cb2bcd	clover/meson: Ignore 'svn' suffix when computing CLANG_RESOURCE_DIR The version exported by LLVM in its CMake configuration files can include the “svn” suffix when building a development version (for example “8.0.0svn”). However the exported clang headers are still found under “lib/clang/8.0.0/”, without the “svn” suffix. Meson takes care of removing the “svn” suffix from the version when using the dependency’s `version()` method. This processing is already performed in “configure.ac” when using autotools. Signed-off-by: Pierre Moreau <pierre.morrow@free.fr> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-01-08 08:53:38 -08:00
Lionel Landwerlin	add5a2ec92	anv: flush fast clear colors into compressed surfaces In the following scenario : 1. Create image format R8G8B8A8_UNORM 2. Create image view format R8G8B8A8_SRGB 3. Clear the view through a sub pass to a particular color 4. Barrier on the image to from color attachment to source transfer 5. Copy the image into a linear buffer to check the content The step 4 resolving the clear color is unaware of the SRGB format of the view, because the blorp resolve operations operate on images the color associated with the resolve will not operate on SRGB format but UNORM. Leading to the wrong color being written into surfaces. This change forces a clear color resolve at the end of the render pass so following resolves won't have to deal with the clear color with a format that doesn't match the image's format. On gfxbench vulkan_5_normal 1280x720, this appear to cost us ~0.5fps, from 49.316 down to 48.949. v2: Only fast clear resolve when image & view have different formats (Lionel) v3: Update warning (Jason) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108911 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Cc: mesa-stable@lists.freedesktop.org	2019-01-08 16:37:00 +00:00
Lionel Landwerlin	366eb656ac	anv: explictly specify format for blorp ccs/mcs op Resolve operations can happen when dealing with view (begin/end subpasses) in which case the view's format needs to apply, not the image's format. v2: Relayout arguments of a ccs_op() call (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108911 Cc: mesa-stable@lists.freedesktop.org	2019-01-08 16:36:56 +00:00
Tapani Pälli	c292414765	dri3: initialize adaptive_sync as false before configQueryb Fixes following errors from valgrind output: ==23388== Conditional jump or move depends on uninitialised value(s) ==23388== at 0x48B4924: loader_dri3_drawable_init (loader_dri3_helper.c:381) ==23388== by 0x48A97D2: dri3_create_drawable (dri3_glx.c:386) ==23388== by 0x489E190: driFetchDrawable (dri_common.c:369) ==23388== by 0x48A9187: dri3_bind_context (dri3_glx.c:195) ==23388== by 0x488B75C: MakeContextCurrent (glxcurrent.c:220) ==23388== by 0x488B8DB: glXMakeCurrent (glxcurrent.c:267) ==23388== by 0x10A987: ??? (in /usr/bin/glxgears) ==23388== by 0x4BEB412: (below main) (in /usr/lib64/libc-2.28.so) ==23388== ==23388== Conditional jump or move depends on uninitialised value(s) ==23388== at 0x48B5A40: loader_dri3_swap_buffers_msc (loader_dri3_helper.c:923) ==23388== by 0x48A9B7E: dri3_swap_buffers (dri3_glx.c:587) ==23388== by 0x4887A81: glXSwapBuffers (glxcmds.c:857) ==23388== by 0x10ADED: ??? (in /usr/bin/glxgears) ==23388== by 0x4BEB412: (below main) (in /usr/lib64/libc-2.28.so) Fixes: `2e12fe425f` "loader/dri3: Enable adaptive_sync via _VARIABLE_REFRESH property" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>	2019-01-08 08:15:07 +02:00
Dave Airlie	4298a85ae8	virgl: use primconvert provoking vertex properly This stores the raster state and calls the correct primconvert interface using the currently bound raster state. Reviewed-By: Gert Wollny <gert.wollny@collabora.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2019-01-08 12:06:41 +10:00
Jason Ekstrand	754eff07d2	anv: Sort properties and features switch statements Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-07 18:41:15 -06:00
Jason Ekstrand	05d72d6d48	spirv: Sort supported capabilities Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-07 18:41:15 -06:00
Jason Ekstrand	34af63fa22	anv: Enable the new deref-based UBO/SSBO path Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	63b9aa2e25	spirv: Add support for using derefs for UBO/SSBO access For now, it's hidden behind a cap. Hopefully, we can eventually drop that along with all the manual offset code in spirv_to_nir. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	3a7c5667c8	spirv: Make better use of vtn_pointer_uses_ssa_offset The choice of whether or not we should use block_load/store isn't a choice between external and not so much as a choice between deref instructions and manually calculated offsets. In vtn_pointer_from_ssa, we guard the index+offset case behind vtn_pointer_uses_ssa_offset and then branch out from there. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	adc155a815	spirv: Add explicit pointer types Instead of baking in uvec2 for UBO and SSBO pointers and uint for push constant and shared memory pointers, make it configurable. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	be039cb467	spirv: Choose atomic deref type with pointer_uses_ssa_offset Previously, we hard-coded the rule about workgroup variables and the builder lower_workgroup_access_to_offsets flag. Instead base it on the handy helper we have for exactly this sort of thing. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	5c3cb9c3ce	spirv: Add error checking for Block and BufferBlock decorations Variable pointers being well-defined across the block boundary requires a couple of very specific SPIR-V validation rules. Normally, we'd trust the validator to catch these but since CTS tests have been found in the wild which violate them, we'll carry our own checks. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	e90b738f20	nir/vulkan: Add a descriptor type to vulkan resource intrinsics Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	f393b10b3f	nir/lower_io: Add "explicit" IO lowering This new pass is for lowering explicitly laid out memory coming in from SPIR-V or a similar source. It's quite a bit more complicated than the normal lower_io because we have to be able to handle matrices. The way the stride information is stored for matrices is awkward and dealing with row-major matrices is especially painful. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	52dd43c7ef	nir/validate: Allow array derefs on vectors in more modes Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	013ee5732b	nir/intrinsics: Add access flags to load/store_deref Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	7755171e4c	nir/intrinsics: Allow deref sources to consume anything This commit adds a new num_components value for intrinsic sources of -1 which means that it consumes everything and the number of components effectively isn't validated. This is useful for deref sources which just take the result of the deref and we leave it up to the driver to decide what that size should be. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	d0fe52a456	nir/validate: Allow derefs in phi nodes We added this assert when first moving derefs over to instructions to ensure that deref chains could go all the way back to the variables. Now that we're going to start using derefs for things that we can do variable pointers on such as UBOs and SSBOs, we need to be able to run derefs through phi nodes, selects, and basically anything else. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	7e85480a67	nir/remove_dead_variables: Properly handle deref casts We already detect any incomplete deref chains (where the deref is used for something other than another deref or a load/store) and flag the variable as used thanks to deref_used_for_not_store. All that's left to do is to properly skip casts when cleaning up. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	78d80f7db2	nir/deref: Skip over casts in fixup_deref_modes This pass is used when, for instance, we lazily change the mode of variables rather than replacing the variable with a new one. Since we only do this in cases where we know we have full deref chains, it's ok to just skip them in fixup_deref_modes. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	d8e3edb784	nir/deref: Support casts and ptr_as_array in comparisons The code which constructs deref paths already gives you the path starting at the nearest deref_cast or deref_var. All we need to do for casts is handle the case where the start of the path isn't a deref_var. For ptr_as_array derefs, we just bail if we have any after the divergence point between the two derefs. We may be able to do better in the future but this works for now. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	a1c688517d	nir/opt_deref: Properly optimize ptr_as_array derefs When handling casts, we can't blindly propagate the parent of a cast into a ptr_as_array deref because doing so might loose the stride information from the cast. Instead, before we can propagate into ptr_as_array derefs, we need to check that the cast is a cast of an array deref and that the stride matches. For other types of derefs, we can continue to propagate casts as normal because they don't need the stride. We also add an optimization which can combine a ptr_as_array deref with it parent if it is also an array deref of some form. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	427558a717	nir/validate: Don't allow derefs in if conditions Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	e94a027af8	nir: Add a ptr_as_array deref type These correspond directly to SPIR-V's OpPtrAccessChain. As such, they treat whatever their parent gives them as if it's the first element in some array and dereferences that array. If the parent is, itself, an array deref, then the two indices can just be added together to get the final array deref. However, it can also be used in cases where what you have is a dereference to some random vec2 value somewhere. In this case, we require a cast before the ptr_as_array and use the ptr_stride field in the cast to provide a stride for the ptr_as_array derefs. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	fc9c4f89b8	nir: Move propagation of cast derefs to a new nir_opt_deref pass We're going to want to do more deref optimizations going forward and this gives us a central place to do them. Also, cast propagation will get a bit more complicated with the addition of ptr_as_array derefs. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	bf1a1eed88	spirv: Propagate layout decorations to created glsl_types Instead of just storing the decorations in the vtn_type, propagate them all the way through to the glsl_type. For array strides, this means we need to handle them earlier so we break array stride handling into it's own function and explicitly call it for both pointer and array types. Due to type deduplication in the SPIR-V, we may have explicit layout decorations on all sorts of types that don't actually want them. In order to prevent these leaking into unfortunate places in NIR, we explicitly strip them off before creating NIR variables and when casting pointers to non-external memory. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-08 00:38:30 +00:00
Jason Ekstrand	6cebeb4f71	glsl_type: Add support for explicitly laid out matrices and arrays SPIR-V allows for matrix and array types to be decorated with explicit byte stride decorations and matrix types to be decorated row- or column-major. This commit adds support to glsl_type to encode this information. Because this doesn't work nicely with std430 and std140 alignments, we add asserts to ensure that we don't use any of the std430 or std140 layout functions with explicitly laid out types. In SPIR-V, the layout information for matrices is applied to the parent struct member instead of to the matrix type itself. However, this is gets rather clumsy when you're walking derefs trying to compute offsets because, the moment you hit a matrix, you have to crawl back the deref chain and find the struct. Instead, we take the same path here as we've taken in spirv_to_nir and put the decorations on the matrix type itself. This also subtly adds support for strided vector types. These don't come up in SPIR-V directly but you can get one as the result of taking a column from a row-major matrix or a row from a column-major matrix. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	7f70b3e555	glsl_type: Simplify glsl_channel_type This is C++ so we can just poke at the fields of glsl_type if we wish and calling get_instance is way easier and more reliable than handling each instance separately. While we're at it, we re-arrange the base type labels to match the enum order and add 8-bit type support. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	d8a11bfc08	glsl_type: Add a C wrapper to get struct field offsets Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	d34f19feba	glsl_type: Drop the glsl_get_array_instance C helper It was added in `bce6f99875` even though it's completely redundant with glsl_array_type(). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	a700a82bda	nir: Distinguish between normal uniforms and UBOs Previously, NIR had a single nir_var_uniform mode used for atomic counters, UBOs, samplers, images, and normal uniforms. This commit splits this into nir_var_uniform and nir_var_ubo where nir_var_uniform is still a bit of a catch-all but the nir_var_ubo is specific to UBOs. While we're at it, we also rename shader_storage to ssbo to follow the convention. We need this so that we can distinguish between normal uniforms and UBO access at the deref level without going all the way back variable and seeing if it has an interface type. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	c9a4135e14	nir: Allow storing to shader_storage I have no idea how shader_storage made it into the list of banned variable modes for stores but it clearly should be allowed. This only doesn't cause us a problem today because we never actually use derefs on shader_storage variables. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	cd93b0a670	nir/validate: Require array indices to match the deref bit size This doesn't currently change anything because array indices are required to be 32 bits and all derefs are also 32 bits. However, we will one day have 64-bit derefs for OpenCL. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	abfe674c54	spirv: Handle arbitrary bit sizes for deref array indices We already had code in link_as_ssa to handle bit sizes; we just need to use it. While we're at it we clean up link_as_ssa a bit and add an explicit bit_size parameter in preparation for a day when we have derefs that aren't 32 bit. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	bfe31c5e46	nir/builder: Add nir_i2i and nir_u2u helpers which take a bit size Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com	2019-01-08 00:38:29 +00:00
Jason Ekstrand	639c236e74	spirv: Emit NIR deref instructions on-the-fly This simplifies our deref handling by emitting the actual NIR deref instructions on-the-fly instead of of building up a deref chain and then emitting them at the last moment. In order for this to work with the parts of the compiler that assume they can chase deref chains, we have to run nir_rematerialize_derefs_in_use_blocks_impl to put the derefs back in the right places. Otherwise, in cases such as loop continues where the SPIR-V blocks are not in the same order as the NIR blocks, we may end up with a deref chain with a parent that does not dominate it's child and nir_repair_ssa_impl will insert phis in the deref chain. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	c59f07684c	spirv: Sign-extend array indices The SPIR-V spec was recently updated to clarify that array indices are treated as signed integers. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	f8992eb5ba	anv/apply_pipeline_layout: Set the cursor in lower_res_reindex_intrinsic The loop through instructions doesn't set the cursor for us so unless we set it somewhere, we may end up emitting instructions in the wrong place. The only reason why we haven't been bitten by this in the past is that it only happens in a few variable pointers cases and the CTS tests for those don't use much control flow so things were getting emitted in the correct order by accident. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	42b2f3e91f	spirv: Handle any bit size in vector_insert/extract This crops up both in the actual SPIR-V VectorInsert/Extract opcodes as well as various places where we deal with vector derefs. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Jason Ekstrand	a392ddb781	glsl_type: Support serializing 8 and 16-bit types Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-01-08 00:38:29 +00:00
Bas Nieuwenhuizen	70ed049cc6	spirv: Fix matrix parameters in function calls. They can be handled exactly the same as arrays, we just need to handle the base type correctly in the switches. Fixes: `a45b6fb452` "spirv: Pass SSA values through functions" Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109204 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-08 01:30:03 +01:00
Bas Nieuwenhuizen	3cc940277a	radv: Fix rasterization precision bits. Note that these limits are exact, not a "precision is at least x", as texel coords also get snapped to a multiple of this step size before filtering. This fixes CTS tests dEQP-VK.texture.explicit_lod.2d.sizes.31x55_nearest_linear_mipmap_nearest_repeat dEQP-VK.texture.explicit_lod.2d.sizes.57x35_nearest_linear_mipmap_nearest_repeat Fixes: `f4e499ec79` "radv: add initial non-conformant radv vulkan driver" Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109151 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-07 23:27:30 +01:00
Kenneth Graunke	f003859f97	nir: Make gl_nir_lower_samplers use gl_nir_lower_samplers_as_deref These days, we have two sampler lowering passes. The newer one, gl_nir_lower_samplers_as_deref, is used by radeonsi. It rewrites variables to drop structures out of sampler deref chains, to make life simpler. It then sets var->data.binding for non-bindless sampler and image variables based on the GL uniform storage's opaque index values. The older one converts sampler deref chains (nir_tex_src_texture_deref) to a numerical offset (nir_tex_src_texture_offset). It also stores the constant-valued portion of that number in tex->texture_index, making life really simple for drivers that don't support indirects. It too pokes at GL uniform storage's opaque index values. Logically, we can do the first pass (simplify derefs, set bindings) then the second (turn derefs to offsets, set texture_index). This patch does exactly that, eliminating some redundancy (only one pass has to poke at GL uniform storage), and gaining proper var->data.binding values for drivers using the full lowering. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-07 14:25:04 -08:00
Kenneth Graunke	c69f9297cf	nir: Fix gl_nir_lower_samplers_as_deref's structure type handling. We recurse to remove structures, and at each step, re-modify the resulting type for our link in the deref chain. For arrays, the result of recursion is the new underlying type - so we wrap it with the array dimensionality again. For structs, we want to simply use the new underlying type, skipping the struct altogether. The correct way to do this is to do nothing at all. Previously, we had reset type to next->type, which is the /old/ field type, not the new field type we obtained by recursing. This undid our recursive work. Fixes about 338 tests with nested structs, such as: dEQP-GLES2.functional.uniform_api.value.initial.get_uniform.nested_structs_arrays.sampler2D_samplerCube_fragment Note that currently only radeonsi uses this pass, and NIR support is disabled there by default, so the breakage was likely not seen by most people. The next commit uses this pass for more drivers, so this fix prevents regressions from that change. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-07 14:25:04 -08:00
Bas Nieuwenhuizen	be6cee51c0	amd/common: Add some parentheses to silence warning. [1/59] Compiling C object 'src/amd/common/src@amd@common@@amd_common@sta/ac_nir_to_llvm.c.o'. ../mesa/src/amd/common/ac_nir_to_llvm.c: In function ‘get_inst_tessfactor_writemask’: ../mesa/src/amd/common/ac_nir_to_llvm.c:4089:32: warning: suggest parentheses around ‘+’ inside ‘<<’ [-Wparentheses] writemask = ((1 << num_comps + 1) - 1) << first_component; ~~~~~~~~~~^~~ ../mesa/src/amd/common/ac_nir_to_llvm.c:4091:33: warning: suggest parentheses around ‘+’ inside ‘<<’ [-Wparentheses] writemask = (((1 << num_comps + 1) - 1) << first_component) << 4; Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-07 23:15:37 +01:00
Bas Nieuwenhuizen	64c83efaee	radv: Remove unused variable. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-07 23:15:33 +01:00
Bas Nieuwenhuizen	656c1c488c	radv: Remove device path. unused and gcc complains about strncpy. (from what I can see because strncpy does not leave a 0 byte on truncate. That said we don't use it so this does not fix a real bug). Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-07 23:15:14 +01:00
Marek Olšák	492ad9a402	ac: remove unused variable from ac_build_ddxy trivial	2019-01-07 14:51:25 -05:00
Andres Gomez	0cc01f45e7	glsl: correct typo in GLSL compilation error message v2: Add the "fix" tag (Erik). Fixes: `037f68d81e` ("glsl: apply align layout qualifier rules to block offsets") Cc: Timothy Arceri <tarceri@itsqueeze.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-07 19:07:33 +02:00
Jason Ekstrand	027835b1da	vulkan: Update the XML and headers to 1.1.97 Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-07 10:00:01 -06:00
Andres Gomez	6decc6b1d9	docs: update 18.3 and add 19.x cycles for the release calendar v2: replace incorrect "<td/>" with "<td>" (Eric). Cc: Dylan Baker <dylan.c.baker@intel.com> Cc: Juan A. Suarez <jasuarez@igalia.com> Cc: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Acked-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Juan A. Suarez <jasuarez@igalia.com>	2019-01-07 17:19:47 +02:00
Bas Nieuwenhuizen	110564fdec	anv/android: Do not reject storage images. We do the ImageFormatProperties check already, and rejecting an usage flag when both ImageFormatProperties and the WSI (which is Android) support it is not allowed. Intel does support storage for some of the support WSI formats, such as R8G8B8A8_UNORM, and looking at the ISL_SURF_USAGE_DISABLE_AUX_BIT, the imported images do not have any form of compression that would prevent this fix. v2: Also consider STORAGE bit for Gralloc usage bits. (From Kevin Strasser <kevin.strasser@intel.com>) Fixes: `053d4c328f` "anv: Implement VK_ANDROID_native_buffer (v9)" Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-07 15:20:55 +01:00
Bas Nieuwenhuizen	9a45a190ad	radv: Implement buffer stores with less than 4 components. We started using it in the btoi paths for r32g32b32, and the LLVM IR checker will complain about it because we end up with intrinsics with the wrong type extension in the name. Fixes: `593996bc02` ("radv: implement buffer to image operations for R32G32B32") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-01-07 14:54:14 +01:00
Jon Turney	00ad77b9f6	appveyor: Add a Cygwin build script	2019-01-07 13:40:58 +00:00
Jon Turney	5334dafee2	appveyor: put build steps in a script, rather than inline in appveyor.yml	2019-01-07 13:40:57 +00:00
Lucas Stach	d015888efb	etnaviv: annotate variables only used in debug build Some of the status variables in the compiler are only used in asserts and thus may be unused in release builds. Annotate them accordingly to avoid 'unused but set' warnings from the compiler. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-01-07 11:51:02 +01:00
Lucas Stach	b56d903b5a	etnaviv: enable full overwrite in a few more cases Take into account the render target format when checking if the color mask affects all channels of the RT. This allows to enable full overwrite in a few cases where a non-alpha format is used. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-01-07 11:50:23 +01:00
Timothy Arceri	6dade5d534	nir: avoid uninitialized variable warning Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109231	2019-01-07 10:57:00 +11:00
Timothy Arceri	17fac39398	st/glsl: refactor st_link_nir() The functional change here is moving the nir_lower_io_to_scalar_early() calls inside st_nir_link_shaders() and moving the st_nir_opts() call after the call to nir_lower_io_arrays_to_elements(). This fixes a bug with the following piglit test due to the current code not cleaning up dead code after we lower arrays. This was causing an assert in the new duplicate varyings link time opt introduced in `70be9afccb`. tests/spec/glsl-1.10/execution/vsfs-unused-array-member.shader_test Moving the nir_lower_io_to_scalar_early() calls also allows us to tidy up the code a little and merge some loops. Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-07 10:54:20 +11:00
Eric Anholt	8847370424	v3d: Use the core tex lowering. Even without any clever optimization on the unpack operations, this gives us a useful value for the channels read field, which we can use to avoid ldtmu instructions to the no-op register. instructions in affected programs: 890712 -> 881974 (-0.98%)	2019-01-04 15:59:59 -08:00
Eric Anholt	f217a94542	nir: Add nir_lower_tex options to lower sampler return formats. I've been doing this in the nir-to-vir and nir-to-qir backends of v3d and vc4, but nir could potentially do some useful stuff for us (like avoiding unpack/repacks) if we give it the information. v2: Skip lowering for txs/query_levels v3: Fix a crash on old-style shadow v4: Rename to tex_packing, use nir_format_unpack_sint/uint helpers, pack the enum. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-04 15:59:57 -08:00
Eric Anholt	a74f2aeb4f	nir: Allow nir_format_unpack_int/sint to unpack larger values. For V3D, I want to unpack 4-16-bit packed integers for 8 and 16-bit integer samplers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-04 15:59:30 -08:00
Jason Ekstrand	19c608fe43	intel/blorp: Be more conservative about copying clear colors In `92eb5bbc68` we attempted to avoid copying clear colors whenever we weren't doing a resolve. However, this broke MSAA resolves because we need the clear color in the source. This patch makes blorp much more conservative such that it only avoids the clear color copy if either aux_usage == NONE or it's explicitly doing a fast-clear. Fixes: `92eb5bbc68` "intel/blorp: Only copy clear color when doing..." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107728 Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-01-04 17:57:43 -06:00
Eric Anholt	81b9361b68	v3d: Stop scalarizing our uniform loads. We can pull a whole vector in a single indirect load. This saves a bunch of round-trips to the TMU, instructions for setting up multiple loads, references to the UBO base in the uniforms, and apparently manages to reduce register pressure as well. instructions in affected programs: 3086665 -> 2454967 (-20.47%) uniforms in affected programs: 919581 -> 721039 (-21.59%) threads in affected programs: 1710 -> 3420 (100.00%) spills in affected programs: 596 -> 522 (-12.42%) fills in affected programs: 680 -> 562 (-17.35%) Improves 3dmmes performance by 2.29312% +/- 0.139825% (n=5)	2019-01-04 15:41:23 -08:00
Eric Anholt	f8a8de8b9a	v3d: Do UBO loads a vector at a time. In the process of adding support for SSBOs and CS shared vars, I ended up needing a helper function for doing TMU general ops. This helper can be that starting point, and saves us a bunch of round-trips to the TMU by loading a vector at a time.	2019-01-04 15:41:23 -08:00
Eric Anholt	b0e0086257	v3d: Remove dead switch cases and comments from v3d_nir_lower_io. Moving things to NIR left this mess around. All we lower now is uniforms.	2019-01-04 15:41:23 -08:00
Eric Anholt	f8e6b364b0	v3d: Fix up VS output setup during precompiles. I noticed that a VS I was debugging was missing all of its output stores -- outputs_written was for POS, VAR0, VAR3, while the shader's variables were POS, VAR9, and VAR12. I'm not sure what outputs_written is supposed to be doing here, but we can just walk the declared variables and avoid both this bug and the emission of extra stvpms for less-than-vec4 varyings.	2019-01-04 15:41:23 -08:00
Eric Anholt	e1385e879d	v3d: Reinstate the new shader-db output after v3d_compile() refactor. I misplaced it in the rebase conflicts.	2019-01-04 15:26:19 -08:00
Caio Marcelo de Oliveira Filho	bbf9ee9b18	nir: remove dead code from copy_prop_vars When copy_prop_vars also took care of dead write handling, intrin was used as part of store_to_entry. Now it isn't, so this assignment isn't used really used. Add a comment clarifying what happens to intrin. Fixes: `4dfa7adc10` "nir: Remove handling of dead writes from copy_prop_vars" Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-04 15:18:41 -08:00
Lionel Landwerlin	31e4c9ce40	i965: add CS stall on VF invalidation workaround Even with the previous commit, hangs are still happening. The problem there is that the VF cache invalidate do happen immediately without waiting for previous rendering to complete. What happens is that we invalidate the cache the moment the PIPE_CONTROL is parsed but we still have old rendering in the pipe which continues to pull data into the cache with the old high address bits. The later rendering with the new high address bits then doesn't have the clean cache that it expects/needs. v2: Update commit message/explanation with Jason's Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Fixes: `a363bb2cd0` ("i965: Allocate VMA in userspace for full-PPGTT systems.") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109072	2019-01-04 11:18:54 +00:00
Lionel Landwerlin	92b7407090	i965: include draw_params/derived_draw_params for VF cache workaround These buffers are using VB slots and should be included in the workaround decision. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Fixes: `a363bb2cd0` ("i965: Allocate VMA in userspace for full-PPGTT systems.") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109072	2019-01-04 11:18:54 +00:00
Lionel Landwerlin	da634a4acb	intel/blorp: emit VF caching workaround before 3DSTATE_VERTEX_BUFFERS Probably no difference but it's nice to have i965 & blorp emit things in the same order. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-04 11:18:51 +00:00
Lionel Landwerlin	e5ed217545	i965: limit VF caching workaround to gen8/9/10 Documentation of the 3DSTATE_VERTEX_BUFFERS packet says this is only needed before ICL. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-01-04 11:18:48 +00:00
Andres Gomez	f0312cfa93	glsl/linker: complete documentation for assign_attribute_or_color_locations Commit `27f1298b9d` ("glsl/linker: validate attribute aliasing before optimizations") forgot to complete the documentation. Cc: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-01-04 09:04:31 +02:00
Gurchetan Singh	6b7aea9d85	virgl: remove empty file Fixes: 174f53 ("virgl: consolidate transfer code") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-01-03 20:59:29 +01:00
Gurchetan Singh	ca66457b05	virgl: don't flush an empty range Otherwise, the gl-1.0-long-dlist Piglit test crashes. Fixes: db7757 ("virgl: modify how we handle GL_MAP_FLUSH_EXPLICIT_BIT") Reported by airlied@ v2: Exit on any invalid range (Erik) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109190 Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Tested-by: Jakob Bornecrantz <jakob@collabora.com>	2019-01-03 20:59:29 +01:00
Eric Engestrom	393a756e6a	docs: advertise distro-provided meson cross-files Hopefully we can kick start the revolution and other distros will start providing them as well :) Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-01-03 18:53:21 +00:00
Eric Engestrom	8b363bc42e	docs: fix the meson aarch64 cross-file `gcc-ar` is preferred over the generic `ar`, and the `arm` family is for 32-bit ARM [1]. [1] https://mesonbuild.com/Reference-tables.html#cpu-families Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-01-03 18:53:21 +00:00
Jakob Bornecrantz	6a9be6fc0c	virgl/vtest: Use default socket name from protocol header No functional change as the socket name is the same, just removing the double definition of the path. Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org> Signed-off-by: Jakob Bornecrantz <jakob@collabora.com>	2019-01-03 15:50:38 +00:00
Rob Clark	e869481ef3	freedreno: fix staging resource size for arrays A 2d-array texture (for example), should get the # of array elements from box->depth, rather than depth0 which is minified. Fixes dEQP-GLES3.functional.shaders.texture_functions.texture.sampler2darray_bias_float_fragment with tiled textures. Reported-by: Kristian H. Kristensen <hoegsberg@chromium.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-03 08:11:40 -05:00
Rob Clark	67a7f6f244	freedreno: remove blit_via_copy_region() If we hit the memcpy() path for copy_region(), that will try to do a transfer_map(), which goes badly for blits to/from staging triggered by transfer_map() or transfer_unmap(). We could possibly add fd_blit2() which has allow_transfer_map param, and call that for staging blits. But I'm not really sure if trying the blit via copy_region() is very useful. At least for newer gens that implement fd_context::blit(), it probably isn't. Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-03 08:10:32 -05:00
Rob Clark	2fc17e16a3	freedreno/a6xx: rework blitter API Switch over to using fd_context::blit(), in the same way that a5xx does. The previous patch wires fd_resource_copy_region() up to the blitter so a6xx no longer needs to bypass the core layer to accelerate this. Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-03 08:10:23 -05:00
Rob Clark	53b8eb78d5	freedreno: try blitter for fd_resource_copy_region() Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-03 08:10:16 -05:00
Rob Clark	228eddd7ee	freedreno: rework blit API First step to unify the way fd5 and fd6 blitter works. Currently a6xx bypasses the blit API in order to also accelerate resource_copy_region() But this approach can lead to infinite recursion: #0 fd_alloc_staging (ctx=0x5555936480, rsc=0x7fac485f90, level=0, box=0x7fbab29220) at ../src/gallium/drivers/freedreno/freedreno_resource.c:291 #1 0x0000007fbdebed04 in fd_resource_transfer_map (pctx=0x5555936480, prsc=0x7fac485f90, level=0, usage=258, box=0x7fbab29220, pptrans=0x7fbab29240) at ../src/gallium/drivers/freedreno/freedreno_resource.c:479 #2 0x0000007fbe5c5068 in u_transfer_helper_transfer_map (pctx=0x5555936480, prsc=0x7fac485f90, level=0, usage=258, box=0x7fbab29220, pptrans=0x7fbab29240) at ../src/gallium/auxiliary/util/u_transfer_helper.c:243 #3 0x0000007fbde2dcb8 in util_resource_copy_region (pipe=0x5555936480, dst=0x7fac485f90, dst_level=0, dst_x=0, dst_y=0, dst_z=0, src=0x7fac47c780, src_level=0, src_box_in=0x7fbab2945c) at ../src/gallium/auxiliary/util/u_surface.c:350 #4 0x0000007fbdf2282c in fd_resource_copy_region (pctx=0x5555936480, dst=0x7fac485f90, dst_level=0, dstx=0, dsty=0, dstz=0, src=0x7fac47c780, src_level=0, src_box=0x7fbab2945c) at ../src/gallium/drivers/freedreno/freedreno_blitter.c:173 #5 0x0000007fbdf085d4 in fd6_resource_copy_region (pctx=0x5555936480, dst=0x7fac485f90, dst_level=0, dstx=0, dsty=0, dstz=0, src=0x7fac47c780, src_level=0, src_box=0x7fbab2945c) at ../src/gallium/drivers/freedreno/a6xx/fd6_blitter.c:587 #6 0x0000007fbde2f3d0 in util_try_blit_via_copy_region (ctx=0x5555936480, blit=0x7fbab29430) at ../src/gallium/auxiliary/util/u_surface.c:864 #7 0x0000007fbdec02c4 in fd_blit (pctx=0x5555936480, blit_info=0x7fbab29588) at ../src/gallium/drivers/freedreno/freedreno_resource.c:993 #8 0x0000007fbdf08408 in fd6_blit (pctx=0x5555936480, info=0x7fbab29588) at ../src/gallium/drivers/freedreno/a6xx/fd6_blitter.c:546 #9 0x0000007fbdebdc74 in do_blit (ctx=0x5555936480, blit=0x7fbab29588, fallback=false) at ../src/gallium/drivers/freedreno/freedreno_resource.c:129 #10 0x0000007fbdebe58c in fd_blit_from_staging (ctx=0x5555936480, trans=0x7fac47b7e8) at ../src/gallium/drivers/freedreno/freedreno_resource.c:326 #11 0x0000007fbdebea38 in fd_resource_transfer_unmap (pctx=0x5555936480, ptrans=0x7fac47b7e8) at ../src/gallium/drivers/freedreno/freedreno_resource.c:416 #12 0x0000007fbe5c5c68 in u_transfer_helper_transfer_unmap (pctx=0x5555936480, ptrans=0x7fac47b7e8) at ../src/gallium/auxiliary/util/u_transfer_helper.c:516 #13 0x0000007fbde2de24 in util_resource_copy_region (pipe=0x5555936480, dst=0x7fac485f90, dst_level=0, dst_x=0, dst_y=0, dst_z=0, src=0x7fac47b8e0, src_level=0, src_box_in=0x7fbab2997c) at ../src/gallium/auxiliary/util/u_surface.c:376 #14 0x0000007fbdf2282c in fd_resource_copy_region (pctx=0x5555936480, dst=0x7fac485f90, dst_level=0, dstx=0, dsty=0, dstz=0, src=0x7fac47b8e0, src_level=0, src_box=0x7fbab2997c) at ../src/gallium/drivers/freedreno/freedreno_blitter.c:173 #15 0x0000007fbdf085d4 in fd6_resource_copy_region (pctx=0x5555936480, dst=0x7fac485f90, dst_level=0, dstx=0, dsty=0, dstz=0, src=0x7fac47b8e0, src_level=0, src_box=0x7fbab2997c) at ../src/gallium/drivers/freedreno/a6xx/fd6_blitter.c:587 ... Instead rework the API to push the fallback back to core code, so that we can rework resource_copy_region() to have it's own fallback path, and then finally convert fd6 over to work in the same way. This also makes ctx->blit() optional, and cleans up some unnecessary callers. Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-03 08:09:52 -05:00
Rob Clark	f1c88336e6	freedreno: skip depth resolve if not written For multi-pass rendering, it is common to keep the same depth buffer from previous pass, to discard geometry that would be hidden by later draws. In the later passes with depth-test enabled, but depth-write disabled, there is no reason to do gmem2mem resolve. TODO probably do something similar for stencil.. although stencil buffer isn't used as commonly these days Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-01-03 08:09:24 -05:00
Timothy Arceri	4d3f6cb973	nir: merge some basic consecutive ifs After trying multiple times to merge if-statements with phis between them I've come to the conclusion that it cannot be done without regressions. The problem is for some shaders we end up with a whole bunch of phis for the merged ifs resulting in increased register pressure. So this patch just merges ifs that have no phis between them. This seems to be consistent with what LLVM does so for radeonsi we only see a change (although its a large change) in a single shader. Shader-db results i965 (SKL): total instructions in shared programs: 13098176 -> 13098152 (<.01%) instructions in affected programs: 1326 -> 1302 (-1.81%) helped: 4 HURT: 0 total cycles in shared programs: 332032989 -> 332037583 (<.01%) cycles in affected programs: 60665 -> 65259 (7.57%) helped: 0 HURT: 4 The cycles estimates reported by shader-db for i965 seem inaccurate as the only difference in the final code is the removal of the redundent condition evaluations and jumps. Also the biggest code reduction (~7%) for radeonsi was in a tomb raider tressfx shader but for some reason this does not get merged for i965. Shader-db results radeonsi (VEGA): Totals from affected shaders: SGPRS: 232 -> 232 (0.00 %) VGPRS: 164 -> 164 (0.00 %) Spilled SGPRs: 59 -> 59 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 14584 -> 13520 (-7.30 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 13 -> 13 (0.00 %) Wait states: 0 -> 0 (0.00 %) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-01-03 15:17:16 +11:00
Timothy Arceri	19cafe8084	nir: add rewrite_phi_predecessor_blocks() helper This will also be used by the if merge pass in the following commit. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-01-03 15:17:16 +11:00
Timothy Arceri	5122fbc4ba	nir: simplify does_varying_match() Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-03 11:47:56 +11:00
Timothy Arceri	8d05ee2005	nir: make use of does_varying_match() helper Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-03 11:47:56 +11:00
Timothy Arceri	0016166d19	nir: make nir_opt_remove_phis_impl() static Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-01-03 11:47:56 +11:00
Eric Anholt	d2b899c0ec	v3d: Refactor compiler entrypoints. Before, I had per-stage entryoints with some helpers shared between them. As I extended for compute shaders and shader-db, it turned out that the other common code in the middle wanted to be shared too.	2019-01-02 14:12:29 -08:00
Eric Anholt	0805060573	v3d: Handle dynamically uniform IF statements with uniform control flow. Loops will be trickier, since we need some analysis to figure out if the breaks/continues inside are uniform. Until we get that in NIR, this gets us some quick wins. total instructions in shared programs: 6192844 -> 6174162 (-0.30%) instructions in affected programs: 487781 -> 469099 (-3.83%)	2019-01-02 14:12:29 -08:00
Eric Anholt	5e9ee6e841	v3d: Fold comparisons for IF conditions into the flags for the IF. total instructions in shared programs: 6193810 -> 6192844 (-0.02%) instructions in affected programs: 800373 -> 799407 (-0.12%)	2019-01-02 14:12:29 -08:00
Eric Anholt	078dc176bc	v3d: Don't try to fold non-SSA-src comparisons into bcsels. There could have been a write of a src in between the comparison and the bcsel that would invalidate the comparison.	2019-01-02 14:12:29 -08:00
Eric Anholt	2e0433b687	v3d: Move the "Find the ALU instruction generating our bool" out of bcsel. This will be reused for if statements.	2019-01-02 14:12:29 -08:00
Eric Anholt	c3ae0aa264	v3d: Simplify the emission of comparisons for the bcsel optimization. I wanted to reuse the comparison stuff for nir_ifs, but for that I just want the flags and no destination value. Splitting the conditions from the destinations ended up cleaning the existing code up, anyway.	2019-01-02 14:12:29 -08:00
Eric Anholt	49d8e2aff1	v3d: Don't forget to include RT writes in precompiles. Looking at some assembly dumps for an optimization, we were clearly missing important parts of the shader!	2019-01-02 14:12:29 -08:00
Eric Anholt	3a81c753a3	v3d: Fix segfault when failing to compile a program. We'll still fail at draw time, but this avoids a regression in shader-db execution once I enable TLB writes in precompiles. Fixes: `b38e4d313f` ("v3d: Create a state uploader for packing our shaders together.")	2019-01-02 14:12:29 -08:00
Marek Olšák	3ae57957be	radeonsi: always unmap texture CPU mappings on 32-bit CPU architectures Team Fortress 2 32-bit version runs out of the CPU address space. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2019-01-02 15:01:59 -05:00
Marek Olšák	edfca1f8dc	radeonsi: remove unused variables in si_insert_input_ptr Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2019-01-02 15:01:58 -05:00
Marek Olšák	cba475b3e7	radeonsi: use u_decomposed_prims_for_vertices instead of u_prims_for_vertices It seems to be the same, but this doesn't use integer division with a variable divisor. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2019-01-02 15:01:56 -05:00
Marek Olšák	54bc87469a	radeonsi: make si_cp_wait_mem more configurable Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2019-01-02 15:01:54 -05:00
Marek Olšák	9d2c3a1fe0	radeonsi: call si_fix_resource_usage for the GS copy shader as well Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2019-01-02 15:01:53 -05:00
Marek Olšák	d28e208213	radeonsi: don't emit redundant PKT3_NUM_INSTANCES packets Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2019-01-02 15:01:50 -05:00
Caio Marcelo de Oliveira Filho	7d6babf995	nir: add a way to print the deref chain Makes debugging easier when we care about the deref chain and not the deref instruction itself. To make it take a const pointer, constify some of the static functions in nir_print.c. Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-02 10:09:04 -08:00
Dylan Baker	a2596450ac	meson: Error out if building nouveau and using LLVM without rtti Nouveau requires rtti. Often LLVM is configured without rtti, and code with and without cannot be linked safely. Lets just error out if nouveau is requested and llvm is built without rtti. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109202 Fixes: `c5a97d658e` ("meson: fix builds against LLVM built without rtti") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-02 09:30:12 -08:00
Alexander von Gluck IV	1b97a72328	egl/haiku: Fix reference to disp vs dpy Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Fixes: `00992700c9` "egl: set the EGLDevice when creating a display"	2019-01-02 13:45:09 +00:00
Iago Toral Quiroga	ec79069856	compiler/spirv: use 32-bit polynomial approximation for 16-bit asin() The 16-bit polynomial execution doesn't meet Khronos precision requirements. Also, the half-float denorm range starts at 2^(-14) and with asin taking input values in the range [0, 1], polynomial approximations can lead to flushing relatively easy. An alternative is to use the atan2 formula to compute asin, which is the reference taken by Khronos to determine precision requirements, but that ends up generating too many additional instructions when compared to the polynomial approximation. Specifically, for the Intel case, doing this adds +41 instructions to the program for each asin/acos call, which looks like an undesirable trade off. So for now we take the easy way out and fallback to using the 32-bit polynomial approximation, which is better (faster) than the 16-bit atan2 implementation and gives us better precision that matches Khronos requirements. v2: - Fallback to 32-bit using recursion (Jason). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:39 +01:00
Iago Toral Quiroga	fda3f6d424	compiler/spirv: implement 16-bit frexp Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:35 +01:00
Iago Toral Quiroga	7d3c34197a	compiler/spirv: implement 16-bit hyperbolic trigonometric functions v2: - use nir_fadd_imm and nir_fmul_imm helpers (Jason) v3: - since we need to define one for fsub use it for fdiv too (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:05 +01:00
Iago Toral Quiroga	88663ba67c	compiler/spirv: implement 16-bit exp and log v2 - use nir_fmul_imm helper (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:05 +01:00
Iago Toral Quiroga	f18554e2ce	compiler/spirv: implement 16-bit atan2 v2: - fix huge_val for 16-bit, it was mean't to be 2^14 not 10^14. v3: - rebase on top of new bool sized opcodes - use nir_b2f helper - use nir_fmul_imm helper Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:05 +01:00
Iago Toral Quiroga	1c8de08ec9	compiler/spirv: implement 16-bit atan v2: - use nir_fadd_imm and nir_fmul_imm helpers (Jason) - rebased on top of new sized boolean opcodes - use nir_b2f helper Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:05 +01:00
Iago Toral Quiroga	df118535ca	compiler/spirv: implement 16-bit acos Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:05 +01:00
Iago Toral Quiroga	dbbbe24d76	compiler/spirv: implement 16-bit asin v2: - use nir_fmul_imm and nir_fadd_imm helpers (Jason) v3: - missed one case where we need to replace nir_imm_float with nir_imm_floatN_t (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:05 +01:00
Iago Toral Quiroga	95b7c29c2c	compiler/spirv: handle 16-bit float in radians() and degrees() v2: - use nir_imm_fmul helper (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:05 +01:00
Iago Toral Quiroga	aeee683780	compiler/nir: add nir_fadd_imm() and nir_fmul_imm() helpers Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:05 +01:00
Iago Toral Quiroga	5fc9ad1cb0	compiler/nir: add a nir_b2f() helper Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-01-02 07:54:05 +01:00
Timothy Arceri	70be9afccb	nir: link time opt duplicate varyings If we are outputting the same value to more than one output component rewrite the inputs to read from a single component. This will allow the duplicate varying components to be optimised away by the existing opts. shader-db results i965 (SKL): total instructions in shared programs: 12869230 -> 12860886 (-0.06%) instructions in affected programs: 322601 -> 314257 (-2.59%) helped: 3080 HURT: 8 total cycles in shared programs: 317792574 -> 317730593 (-0.02%) cycles in affected programs: 2584925 -> 2522944 (-2.40%) helped: 2975 HURT: 477 shader-db results radeonsi (VEGA): SGPRS: 31576 -> 31664 (0.28 %) VGPRS: 17484 -> 17064 (-2.40 %) Spilled SGPRs: 184 -> 167 (-9.24 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 583340 -> 569368 (-2.40 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 6162 -> 6270 (1.75 %) Wait states: 0 -> 0 (0.00 %) vkpipeline-db results RADV (VEGA): Totals from affected shaders: SGPRS: 14880 -> 15080 (1.34 %) VGPRS: 10872 -> 10888 (0.15 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 674016 -> 668396 (-0.83 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 2708 -> 2704 (-0.15 %) Wait states: 0 -> 0 (0.00 % V2: bunch of tidy ups suggested by Jason Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-02 12:19:17 +11:00
Timothy Arceri	d828694b80	nir: rework nir_link_opt_varyings() This just cleans things up a little and make things more safe for derefs. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-02 12:19:17 +11:00
Timothy Arceri	c0aba8b0dc	nir: add can_replace_varying() helper This will be reused by the following patch. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-02 12:19:17 +11:00
Timothy Arceri	50de3f80a8	nir: rename nir_link_constant_varyings() nir_link_opt_varyings() The following patches will add support for an additional optimisation so this function will no longer just optimise varying constants. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-02 12:19:17 +11:00
Timothy Arceri	0a4378ce56	st/glsl_to_nir: call nir_lower_load_const_to_scalar() in the st This will help the new opt introduced in the following patches allowing us to remove extra duplicate varyings. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-01-02 12:19:17 +11:00
Timothy Arceri	2ef0f944f5	radeonsi: make use of ac_are_tessfactors_def_in_all_invocs() Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-02 10:01:31 +11:00
Timothy Arceri	2832bc972b	ac/nir_to_llvm: add ac_are_tessfactors_def_in_all_invocs() The following patch will use this with the radeonsi NIR backend but I've added it to ac so we can use it with RADV in future. This is a NIR implementation of the tgsi function tgsi_scan_tess_ctrl(). Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-02 10:01:24 +11:00
Timothy Arceri	2817a4ec0b	radeonsi: remove unrequired param in si_nir_scan_tess_ctrl() Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-02 10:01:15 +11:00
Timothy Arceri	4dda445750	tgsi/scan: correctly walk instructions in tgsi_scan_tess_ctrl() The previous code used a do while loop and continues after walking a nested loop/if-statement. This means we end up evaluating the last instruction from the nested block against the while condition and potentially exit early if it matches the exit condition of the outer block. Fixes: `386d165d8d` ("tgsi/scan: add a new pass that analyzes tess factor writes") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-02 09:53:01 +11:00
Timothy Arceri	dd061eb044	tgsi/scan: fix loop exit point in tgsi_scan_tess_ctrl() This just happened not to crash/assert because all loops have at least 1 if-statement and due to a second bug we end up matching the same ENDIF to exit both the iteration over the if-statment and the loop. The second bug is fixed in the following patch. Fixes: `386d165d8d` ("tgsi/scan: add a new pass that analyzes tess factor writes") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-01-02 09:53:01 +11:00
Ilia Mirkin	8f98ff362c	nv30: disable rendering to 3D textures There's no way to tell the 3D engine about swizzling on such textures. While rendering to NPOT ones may be possible, there's no great way to expose that in gallium, nor would there be any practical benefit. Fixes the non-compressed-format "copyteximage 3D" failures. Something odd going on with the compressed formats. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-01-01 15:11:14 -05:00
Bas Nieuwenhuizen	8c93ef5de9	radv: Do a cache flush if needed before reading predicates. This caused random failures for two conditional rendering tests: dEQP-VK.conditional_rendering.draw_clear.draw.update_with_rendering_discard dEQP-VK.conditional_rendering.draw_clear.draw.update_with_rendering_no_discard These wrote the predicate with the vertex shader, did a barrier and then started the conditional rendering. However the cache flushes for the barrier only happen on first draw, so after the predicate has been read. Fixes: `e45ba51ea4` "radv: add support for VK_EXT_conditional_rendering" Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-12-31 20:52:08 +01:00
Erik Faye-Lund	86089a7316	anv/autotools: make sure tests link with -msse2 Without this, I get the following error when building the tests with autotools on i686: ---8<--- src/intel/common/gen_clflush.h: In function ‘gen_clflush_range’: src/intel/common/gen_clflush.h:37:7: warning: implicit declaration of function ‘__builtin_ia32_clflush’; did you mean ‘__builtin_ia32_pause’? [-Wimplicit-function-declaration] __builtin_ia32_clflush(p); ^~~~~~~~~~~~~~~~~~~~~~ __builtin_ia32_pause src/intel/common/gen_clflush.h: In function ‘gen_flush_range’: src/intel/common/gen_clflush.h:45:4: warning: implicit declaration of function ‘__builtin_ia32_mfence’; did you mean ‘__builtin_ia32_fnclex’? [-Wimplicit-function-declaration] __builtin_ia32_mfence(); ^~~~~~~~~~~~~~~~~~~~~ __builtin_ia32_fnclex ---8<--- The erros are generated for each of these files: - mesa/src/intel/vulkan/tests/state_pool_no_free.c - mesa/src/intel/vulkan/tests/state_pool.c - mesa/src/intel/vulkan/tests/block_pool_no_free.c - mesa/src/intel/vulkan/tests/state_pool_free_list_only.c This is obviously because gen_clflush.h contains code that uses intrinsics that are only available with SSE3. Since the driver already uses SSE3, it seems reasonable to add this to the tests as well. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Eric Engeström <eric@engestrom.ch>	2018-12-31 17:28:21 +01:00
Erik Faye-Lund	89679e18a9	anv/meson: make sure tests link with -msse2 Without this, I get the following error when building the tests using meson on i686: ---8<--- In file included from ../../../mesa/src/intel/vulkan/anv_private.h:46, from ../../../mesa/src/intel/vulkan/tests/state_pool_no_free.c:26: ../../../mesa/src/intel/common/gen_clflush.h: In function ‘gen_clflush_range’: ../../../mesa/src/intel/common/gen_clflush.h:37:7: error: implicit declaration of function ‘__builtin_ia32_clflush’; did you mean ‘__builtin_ia32_pause’? [-Werror=implicit-function-declaration] __builtin_ia32_clflush(p); ^~~~~~~~~~~~~~~~~~~~~~ __builtin_ia32_pause ../../../mesa/src/intel/common/gen_clflush.h: In function ‘gen_flush_range’: ../../../mesa/src/intel/common/gen_clflush.h:45:4: error: implicit declaration of function ‘__builtin_ia32_mfence’; did you mean ‘__builtin_ia32_fnclex’? [-Werror=implicit-function-declaration] __builtin_ia32_mfence(); ^~~~~~~~~~~~~~~~~~~~~ __builtin_ia32_fnclex ---8<--- The errors are generated for each of these files: - mesa/src/intel/vulkan/tests/state_pool_no_free.c - mesa/src/intel/vulkan/tests/state_pool.c - mesa/src/intel/vulkan/tests/block_pool_no_free.c - mesa/src/intel/vulkan/tests/state_pool_free_list_only.c This is obviously because gen_clflush.h contains code that uses intrinsics that are only available with SSE3. Since the driver already uses SSE3, it seems reasonable to add this to the tests as well. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Eric Engeström <eric@engestrom.ch>	2018-12-31 17:27:33 +01:00
Ilia Mirkin	207fb558e4	nv30: fix some s3tc layout issues s3tc layouts are a bit finicky - they're packed, but not swizzled. Adjust logic to allow for that case: - Don't set a uniform pitch for POT-sized compressed textures - Adjust define_rect API to be less confused about block sizes - Only mark a texture as linear if it has a uniform pitch set This has been tested to fix xonotic (as well as the s3tc-* piglits) on nv3x and keeps it working on nv4x. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-30 23:32:21 -05:00
Ilia Mirkin	ad251330e8	nv30: use correct helper to get blocks in y direction This doesn't matter since all compressed formats supported by this hardware use square blocks, but best to use the correct helper. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-30 23:32:21 -05:00
Ilia Mirkin	b04c1907c8	nv30: add support for multi-layer transfers This logic mirrors what we do on nv50. The relatively new texture_subdata callback can cause this to happen with 3D textures, which is triggered at least by xonotic, and probably many piglits. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-30 23:32:21 -05:00
Ilia Mirkin	b34cfd4749	nv30: fix rare issue with fp unbinding not finding the bufctx If the last-active context gets deleted, the pushbuf doesn't have a bufctx to reference. Then there could be a sequence of binds which would trigger a reset on that bin before validation was done. Instead we just pass in the bufctx in question directly. All other instances of PUSH_RESET happen strictly after a validation is run. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102349 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-30 19:44:43 -05:00
Ilia Mirkin	ef3eac9545	nv30: avoid setting user_priv without setting cur_ctx The whole user_priv thing is a mess, but as long as it's there, it basically has to map 1:1 to the cur_ctx. Unfortunately we were setting user_priv to some context, then that context could get deleted without any draws/validations in it, leading user_priv to become NULL, with cur_ctx still pointing at some old context. Then we wouldn't run the switch logic, which in turn led to a NULL bufctx being dereferenced. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102349 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-30 19:44:43 -05:00
Eric Anholt	ad1e59cf8d	v3d: Add support for gl_HelperInvocation. We can just look at the MSF flags -- if they're unset, then we're definitely in a helper invocation. Fixes dEQP-GLES31.functional.shaders.helper_invocation.* with GLES3.1 enabled.	2018-12-30 08:05:11 -08:00
Eric Anholt	20021e3473	v3d: Add support for textureSize() on MSAA textures. Fixes failures in dEQP-GLES31.functional.shaders.builtin_functions.texture_size.samples_1_texture_2d in the GLES3.1 suite.	2018-12-30 08:05:11 -08:00
Eric Anholt	f695d62fe5	v3d: Add support for requesting the sample offsets.	2018-12-30 08:05:11 -08:00
Eric Anholt	906fca1b4b	v3d: Add support for non-constant texture offsets. Fixes dEQP-GLES31.functional.texture.gather.offset_dynamic.min_required_offset.2d.rgba8.size_pot.clamp_to_edge_repeat and others.	2018-12-30 08:05:11 -08:00
Eric Anholt	47caefc7b4	v3d: Force sampling from base level for tg4. This is what the GLSL ES 310 spec tells us to do, but apparently the "gather mode" flag doesn't imply it in the HW. Fixes dEQP-GLES31.functional.texture.gather.basic.2d.rgba8.filter_mode.min_nearest_mipmap_linear_mag_linear	2018-12-30 08:05:11 -08:00
Eric Anholt	f9bdce9966	v3d: Add a note for a potential performance win on multop/umul24. Noticed while debugging a testcase.	2018-12-30 08:05:11 -08:00
Eric Anholt	b36757448d	v3d: Dead-code eliminate unused flags updates. The greedy comparison folding in bcsel means that we may have left the original bool-generating NIR ALU instruction dead, but DCE wasn't eliminating the VIR code for it because of the flags updates. total instructions in shared programs: 5186024 -> 5100894 (-1.64%) instructions in affected programs: 1448695 -> 1363565 (-5.88%)	2018-12-30 08:05:11 -08:00
Eric Anholt	20e3526298	v3d: Don't generate temps for comparisons. This was just generated work for vir_opt_dead_code and cluttered up the dumps.	2018-12-30 08:04:54 -08:00
Eric Anholt	ebde5afb93	v3d: Move "does this instruction have flags" from sched to generic helpers. I wanted to reuse it for DCE of flags updates.	2018-12-30 08:03:51 -08:00
Eric Anholt	39b1112189	v3d: Drop incorrect dependency for flpop. It is just shifting probably-means-flags bits out of a value, it doesn't actually update the flags on its own.	2018-12-30 08:03:51 -08:00
Eric Anholt	a7c9fd7573	v3d: Drop unused count_nir_instrs() helper. This was for shader-db, but I haven't cared about NIR instruction counts in a long time.	2018-12-30 08:03:51 -08:00
Eric Anholt	696f63f1b4	v3d: Hook up some shader-db output to GL_ARB_debug_output. This allows the original shader-db project's run.c runner to parse things easily, and is probably a good thing to have for GL_ARB_debug_output in general. I formatted it more like Intel's so I can mostly reuse their report script.	2018-12-30 08:03:51 -08:00
Eric Anholt	87b251a940	v3d: Add a "precompile" debug flag for shader-db. I've been using my apitrace-based shader-db so far, but it's slow (apitrace decompression), intrusive (apitrace windows spamming the screen), and doesn't have much coverage. The original shader-db provides a lot more coverage and compiles faster, at the expense of not having the actual runtime variant key. As v3d has a lot less runtime variation than vc4 did, this tradeoff makes more sense.	2018-12-29 13:52:09 -08:00
Eric Anholt	9ec6a3d621	v3d: Fix uniform pretty printing assertion failure with branches. Fixes: `248a7fb392` ("v3d: Do uniform pretty-printing in the QPU dump.")	2018-12-29 13:52:09 -08:00
Dylan Baker	133a5b8383	meson: Override C++ standard to gnu++11 when building with altivec on ppc64 Otherwise there will be symbol collisions for the vector name. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108943 Distro Bug: https://bugs.gentoo.org/673622 Fixes: `42ea0631f1` ("meson: build clover") Acked-by: Matt Turner <mattst88@gmail.com>	2018-12-28 11:04:57 -08:00
Lionel Landwerlin	f7bccf6ab4	intel/aub_viewer: highlight true booleans Useful to spot PIPE_CONTROL flags. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-12-28 16:48:46 +00:00
Lionel Landwerlin	6ba61ea391	intel/aub_viewer: fold binding/sampler table items Makes things easier to read rather than a long block of text. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-12-28 16:48:43 +00:00
Lionel Landwerlin	7ab8c80625	intel/aub_viewer: fix shader view Not decoding the shader at the right offset. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-12-28 16:48:40 +00:00
Lionel Landwerlin	f3ed4a058d	intel/aub_viewer: print address of missing shader Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-12-28 16:48:21 +00:00
Lionel Landwerlin	0382e11989	intel/aub_viewer: fixup 0x address prefix Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-12-28 16:48:18 +00:00
Lionel Landwerlin	8e2fda411a	intel/aub_viewer: fix shader get_bo Instruction addresses are always in ppgtt space. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-12-28 16:48:08 +00:00
Nicholas Kazlauskas	e260493f2a	radeonsi: Enable adaptive_sync by default for radeon It's better to let most applications make use of adaptive sync by default. Problematic applications can be placed on the blacklist or the user can manually disable the feature. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>	2018-12-28 17:08:14 +01:00
Nicholas Kazlauskas	2e12fe425f	loader/dri3: Enable adaptive_sync via _VARIABLE_REFRESH property The DDX driver can be notified of adaptive sync suitability by flagging the application's window with the _VARIABLE_REFRESH property. This property is set on the first swap the application performs when adaptive_sync is set to true in the drirc. It's performed here instead of when the loader is initialized for two reasons: (1) The window's drawable can be missing during loader init. This can be observed during the Unigine Superposition benchmark. (2) Adaptive sync will only be enabled closer to when the application actually begins rendering. If adaptive_sync is false then the _VARIABLE_REFRESH property is deleted on loader init. The property is only managed on the glx DRI3 backend for now. This should cover most common applications and games on modern hardware. Vulkan support can be implemented in a similar manner but would likely require splitting the function out into a common helper function. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>	2018-12-28 16:44:47 +01:00
Nicholas Kazlauskas	a9c36dbf9c	drirc: Initial blacklist for adaptive sync Applications that don't present at a predictable rate (ie. not games) shouldn't have adapative sync enabled. This list covers some of the common desktop compositors, web browsers and video players. [ Michel Dänzer: Added entry for firefox-esr ] Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>	2018-12-28 16:44:27 +01:00
Nicholas Kazlauskas	7407670036	util: Add adaptive_sync driconf option This option lets the user decide whether mesa should notify the window manager / DDX driver that the current application is adaptive sync capable. It's off by default. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>	2018-12-28 16:38:06 +01:00
Nicholas Kazlauskas	759b940389	util: Get program name based on path when possible Some programs start with the path and command line arguments in argv[0] (program_invocation_name). Chromium is an example of an application using mesa that does this. This tries to query the real path for the symbolic link /proc/self/exe to find the program name instead. It only uses the realpath if it was a prefix of the invocation to avoid breaking wine programs. Cc: Timothy Arceri <tarceri@itsqueeze.com> Signed-off-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-12-28 15:41:01 +01:00
Tomeu Vizoso	bf1dfcc3e8	etnaviv: Consolidate buffer references from framebuffers We were leaking surfaces because the references taken in etna_set_framebuffer_state weren't being released on context destroy. Instead of just directly releasing those references in etna_context_destroy, use the util_copy_framebuffer_state helper. Take the chance to remove the duplicated buffer references in compiled_framebuffer_state to avoid confusion. The leak can be reproduced with a client that continuously creates and destroys contexts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reported-by: Sjoerd Simons <sjoerd.simons@collabora.co.uk> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2018-12-28 10:22:01 +01:00
Dave Airlie	d1ce7eba8b	virgl/vtest: fix front buffer flush with protocol version 0. Older versions of virglrenderer before 33da7361aec486290df0aec4ad8dfa8ff6adde2c in vtest mode, misrender gears. Fixes: `9d81cd8e7c` (virgl: Pass resource size and transfer offsets) Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2018-12-28 16:50:38 +10:00
Dylan Baker	6adbd9ac74	docs/autoconf: Mark autoconf as being replaced I know it's not what anyone wants, but how about we start with a message in the documentation that encourages people to try meson. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Eric Engeström <eric@engestrom.ch>	2018-12-27 09:03:20 -08:00
Dylan Baker	4c32964f49	docs/install: Update python dependency section Note that meson requires python 3, scons requires python 2, and autotools works with either. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Eric Engeström <eric@engestrom.ch>	2018-12-27 09:03:20 -08:00
Dylan Baker	a57dbe6971	docs/meson: Update LLVM section with information about native files Reviewed-by: Eric Engeström <eric@engestrom.ch>	2018-12-27 09:03:17 -08:00
Dylan Baker	40ec5fec0a	docs/install: Add meson to the main install page Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Eric Engeström <eric@engestrom.ch>	2018-12-27 09:03:07 -08:00
Juan A. Suarez Romero	fe7919acad	docs: update calendar, add news item and link release notes for 18.2.8 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-12-27 17:37:33 +01:00
Juan A. Suarez Romero	0d53451890	docs: add sha256 checksums for 18.2.8 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `24c31bc0e2`)	2018-12-27 17:35:04 +01:00
Juan A. Suarez Romero	008478e340	docs: add release notes for 18.2.8 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `785e09e3b3`)	2018-12-27 17:35:02 +01:00
Ilia Mirkin	2269ab8588	nv50,nvc0: add missing CAPs for unsupported features Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-26 20:28:07 -05:00
Ilia Mirkin	1d10bb2025	nvc0: enable GL_NV_shader_atomic_float on pre-Maxwell Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-26 20:04:57 -05:00
Ilia Mirkin	0dd55db10f	nv50/ir: add support for converting ATOMFADD to proper ir Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-26 20:04:57 -05:00
Ilia Mirkin	9867f2a1f7	st/mesa: expose GL_NV_shader_atomic_float when ATOMFADD is supported Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-26 20:04:57 -05:00
Ilia Mirkin	4d5a6a1649	st/mesa: select ATOMFADD when source type is float Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-26 20:04:57 -05:00
Ilia Mirkin	d139231b32	gallium: add PIPE_CAP_TGSI_ATOMFADD to indicate support ATOMFADD is a little special -- make drivers have to specify it explicitly. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-26 20:04:57 -05:00
Ilia Mirkin	5574414edc	tgsi: add ATOMFADD operation This is supported by at least NVIDIA hardware, and exposeable via GL extensions. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-26 20:04:57 -05:00
Ilia Mirkin	bac8534267	st/mesa: allow glDrawElements to work with GL_SELECT feedback Not sure if this ever worked, but the current logic for setting the min/max index is definitely wrong for indexed draws. While we're at it, bring in all the usual logic from the non-indirect drawing path. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109086 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-12-26 19:30:33 -05:00
Eric Anholt	7d7ecfbcbc	gallium/ttn: Fix setup of outputs_written. We need a 64-bit value, otherwise we only handle the low 32, and happen to sign-extend to claim to write all varying slots if VARYING_SLOT_VAR2 was used. Fixes: `4d0b2c7aaa` ("ttn: Update shader->info as we generate code.") Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-12-26 11:42:09 -08:00
Lionel Landwerlin	e2ae5f2f0a	anv: don't do partial resolve on layer > 0 We've made the choice not to use fast clears on layer > 0 with multilayer images. This is partly because we would need to store multiple clear colors for each layer, making the existing memory layout, already including aux surfaces, fast clear color, image state, etc... even more complex. Partial resolves are the operations transfering the clear colors into the auxiliary buffers. This operation is currently implemented in Blorp by loading the clear color from the image's BO, into a shader that then samples from the auxiliary buffer and writes the color only if it isn't there already. The problem here is that because we store only one clear color for all layers and it is used for partial resolves. If you trigger a partial clear on a layer > 0, then you're likely to deal with a color that is not what you actually want. In the particular issues below, we have multiple layers, each cleared with a different color but the partial resolve just writes the wrong color into the auxiliary buffers for layers > 0. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108910 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108911 Cc: mesa-stable@lists.freedesktop.org	2018-12-24 09:42:46 +00:00
Axel Davy	c6b37e5412	st/nine: Increase the limit of cached ff shaders 100 is too small for some games, which triggers recompilations every frame. Increase to 1024. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-12-23 08:14:50 +01:00
Axel Davy	104681c5d5	st/nine: Add src reference to nine_context_range_upload Just like nine_context_box_upload, nine_context_range_upload should reference the src, which holds the ram source buffer. Fixes: https://github.com/iXit/Mesa-3D/issues/327 Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Cc: mesa-stable@lists.freedesktop.org	2018-12-23 08:14:50 +01:00
Axel Davy	42d672fa6a	st/nine: Bind src not dst in nine_context_box_upload nine_context_box_upload uploads a ram buffer (from src) to a pipe_resource (dst). We already have a refcount on the pipe_resource, what needs to be protected from release is the ram buffer, thus a reference to src. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Cc: mesa-stable@lists.freedesktop.org	2018-12-23 08:14:50 +01:00
Axel Davy	f91f748fab	st/nine: Fix volumetexture dtor on ctor failure The dtor is called on allocation failure, thus we must check the volumes are allocated before trying to release them. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Cc: mesa-stable@lists.freedesktop.org	2018-12-23 08:14:50 +01:00
Axel Davy	1cc8192ad0	st/nine: Switch to presentation buffer if resize is detected This enables to match the window size on resize on all cases, as it only works currently with presentation buffers. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-12-23 08:14:50 +01:00
Axel Davy	c442dd7890	st/nine: Use helper to release swapchain buffers later This patch introduces a structure to release the present_handles only when they are fully released by the server, thus making "DestroyD3DWindowBuffer" actually release the buffer right away when called. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-12-23 08:14:50 +01:00
Rob Clark	51a44c3aac	freedreno/a6xx: fix 3d texture layout Maybe not 100% perfect, but seems to be a pretty good approximation of that. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:29:15 -05:00
Rob Clark	8f60f1381d	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:28:50 -05:00
Rob Clark	be9ec158d7	freedreno/a6xx: improve setup_slices() debug msgs Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:28:24 -05:00
Rob Clark	2b497fc507	freedreno/a6xx: simplify special case for 3d layout This logic can be re-written as the two cases for 3d (ie. before/after the miplevel sizes start reducing) vs everything else. I think it is easier to read this way. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:27:57 -05:00
Rob Clark	d71a50f831	freedreno: combine fd_resource_layer_offset()/fd_resource_offset() We really only need this logic in one place. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:27:37 -05:00
Rob Clark	6667dde098	freedreno/ir3: don't treat all inputs/outputs as vec4 This was a hold-over from the early TGSI days, and mostly not needed with NIR. This avoids burning an entire 4 consecutive scalar regs for vec3 outputs, for example. Which fixes a few places that we were doing worse that we should on register usage. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:27:21 -05:00
Rob Clark	3453814622	freedreno/ir3: fix fallout of extra assert Fixes the following crash that happened after `d6110d4d` The problem happens if we first compile a "vanilla" shader with nothing lowered in NIR, which perform the final lowering passes on so->shader-> nir (including nir_lower_locals_to_regs()), and then later we have compile a shader with some lowering. The second time through we would have already done nir_lower_locals_to_regs(). Arguably this was already a bug, just one we hadn't noticed yet. Fixes: `d6110d4d54` intel/compiler: move nir_lower_bool_to_int32 before nir_lower_locals_to_regs Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-21 19:04:22 -05:00
Kenneth Graunke	626f2477ab	st/nir: Drop unused gl_program parameter in VS input handling helper. Nobody uses this, so let's drop it. This makes the helper callable from places without a gl_program. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:32 -08:00
Kenneth Graunke	3a78b46e59	st/nir: Gather info after applying lowering FS variant features DrawPixels lowering, for example, adds new varyings that need to be accounted for in inputs_read. The earlier info gathering at link time cannot account for this. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:30 -08:00
Kenneth Graunke	bcb6f19947	st/mesa: Combine the DrawPixels and Bitmap passthrough VS programs. They're now identical, so we can just compile it once. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:29 -08:00
Kenneth Graunke	80dd9dfe33	st/mesa: Don't open code the drawpixels vertex shader. Now that we always copy color, we can just use the util function. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:28 -08:00
Kenneth Graunke	ed1a356c5e	st/mesa: Drop !passColor optimization in drawpixels shaders. The glDrawPixels passthrough vertex shader copies position and texcoord vertex attributes to varying outputs. It also optionally copies a third gl_Color attribute, which sometimes is unnecessary. Until now, we've compiled separate variants of the shader, one of which does this extra copy, and the other of which doesn't. We have done this since 2007. But, the vertex shader runs for a whopping four vertices, and so the cost of a copying a single input to output is likely inconsequential. In theory, we could bind one fewer vertex element - but we always bind all three regardless. So, we don't even get that savings. This patch unifies the two, so we always copy the optional color, and save having to compile the variant. It also makes the VS input interface match up with the vertex element state without any dead (unused) input attributes. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:25 -08:00
Kenneth Graunke	42d31e0516	st/mesa: Drop dead 'passthrough_fs' field. Dead since 2015 (commit `5142564734`). Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:20 -08:00
Bas Nieuwenhuizen	bba5749484	radv: Fix wrongly positioned paren. Trivial. Fixes: `9f0bfbed11` "radv: Work around non-renderable 128bpp compressed 3d textures on GFX9."	2018-12-21 21:06:55 +01:00
Dylan Baker	1e872d1486	docs: add note about using backticks for rbs in gitlab So that gitlab will render the < and > correctly allowing the tag to be copy-n-pasted without additional formatting. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2018-12-21 17:43:56 +00:00
Alex Deucher	516160d717	pci_ids: add new VegaM pci id Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> Cc: mesa-stable@lists.freedesktop.org	2018-12-21 11:51:34 -05:00
Roland Scheidegger	171983dc89	gallivm: abort when trying to use non-existing intrinsic Whenever llvm removes an intrinsic (we're using), we're hitting segfaults due to llvm doing calls to address 0 in the jitted code instead. However, Jose figured out we can actually detect this with LLVMGetIntrinsicID(), so use this to abort, so we don't have to wonder what got broken. (Of course, someone still needs to fix the code to no longer use this intrinsic.) Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-12-21 17:37:00 +01:00
Roland Scheidegger	f3b1acff48	gallivm: don't use pavg.b intrinsic on llvm >= 6.0 This intrinsic disppeared with llvm 6.0, using it ends up in segfaults (due to llvm issuing call to NULL address in the jited shaders). Add code doing the same thing as the autoupgrade code in llvm so it can be matched and replaced back with a pavgb. While here, also improve lp_test_format, so it tests both with and without cache (as it was, it tested the cache versions only, whereas cache is actually disabled in llvmpipe, and in any case even with it enabled vertex and geometry shaders wouldn't use it). (Although at least for the unorm8 uncached fetch, the code is still quite different to what llvmpipe is using, since that would use unorm8x16 type, whereas the test code is using unorm8x4 type, hence disabling some intrinsic paths.) Fixes: `6f4083143b` ("gallivm: use llvm jit code for decoding s3tc") Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com>	2018-12-21 17:35:05 +01:00
Emil Velikov	a8d020c3dc	travis: meson: port gallium build combinations over This commit adds a number of build combinations: - Gallium Drivers {SWR, RadeonSI, Others) Each one has different LLVM requirements. Building SWR alone is twice as slow as all other drivers combined. - Gallium ST Clover LLVM {5,6,7} Because C++ API changes all the time. Analogous to above building Clover takes as much time as building all other ST combined. - Gallium ST Others Nouveau is used, instead of i915g since meson has explicit target tracking. Meaning that a configure error is thrown if we use i915g with say va, vdpau or others. Note: LLVM prior to 5.0 is intentionally dropped. If needed we can add that later. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-13 01:34:59 +00:00
Emil Velikov	39634f2f35	travis: meson: add explicit handling to gallium ST Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 13:52:20 +00:00
Emil Velikov	51318c32fe	travis: meson: explicitly control the DRI loaders Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 13:42:36 +00:00
Emil Velikov	e890aaabed	travis: meson: add unwind handling Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-12 13:33:14 +00:00
Emil Velikov	266ae2225e	travis: meson: use FOO_DRIVERS directly It makes for a shorter MESON_OPTIONS and cleaner handling. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 13:18:54 +00:00
Dylan Baker	31c162ad22	travis: meson: enable unit tests v2: [Emil] pass the argument directly to meson Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (v1) Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-11 10:34:51 -08:00
Dylan Baker	116f0fb216	travis: Don't try to read libdrm out of configure.ac Since we're going to delete it shortly Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-11 11:09:21 -08:00
Dylan Baker	ecf96413bb	travis: meson: use native files to override llvm-config This is the supported way to do this, and should be more robust and reliable. v2: [Emil] - enable backslash escapes - don't hardcode the path - pass the argument directly to meson Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (v1) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-11 10:40:25 -08:00
Emil Velikov	81173fd69f	travis: printout llvm-config --version Provides quick and easy feedback. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-13 10:38:20 +00:00
Emil Velikov	de72c1fe6c	travis: meson: print the configured state Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 17:43:07 +00:00
Emil Velikov	7c38d7b7c8	travis: flip to distro xenial, drop sudo false The latter is the default these days and Travis will be removing sudo soonish. Flipping to xenial, allows us to remove a bunch of hacks we have. Plus it prevents us from adding new ones, to workaround what seems like a gcc/binutils bug. For example (from the upcoming meson build): FAILED: ccache c++ -o src/gallium/targets/pipe-loader/pipe_r600.so ... ... src/util/libmesa_util.a ... /usr/lib/x86_64-linux-gnu/libz.so ... src/util/libmesa_util.a(disk_cache.c.o): In function `deflate_and_write_to_disk': _build/../src/util/disk_cache.c:746: undefined reference to `deflateInit_' _build/../src/util/disk_cache.c:765: undefined reference to `deflate' ... As we can see, even though libz.so is explicitly passed after the object that requires it - the linker still fails to see the symbols. Avoid all those situations - flip the switch. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-13 11:20:41 +00:00
Emil Velikov	12187550f9	configure: add CXX11_CXXFLAGS to LLVM_CXXFLAGS Seemingly with LLVM7 and GCC 5.0, the former won't properly advertise -std=c++11 and the latter will choke. dd this temporary workaround, otherwise we'll get errors like: In file included from /usr/include/c++/5/type_traits:35:0, from /usr/lib/llvm-7/include/llvm/Support/type_traits.h:18, from /usr/lib/llvm-7/include/llvm/ADT/Optional.h:22, from /usr/lib/llvm-7/include/llvm/ADT/STLExtras.h:20, from /usr/lib/llvm-7/include/llvm/ADT/StringRef.h:13, from /usr/lib/llvm-7/include/llvm/Target/TargetMachine.h:17, from ../../../src/amd/common/ac_llvm_helper.cpp:36: /usr/include/c++/5/bits/c++0x_warning.h:32:2: error: #error This file requires compiler and library support for the ISO C++ 2011 standard. This support must be enabled with the -std=c++11 or -std=gnu++11 compiler options. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-13 11:56:40 +00:00
Emil Velikov	f331419f26	glx/test: meson: assorted include fixes Swap '..' with the symbolic inc_glx and add glproto as dependency. That will pull the correct include, effectively fixing the tests on macOS. Fixes: `a47c525f32` ("meson: build glx") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 19:24:14 +00:00
Emil Velikov	e139d7a8a3	glx: meson: wire up the dispatch-index-check test Accidentally dropped with earlier commit.! Fixes: `4ccb981673` ("meson: Use consistent style for tests") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 19:07:52 +00:00
Emil Velikov	b44875e2dc	glx: meson: drop includes from a link-only library When producing the final libGL.so/libGLX_mesa.so we only link the local static helper lib (libglx). Thus there's no reason for the includes. Fixes: `a47c525f32` ("meson: build glx") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 17:55:08 +00:00
Emil Velikov	9527f9ea26	TODO: glx: meson: build dri based glx tests, only with -Dglx=dri The library itself (libGL) is only built when -Dglx=dri, yet it's accompanying tests are build even with -Dglx=xlib. Adjust the guards, so we don't build the tests when they are not applicable v2: - Reword commit message (Dylan) - Drop build_by_default hunk (Dylan) Fixes: `a47c525f32` ("meson: build glx") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 17:47:36 +00:00
Emil Velikov	2eedb79e1a	pipe-loader: meson: reference correct library The library is called libgalliumvl_stub - note singular. Fixes: `42ea0631f1` ("meson: build clover") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-13 04:10:50 +00:00
Emil Velikov	9d10581897	meson: don't require glx/egl/gbm with gallium drivers The gallium drivers do not require a DRI loader. Drop the artificial and unnecessary restriction. Fixes: `af9d276134` ("meson: build libmesa_gallium") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-13 03:54:03 +00:00
Emil Velikov	e0dbfc9953	bin/get-pick-list.sh: warn when commit lists invalid sha We had cases where people would list old/invalid sha in the commit. Add a trivial checker to catch those and throw a warning. CC: Juan A. Suarez <jasuarez@igalia.com> CC: Dylan Baker <dylan@pnwbakers.com> CC: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Andres Gomez <agomez@igalia.com>	2018-12-21 14:39:52 +00:00
Emil Velikov	6b296f64af	bin/get-pick-list.sh: rework handing of sha nominations Currently our is_sha_nomination does: - folds any whitespace, attempting to extract sha-like information - checks that at least one of the shas has landed Split it in two and do sha-like validation first. This way, commits with mesa-stable and sha nominations will feature the fixes/revert/etc instead of stable (a) or will be omitted if not applicable for the respective branch (b). Misc examples from 18.3 (a) -[ stable ] `5bc509363b` glx: make xf86vidmode mandatory for direct rendering +[ fixes ] `5bc509363b` glx: make xf86vidmode mandatory for direct rendering (b) -[ stable ] `9a7b319903` anv/query: flush render target before copying results CC: Juan A. Suarez <jasuarez@igalia.com> CC: Dylan Baker <dylan@pnwbakers.com> CC: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Andres Gomez <agomez@igalia.com>	2018-12-21 14:39:34 +00:00
Eric Anholt	17218a0406	vc4: Hook up perf_debug() output to GL_ARB_debug_output as well. This is the right channel to report these things, so that end-users don't need to know each driver's custom debug options.	2018-12-20 11:31:25 -08:00
Rhys Kidd	acc481ad79	vc4: Wire up core pipe_debug_callback This lets the driver use pipe_debug_message() for GL_ARB_debug_output. Signed-off-by: Rhys Kidd <rhyskidd@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-12-20 11:31:19 -08:00
Eric Anholt	ba36312fbd	v3d: Hook up perf_debug() output to GL_ARB_debug output as well. This is the right channel to report these things, so that end-users don't need to know each driver's custom debug options.	2018-12-20 11:31:19 -08:00
Rhys Kidd	d3991d2472	v3d: Wire up core pipe_debug_callback This lets the driver use pipe_debug_message() for GL_ARB_debug_output. Signed-off-by: Rhys Kidd <rhyskidd@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-12-20 11:31:16 -08:00
Eric Anholt	d80761b8f3	v3d: Drop shadow comparison state from shader variant key. The shadow state is now in the sampler.	2018-12-20 11:29:30 -08:00
Eric Anholt	0e2758daad	v3d: Fix simulator mode on i915 render nodes. i915 render nodes refuse the dumb ioctls, so the simulator would crash on the original non-apitrace shader-db. Replace them with direct i915 calls if we detect that we're on one of their gem fds.	2018-12-20 11:29:30 -08:00
Dylan Baker	0ff7eed289	docs/meson: Recommend not using CFLAGS and friends Because of the many caveats involved, using -Dc_args instead of CFLAGS is recommended both by meson upstream and by us. v2: - Fix typo Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (v1) Reviewed-by: Eric Anholt <eric@anholt.net>	2018-12-20 11:16:40 -08:00
Samuel Pitoiset	9606310081	radv: enable shaderStorageImageMultisample feature on GFX8+ Untested on older chips. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 18:01:19 +01:00
Samuel Pitoiset	6b976024a8	radv: add support for FMASK expand Original patch by Dave Airlie. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 18:01:17 +01:00
Samuel Pitoiset	fa16da53d8	radv: initialize FMASK for images in fully expanded mode The value depends on the number of samples. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 18:01:15 +01:00
Samuel Pitoiset	65d82c84d2	ac/nir: restrict fmask lookup to image load intrinsics We don't ever want to do the fmask lookup on a atomic or store, the fmask should have been decompressed if the surface has been moved to IMAGE_LAYOUT. Original patch by Dave Airlie. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 18:01:11 +01:00
Samuel Pitoiset	f45e43e156	spirv: add support for SpvCapabilityStorageImageMultisample Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 18:01:09 +01:00
Samuel Pitoiset	5b1ec10e4c	radv: compute optimal VM alignment for imported buffers This fixes GPU hangs on GFX9 with dEQP-VK.memory.external_memory_host.bind_image_memory_and_render.with_zero_offset.* Copied from RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 17:34:04 +01:00
Bas Nieuwenhuizen	9f0bfbed11	radv: Work around non-renderable 128bpp compressed 3d textures on GFX9. Exactly what title says, the new addrlib does not allow the above with certain dimensions that the CTS seems to hit. Work around it by not allowing the app to render to it via compat with other 128bpp formats and do not render to it ourselves during copies. Fixes: `776b911365` "amd/addrlib: update Mesa's copy of addrlib" Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-20 15:07:20 +01:00
Samuel Pitoiset	5c7935f8fc	radv: fix subpass image transitions with multiviews The driver needs to decompress all image layers if a fast depth/color clear has been performed. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 13:36:37 +01:00
Samuel Pitoiset	0a7e767e58	radv: drop the amdgpu-skip-threshold=1 workaround for LLVM 8 This workaround has been introduced by `135e4d434f` for fixing DXVK GPU hangs with many games. It is no longer needed since LLVM r345718. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 12:09:57 +01:00
Samuel Pitoiset	576040f2e5	ac/nir: remove the bitfield_extract workaround for LLVM 8 This workaround has been introduced by `3d41757788` and it is no longer needed since LLVM r346422. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-20 09:40:16 +01:00
Iago Toral Quiroga	d6110d4d54	intel/compiler: move nir_lower_bool_to_int32 before nir_lower_locals_to_regs The former expects to see SSA-only things, but the latter injects registers. The assertions in the lowering where not seeing this because they asserted on the bit_size values only, not on the is_ssa field, so add that assertion too. Fixes: `11dc130779` "nir: Add a bool to int32 lowering pass" CC: mesa-stable@lists.freedesktop.org Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-20 08:02:44 +01:00
Ilia Mirkin	1250383e36	st/mesa: remove sampler associated with buffer texture in pbo logic A long time ago, when this was first implemented, not having a sampler bound would cause problems on Fermi. I didn't work out the reasons, but the solution was simple -- just put the samplers back in. Since then, regular texturing paths appear to have lost their associated samplers which required a fuller investigation and fix in nouveau. Now that this is done, this code should no longer need a sampler state for fetching texels from a buffer texture. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-20 00:27:16 -05:00
Roland Scheidegger	6f4083143b	gallivm: use llvm jit code for decoding s3tc This is (much) faster than using the util fallback. (Note that there's two methods here, one would use a cache, similar to the existing code (although the cache was disabled), except the block decode is done with jit code, the other directly decodes the required pixels. For now don't use the cache (being direct-mapped is suboptimal, but it's difficult to come up with something better which doesn't have too much overhead.) Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-12-20 06:03:20 +01:00
Jason Ekstrand	ec1d5841fa	radv/query: Use 1-bit booleans in query shaders Fixes: `44227453ec` "nir: Switch to using 1-bit Booleans for almost..." Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Tested-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-19 16:36:40 -06:00
Jason Ekstrand	6896c91c10	radv/query: Add a nir_test_flag helper This is little more than an iadd_imm right now but it will help in the next commit where we refactor things further. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Tested-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-19 16:36:26 -06:00
Eduardo Lima Mitev	c2ebc38052	freedreno/ir3: Handle GL_NONE in get_num_components_for_glformat() An earlier patch that introduced the function failed to handle the case where an image format layout qualifier is not specified, which is allowed on desktop GL profiles. In these cases, nir_variable's image format is GL_NONE, and we don't need to print a debug message for those. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-12-19 22:49:05 +01:00
Eric Anholt	90818558f0	docs: Add an encouraging note about providing reviews and acks. Across several projects I've seen new contributors say "I wasn't sure if I should provide a review tag since I'm not really an expert in this area." Everyone I know already applies some implicit weighting to reviews from different people, so encourage participation. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-12-19 12:49:17 -08:00
Eric Anholt	463df0ffe2	docs: Add a note that MRs should still include any r-b or a-b tags. v2: Mention "Tested-by" too Reviewed-by: Dylan Baker <dylan@pnwbakers.com> (v1) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-12-19 12:48:13 -08:00
Eric Anholt	fcfb7f573c	v3d: Load and store aligned utiles all at once. This calls the expensive uif offset function once per utile, but it still gets us a 212.218% +/- 2.41216% (n=10) win on 1024x1024 glTexImage over calling it on each pixel.	2018-12-19 10:27:26 -08:00
Eric Anholt	7c56b7a6ea	v3d: Add a fallthrough path for utile load/store of 32 byte lines. Now that V3D has 8 byte per pixel formats exposed, we've got stride==32 utiles to load and store. Just handle them through the non-NEON paths for now.	2018-12-19 10:27:26 -08:00
Eric Anholt	f6a0f4f41e	vc4: Move the utile load/store functions to a header for reuse by v3d. These implementations of whole-utile load/stores would be the same for v3d, though the layouts of blocks of utiles has changed.	2018-12-19 10:27:26 -08:00
Eric Anholt	8ee752194c	v3d: Implement texture_subdata to reduce teximage upload copies. This lets us store the non-PBO glTexImage data directly into the tiled image without making an extra untiled memcpy for the gallium transfer. Improves 1024x1024 TexImage perf by ~19%, mostly from not thrashing around in the kernel mapping and unmapping the transfer's temporary area.	2018-12-19 10:27:26 -08:00
Eric Anholt	e09d8aecb4	v3d: Remove dead prototypes for load/store utile functions.	2018-12-19 10:27:26 -08:00
Eric Anholt	fcf881adda	v3d: Don't try to create shadow tiled temporaries for 1D textures. They're raster order anyway, so we'd assertion fail along with wasting bandwidth. Fixes: `6ad9e8690d` ("v3d: Add support for texturing from linear.")	2018-12-19 10:27:21 -08:00
Eric Anholt	b5adc744ba	v3d: Fix check for TFU job completion in the simulator. We're waiting for the jobs-completed count to increment (with wrapping), not to reach its starting state. This mostly ended up working out because the next v3d_hw_tick() for a submit CL would end up doing the TFU operation first, but it did fail when a blit was used for glReadPixels() at the end of a test. Fixes: `ee0549ff9a` ("v3d: Add the V3D TFU submit interface to the simulator.")	2018-12-19 10:26:04 -08:00
Eric Anholt	365728dc5d	v3d: Put the dst bo first in the list of BOs for TFU calls. In the UAPI, the first BO is the destination, and the one the kernel should do an exclusive reservation on. Currently we only do exclusive reservations, anyway. However, in the simulator path I was only copying back the "destination" BO (actually src in this case), and this caused regressions once I fixed the simulator to actually complete TFU before returning (since otherwise, the TFU op would happen at the start of the next CL submit and the draw would get the right contents). Fixes: `976ea90bdc` ("v3d: Add support for using the TFU to do some blits.")	2018-12-19 10:26:04 -08:00
Caio Marcelo de Oliveira Filho	947f7b452a	nir: properly find the entry to keep in copy_prop_vars When copy propagation handles a store/copy, it iterates the current copy entries to remove aliases, but keeps the "equal" entry (if exists) to be updated. The removal step may swap the entries around (to ensure there are no holes), invalidating previous iteration pointers. The bug was saving such pointer to use later. Change the code to first perform the removals and then find the remaining right entry. This was causing updates to be lost since they were being made to an entry that was not part of the current copies. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108624 Fixes: `b3c6146925` "nir: Copy propagation between blocks" Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-19 09:33:36 -08:00
Michel Dänzer	9d8395bf0e	winsys/amdgpu: Pull in LLVM CFLAGS Fixes build failure if the LLVM headers aren't in a standard include directory. Fixes: `ec22dd34c8` "radeonsi: move SI_FORCE_FAMILY functionality to winsys" Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-12-19 17:54:18 +01:00
Caio Marcelo de Oliveira Filho	0ddc911f4d	nir: properly clear the entry sources in copy_prop_vars When updating a copy entry source value from a "non-SSA" (the data come from a copy instruction) to a "SSA" (the data or parts of it come from SSA values), it was possible to hold invalid data in ssa[0] depending on the writemask. Because the union, ssa[0] could contain a pointer to a nir_deref_instr left-over from previous non-SSA usage. Change code to clean up the array before use to avoid invalid data around. Fixes: `62332d139c` "nir: Add a local variable-based copy propagation pass" Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-19 08:35:48 -08:00
Eric Engestrom	0e4c7c3d5b	docs: format code blocks a bit nicely Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2018-12-19 16:32:30 +00:00
Eric Engestrom	b0319d0768	docs: add meson cross compilation instructions Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-19 16:31:51 +00:00
Gurchetan Singh	b45aa6290b	virgl: move resource creation / import / destruction to common code We can remove some duplicated code. Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	1d3d311133	virgl: move resource metadata into base resource A resource is just a buffer with some metadata. Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	db77573d7b	virgl: modify how we handle GL_MAP_FLUSH_EXPLICIT_BIT Previously, we ignored the the glUnmap(..) operation and flushed before we flush the cbuf. Now, let's just flush the data when we unmap. Neither method is optimal, for example: glMapBufferRange(.., 0, 100, GL_MAP_FLUSH_EXPLICIT_BIT) glFlushMappedBufferRange(.., 25, 30) glFlushMappedBufferRange(.., 65, 70) We'll end up flushing 25 --> 70. Maybe we can fix this later. v2: Add fixme comment in the code (Elie) Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	11939f6fa2	virgl: make virgl_buffers use resource helpers We can reuse the helpers we created. Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	4e2c77cd51	virgl: make transfer code with PIPE_BUFFER targets util_format_get_blocksize returns 1 for R8 formats (all PIPE_BUFFERs are R8). Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	174f530008	virgl: consolidate transfer code We could allocate and destroy transfers in one place. v2: Keep l_stride around. Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	13626b46f1	virgl: store layer_stride in metadata Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	2a44acc83b	virgl: move vrend_get_tex_image_offset to common code Will be reused. Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	f749229a8e	virgl: move virgl_resource_layout to common code Will be reused. Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	a63da9c062	virgl: move texture metadata to common code Will be reused. Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	6e7d396ad3	virgl: remove unnessecary code With commit 89b479, we moved to tracking buffer cleanliness when binding. TEST=dEQP-GLES31.functional.image_load_store.buffer.load_store.r32ui Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Gurchetan Singh	6d13d1aadb	virgl: texture_transfer_pool --> transfer_pool It's used for all types of resources. Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-12-19 13:29:16 +01:00
Nicolai Hähnle	d73a25f2c0	radeonsi: const-ify the si_query_ops Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:02:07 +01:00
Nicolai Hähnle	c85b0dea0a	radeonsi: split perfcounter queries from si_query_hw Remove a level of indirection to make the code more explicit -- should make it easier to follow what's going on. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:02:04 +01:00
Nicolai Hähnle	e0f0d3675d	radeonsi: factor si_query_buffer logic out of si_query_hw This is a move towards using composition instead of inheritance for different query types. This change weakens out-of-memory error reporting somewhat, though this should be acceptable since we didn't consistently report such errors in the first place. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:02:01 +01:00
Nicolai Hähnle	0fc6e573dd	radeonsi: move query suspend logic into the top-level si_query struct Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:59 +01:00
Nicolai Hähnle	e2b9329f17	radeonsi: move remaining perfcounter code into si_perfcounter.c Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:57 +01:00
Nicolai Hähnle	7dd289d9e4	radeonsi: track constant buffer bind history in si_pipe_set_constant_buffer Other callers of si_set_constant_buffer don't need it. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:54 +01:00
Nicolai Hähnle	829d417914	radeonsi: use si_set_rw_shader_buffer for setting streamout buffers Reduce the number of places that encode buffer descriptors. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:52 +01:00
Nicolai Hähnle	ce785f5ffd	radeonsi: add an si_set_rw_shader_buffer convenience function Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:50 +01:00
Nicolai Hähnle	556c4c42b7	radeonsi: avoid using hard-coded SI_NUM_RW_BUFFERS Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:48 +01:00
Nicolai Hähnle	1e49d72317	radeonsi: show the fixed function TCS in debug dumps This is rather important for merged VS/TCS as LSHS shaders... Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:45 +01:00
Nicolai Hähnle	6e67e79de4	radeonsi: const-ify si_set_tesseval_regs Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:42 +01:00
Nicolai Hähnle	5c841a1b1e	radeonsi: rename SI_RESOURCE_FLAG_FORCE_TILING to clarify its purpose Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:39 +01:00
Nicolai Hähnle	0d58dcc3cf	radeonsi: don't set RAW_WAIT for CP DMA clears There is never a read-after-write hazard because the command doesn't read. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:34 +01:00
Nicolai Hähnle	23af72af25	radeonsi/gfx9: use SET_UCONFIG_REG_INDEX packets when available Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:32 +01:00
Nicolai Hähnle	f18b2ac0db	radeonsi: add si_init_draw_functions and make some functions static Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:30 +01:00
Nicolai Hähnle	555cb668cc	radeonsi: extract declare_vs_blit_inputs Prepare for some later refactoring. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:27 +01:00
Nicolai Hähnle	ec22dd34c8	radeonsi: move SI_FORCE_FAMILY functionality to winsys This helps some debugging cases by initializing addrlib with slightly more appropriate settings. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:25 +01:00
Nicolai Hähnle	0ef263d62f	ac/surface: 3D and cube surfaces are never displayable Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:22 +01:00
Nicolai Hähnle	8efaffa893	amd/common: add i1 special case to ac_build_{inclusive,exclusive}_scan Allow for a unified but efficient treatment of adding a bitmask over a wave or an entire threadgroup. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:19 +01:00
Nicolai Hähnle	300876a9a7	amd/common: scan/reduce across waves of a workgroup Order-aware scan/reduce can trade-off LDS traffic for external atomics memory traffic in producer/consumer compute shaders. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:17 +01:00
Nicolai Hähnle	3963402fd3	amd/common: add ac_build_ifcc Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:15 +01:00
Nicolai Hähnle	3c77f26ccc	amd/common: whitespace fixes Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:12 +01:00
Nicolai Hähnle	76c5ad1995	amd/sid_tables: add additional python3 compatibility imports This happened to bite me while doing some experiments. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:09 +01:00
Nicolai Hähnle	6f0322b16a	r600: remove redundant semicolon Reviewed-By: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:00:49 +01:00
Nicolai Hähnle	7230cb8f2b	ddebug: always flush when requested, even when hang detection is disabled Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 11:59:18 +01:00
Nicolai Hähnle	539fdc49f1	ddebug: simplify watchdog loop and fix crash in the no-timeout case The following race condition could occur in the no-timeout case: API thread Gallium thread Watchdog ---------- -------------- -------- dd_before_draw u_threaded_context draw dd_after_draw add to dctx->records signal watchdog dump & destroy record execute draw dd_after_draw_async use-after-free! Alternatively, the same scenario would assert in a debug build when destroying the record because record->driver_finished has not signaled. Fix this and simplify the logic at the same time by - handing the record pointers off to the watchdog thread before each draw call and - waiting on the driver_finished fence in the watchdog thread Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 11:59:10 +01:00
Tapani Pälli	3627c9efff	anv/android: turn on VK_ANDROID_external_memory_android_hardware_buffer Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:42 +02:00
Tapani Pälli	3dc424a4f4	anv: ignore VkSamplerYcbcrConversion on non-yuv formats This fulfills a requirement for clients that want to utilize same code path for images with external formats (VK_FORMAT_UNDEFINED) and "regular" RGBA images where format is known. This is similar to how OES_EGL_image_external works. To support this, we allow color conversion samplers for non-YUV formats but skip setting up conversion when format does not have can_ycbcr flag set. v2: add comment and bundle can_ycbcr to the existing break condition (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	a7b7772cfb	anv: support VkSamplerYcbcrConversionInfo in vkCreateImageView If a conversion struct was passed, then initialize view using format from the conversion structure. v2: use vk_format directly from the anv_format struct v3: added some assertions (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	bb0721aea4	anv: add VkFormat field as part of anv_format Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	c070b0e25f	anv: support VkExternalFormatANDROID in vkCreateSamplerYcbcrConversion If external format is used, we store the external format identifier in conversion to be used later when creating VkImageView. v2: rebase to `b43f955037` changes v3: added assert, ignore components when creating external format conversion (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	f1654fa7e3	anv/android: support creating images from external format Since we don't know the exact format at creation time, some initialization is done only when bound with memory in vkBindImageMemory. v2: demand dedicated allocation in vkGetImageMemoryRequirements2 if image has external format v3: refactor prepare_ahw_image, support vkBindImageMemory2, calculate stride correctly for rgb(x) surfaces, rename as 'resolve_ahw_image' v4: rebase to `b43f955037` changes v5: add some assertions to verify input correctness (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	517103abf1	anv/android: add ahardwarebuffer external memory properties v2: have separate memory properties for android, set usage flags for buffers correctly v3: code cleanup (Jason) + limit maxArrayLayers to 1 for AHardwareBuffer based images v4: rebase to `b43f955037` changes Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	c79a528d2b	anv/android: support import/export of AHardwareBuffer objects v2: add support for non-image buffers (AHARDWAREBUFFER_FORMAT_BLOB) v3: properly handle usage bits when creating from image v4: refactor, code cleanup (Jason) v5: rebase to `b43f955037` changes, initialize bo flags as ANV_BO_EXTERNAL (Lionel) v6: add assert that anv_bo_cache_import succeeds, add comment about multi-bo support to clarify current implementation (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	5c65c60d6c	anv: refactor, remove else block in AllocateMemory This makes it cleaner to introduce more cases where we import memory from different types of external memory buffers. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	884fc90fde	anv: add anv_ahw_usage_from_vk_usage helper function v2: rebase to `b43f955037` changes Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	1e6a44400a	anv/android: add GetAndroidHardwareBufferPropertiesANDROID Use the anv_format address in formats table as implementation-defined external format identifier for now. When adding YUV format support this might need to change. v2: code cleanup (Jason) v3: set anv_format address as identifier v4: setup suggestedYcbcrModel and suggested[X\|Y]ChromaOffset as expected for HAL_PIXEL_FORMAT_NV12_Y_TILED_INTEL v5: set linear tiling for GPU_DATA_BUFFER usage, add comment about multi-bo support to clarify current implementation (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	aa94e01bfe	anv: add from/to helpers with android and vulkan formats v2: handle R8G8B8X8 as R8G8B8_UNORM (Jason) v3: add HAL_PIXEL_FORMAT_NV12_Y_TILED_INTEL, we make it define for now to avoid direct dependency to minigbm headers Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	c1f15a0a1a	anv: make anv_get_image_format_features public This will be utilized later by GetAndroidHardwareBufferPropertiesANDROID. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	8a469fd335	anv: refactor make_surface to use data from anv_image Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Tapani Pälli	2a98e5bbb9	anv: add create_flags as part of anv_image This will make it possible for next patch to rip anv_image_create_info out from make_surface function. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-19 09:38:41 +02:00
Ian Romanick	96c4b135e3	nir/algebraic: Don't put quotes around floating point literals The quotation marks around 1.0 cause it to be treated as a string instead of a floating point value. The generator then treats it as an arbitrary variable replacement, so any iand involving a ('ineg', ('b2i', a)) matches. v2: Remove misleading comment about sized literals (suggested by Timothy). Add assertion that the name of a varible is entierly alphabetic (suggested by Jason). Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Timothy Arceri <tarceri@itsqueeze.com> [v1] Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> [v1] Fixes: `6bcd2af086` ("nir/algebraic: Add some optimizations for D3D-style Booleans") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109075	2018-12-18 23:28:31 -08:00
Vinson Lee	0f7ba5758b	meson: Fix libsensors detection. Fixes: `5e71efef44` ("meson: Add lmsensors support") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-18 19:24:01 -08:00
Vinson Lee	84f39e5971	meson: Fix typo. Fixes: `6b4c7047d5` ("meson: build gallium nine state_tracker") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-18 19:14:11 -08:00
Sagar Ghuge	933c44bcc4	nir: Add a new lowering option to lower 3D surfaces from txd to txl. Tested on gen9. v2: Rename lower_txd_3d_surafaces flag to lower_txd_3d (Jason Ekstrand) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-18 13:44:09 -08:00
Christian Gmeiner	7ea8e54dd6	meson: add etnaviv to the tools option Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-18 21:50:58 +01:00
Adam Jackson	e36d136102	specs: Bump GLX_MESA_query_renderer to version 9 Note that we have an official GL extension number, pick the appropriate section of the GLX spec to modify, and add changelog. Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Adam Jackson <ajax@redhat.com>	2018-12-18 15:46:10 -05:00
Adam Jackson	9e8332ebc2	specs: Remove GLX_RENDERER_ID_MESA from GLX_MESA_query_renderer This has not even had an attempt at implementation. If you asked for renderer 0 - which, the spec implies, should always work - then dri2_convert_glx_attribs would fail, we'd silently fall back to creating an indirect context, and xserver would also not recognize the attribute and would throw BadValue at you. The API would be difficult to use in any case, since there's no way to enumerate how many renderers the screen has. I'd be tempted to add that by defining: glXQueryRendererIntegerMESA(dpy, screen, /* renderer = */ -1, 0, &value); to return the number of renderers, but a new entrypoint might be cleaner. Still, better to not specify it at all than to lie about it. Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Adam Jackson <ajax@redhat.com>	2018-12-18 15:46:10 -05:00
Adam Jackson	c63c391756	specs: Remove GLES profile interaction text from GLX_MESA_query_renderer In one place we say, if GLES isn't supported then the profile version will be 0.0. Then later we say, if the GLES profile extension isn't supported then GLX_RENDERER_OPENGL_ES_PROFILE_VERSION_MESA is not mentioned in the spec. A strict reading of the latter would mean that GLX_RENDERER_OPENGL_ES_PROFILE_VERSION_MESA is not a recognized token, and the query should instead return False. The implementation does not check for the GLES profile extensions, and the additional complexity doesn't seem worth it. Removing the interaction text makes the spec match the implementation. Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Adam Jackson <ajax@redhat.com>	2018-12-18 15:46:10 -05:00
Eduardo Lima Mitev	5820e63418	freedreno/ir3: Make imageStore use num components from image format emit_intrinsic_store_image() is always using 4 components when collecting registers for the value. When image has less than 4 components (e.g, r32f, rg32i, etc) this results in extra mov instructions. This patch uses the actual number of components from the image format. For example, in a shader like: layout (r32f, binding=0) writeonly uniform imageBuffer u_image; ... void main(void) { ... imageStore (u_image, some_offset, vec4(1.0)); ... } instruction count is reduced in at least 3 instructions (note image format is r32f, 1 component only). This obviously reduces register pressure as well. v2: - Added support for image formats from NV_image_format extension (Ilia Mirkin). - Return 4 components by default instead of asserting. (Rob Clark). v3: Added more missing formats (Ilia Mirkin). v4: Added a debug message for unknown image formats (Rob Clark). Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-12-18 21:15:20 +01:00
Jason Ekstrand	5dad1abfdc	nir/dead_write_vars: Get modes directly from derefs Instead of going all the way back to the variable, just look at the deref. The modes are guaranteed to be the same by nir_validate whenever the variable can be found. This fixes clear_unused_for_modes for derefs that don't have an accessible variable. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-12-18 13:13:28 -06:00
Jason Ekstrand	fa40a58fd9	nir/copy_prop_vars: Get modes directly from derefs Instead of going all the way back to the variable, just look at the deref. The modes are guaranteed to be the same by nir_validate whenever the variable can be found. This fixes apply_barrier_for_modes for derefs that don't have an accessible variable. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-12-18 13:13:28 -06:00
Jason Ekstrand	cf7fb39805	nir/lower_wpos_center: Look at derefs for modes This is instead of looking all the way back to the variable which may not exist for all derefs. This makes this code properly ignore casts with modes other than the mode[s] we care about (where casts aren't allowed). Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-12-18 13:13:28 -06:00
Jason Ekstrand	867fe35a16	nir/lower_io_to_scalar: Look at derefs for modes This is instead of looking all the way back to the variable which may not exist for all derefs. This makes this code properly ignore casts with modes other than the mode[s] we care about (where casts aren't allowed). Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-12-18 13:13:28 -06:00
Jason Ekstrand	3fe0363dda	nir/lower_io_arrays_to_elements: Look at derefs for modes This is instead of looking all the way back to the variable which may not exist for all derefs. This makes this code properly ignore casts with modes other than the mode[s] we care about (where casts aren't allowed). Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-12-18 13:13:28 -06:00
Jason Ekstrand	8cc0f92492	nir/linking_helpers: Look at derefs for modes This is instead of looking all the way back to the variable which may not exist for all derefs. This makes this code properly ignore casts with modes other than the mode[s] we care about (where casts aren't allowed). Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-12-18 13:13:28 -06:00
Jason Ekstrand	8410cf66d7	nir/propagate_invariant: Skip unknown vars If we can't find the variable from the deref, just assume it isn't invariant and continue on. This can happen if, for instance, we're writing to a deref that points into an SSBO. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-12-18 13:13:28 -06:00
Ian Romanick	29e4b949b4	Revert "nir/lower_indirect: Bail early if modes == 0" "There's no point in walking the program if we're never going to actually lower anything." Except we might lower compacted local arrays. In that case, modes will be 0, but there is still lowering to be done. This reverts commit `7f75cf2a94`. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109081 Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Clayton Craft <clayton.a.craft@intel.com> Cc: Kenneth Graunke <kenneth@whitecape.org>	2018-12-18 10:47:54 -08:00
Lucas Stach	433ca3127a	st/dri: replace format conversion functions with single mapping table Each time I have to touch the buffer import/export functions in the dri state tracker I get lost in the maze of functions converting between DRI_IMAGE_FOURCC, DRI_IMAGE_FORMAT, DRI_IMAGE_COMPONENTS and pipe format. Rip it out and replace by a single table, which defines the correspondence between the different representations. Also this now stores all the known representations in the __DRIimageRec, to avoid the loss of information we currently have when importing a buffer with a fourcc, which doesn't have a corresponding dri format. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-18 19:19:45 +01:00
Lucas Stach	67174d40f1	st/dri: allow both render and sampler compatible dma-buf formats Currently all the EGL APIs are missing a way to specify how an imported dma-buf is intended to be used. Demanding the format to be both usable for sampling and rendering artificially restricts the list of formats a driver is able to import. Looking at how the Intel driver implements those DRI2 image APIs it doesn't distinguish between render or sampler compatible formats. So this patch aligns behavior between Intel and Gallium based drivers. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-18 19:19:40 +01:00
Lucas Stach	a3e592e839	etnaviv: use surface format directly There is no need to do the detour over the resource behind the surface to get the format. Use the surface format directly. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de>	2018-12-18 19:07:10 +01:00
Dylan Baker	7a90886921	meson: Add toggle for glx-direct GNU Hurd needs to turn off glx-direct, rather than special case it, we'll just add a toggle. CC: 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-12-18 09:20:53 -08:00
Dylan Baker	8c77f4c76d	meson: Add support for gnu hurd CC: 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-12-18 09:20:49 -08:00
Dylan Baker	6cf5f25bc5	meson: remove duplicate definition Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-12-18 09:18:12 -08:00
Dylan Baker	e430a034b9	meson: Fix ppc64 little endian detection Old versions of meson returned ppc64le as the cpu_family for little endian power8 cpus, versions >=0.48 don't do this, so the check wouldn't work in that case. This generalizes the check to work for both old and new versions of meson. Fixes: `34bbb24ce7` ("meson: Add support for ppc assembly/optimizations") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-12-18 09:17:54 -08:00
Jason Ekstrand	3feda3cf35	anv: Bump the patch version to 96 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-18 09:40:46 -06:00
Kenneth Graunke	3c71ba3baa	i965: Don't override subslice count to 4 on Gen11. Gen9-10 have fewer than 4 subslices per slice, so they need this to be rounded up. Gen11 isn't documented as needing this hack, and it can also have more than 4 subslices, so the hack actually can break things. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2018-12-17 14:03:45 -08:00
Ian Romanick	af07141b33	intel/compiler: More peephole_select for pre-Gen6 No shader-db changes on any Gen6+ platform. All of the shaders with cycles hurt by more than ~2% are from Master of Orion. All of the shaders have instructions helped. It looks like the pass enables some control flow to be converted to bcsels, then the scheduler does dumb things. These are new shaders (just added before doing this shader-db run), so there's probably some low-hanging fruit. Iron Lake total instructions in shared programs: 8214327 -> 8213684 (<.01%) instructions in affected programs: 84469 -> 83826 (-0.76%) helped: 114 HURT: 26 helped stats (abs) min: 2 max: 18 x̄: 7.75 x̃: 9 helped stats (rel) min: 0.17% max: 13.73% x̄: 2.52% x̃: 1.05% HURT stats (abs) min: 2 max: 20 x̄: 9.23 x̃: 8 HURT stats (rel) min: 0.70% max: 2.48% x̄: 1.66% x̃: 1.61% 95% mean confidence interval for instructions value: -5.87 -3.32 95% mean confidence interval for instructions %-change: -2.32% -1.17% Instructions are helped. total cycles in shared programs: 187736850 -> 187749314 (<.01%) cycles in affected programs: 506750 -> 519214 (2.46%) helped: 104 HURT: 36 helped stats (abs) min: 2 max: 72 x̄: 21.96 x̃: 16 helped stats (rel) min: 0.02% max: 6.16% x̄: 0.97% x̃: 0.63% HURT stats (abs) min: 4 max: 1402 x̄: 409.67 x̃: 40 HURT stats (rel) min: 0.33% max: 23.12% x̄: 5.79% x̃: 1.39% 95% mean confidence interval for cycles value: 28.32 149.74 95% mean confidence interval for cycles %-change: -0.07% 1.61% Inconclusive result (%-change mean confidence interval includes 0). GM45 total instructions in shared programs: 5044014 -> 5043652 (<.01%) instructions in affected programs: 46751 -> 46389 (-0.77%) helped: 63 HURT: 13 helped stats (abs) min: 2 max: 29 x̄: 7.65 x̃: 9 helped stats (rel) min: 0.17% max: 13.73% x̄: 2.93% x̃: 1.04% HURT stats (abs) min: 2 max: 20 x̄: 9.23 x̃: 8 HURT stats (rel) min: 0.66% max: 2.35% x̄: 1.58% x̃: 1.52% 95% mean confidence interval for instructions value: -6.54 -2.99 95% mean confidence interval for instructions %-change: -3.04% -1.28% Instructions are helped. total cycles in shared programs: 128143042 -> 128150188 (<.01%) cycles in affected programs: 324564 -> 331710 (2.20%) helped: 57 HURT: 19 helped stats (abs) min: 6 max: 74 x̄: 30.70 x̃: 32 helped stats (rel) min: 0.08% max: 4.74% x̄: 1.22% x̃: 0.81% HURT stats (abs) min: 10 max: 1400 x̄: 468.21 x̃: 60 HURT stats (rel) min: 0.56% max: 19.94% x̄: 5.80% x̃: 1.70% 95% mean confidence interval for cycles value: 6.90 181.15 95% mean confidence interval for cycles %-change: -0.52% 1.59% Inconclusive result (%-change mean confidence interval includes 0). Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-17 13:47:06 -08:00
Ian Romanick	378f996771	nir/opt_peephole_select: Don't peephole_select expensive math instructions On some GPUs, especially older Intel GPUs, some math instructions are very expensive. On those architectures, don't reduce flow control to a csel if one of the branches contains one of these expensive math instructions. This prevents a bunch of cycle count regressions on pre-Gen6 platforms with a later patch (intel/compiler: More peephole select for pre-Gen6). v2: Remove stray #if block. Noticed by Thomas. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-17 13:47:06 -08:00
Ian Romanick	8fb8ebfbb0	intel/compiler: More peephole select Shader-db results: The one shader hurt for instructions is a compute shader that had both spills and fills hurt. v2: Fix typo in comment noticed by Caio. v3: Fix inverted condition in brw_nir.c. Noticed by Lionel. Skylake, Broadwell, and Haswell had similar results. (Skylake shown) total instructions in shared programs: 15072761 -> 15047884 (-0.17%) instructions in affected programs: 895539 -> 870662 (-2.78%) helped: 3623 HURT: 1 helped stats (abs) min: 1 max: 181 x̄: 6.89 x̃: 4 helped stats (rel) min: 0.10% max: 25.00% x̄: 3.93% x̃: 3.20% HURT stats (abs) min: 92 max: 92 x̄: 92.00 x̃: 92 HURT stats (rel) min: 1.92% max: 1.92% x̄: 1.92% x̃: 1.92% 95% mean confidence interval for instructions value: -7.10 -6.63 95% mean confidence interval for instructions %-change: -4.03% -3.82% Instructions are helped. total cycles in shared programs: 369738930 -> 369535732 (-0.05%) cycles in affected programs: 68027851 -> 67824653 (-0.30%) helped: 2609 HURT: 1035 helped stats (abs) min: 1 max: 4508 x̄: 181.44 x̃: 77 helped stats (rel) min: <.01% max: 71.31% x̄: 9.14% x̃: 5.47% HURT stats (abs) min: 1 max: 33336 x̄: 261.04 x̃: 20 HURT stats (rel) min: <.01% max: 47.61% x̄: 2.93% x̃: 1.47% 95% mean confidence interval for cycles value: -96.43 -15.09 95% mean confidence interval for cycles %-change: -6.07% -5.36% Cycles are helped. total spills in shared programs: 10158 -> 10159 (<.01%) spills in affected programs: 166 -> 167 (0.60%) helped: 1 HURT: 1 total fills in shared programs: 22105 -> 22116 (0.05%) fills in affected programs: 837 -> 848 (1.31%) helped: 4 HURT: 1 Ivy Bridge total instructions in shared programs: 12021190 -> 11990256 (-0.26%) instructions in affected programs: 910561 -> 879627 (-3.40%) helped: 3344 HURT: 18 helped stats (abs) min: 1 max: 99 x̄: 9.29 x̃: 6 helped stats (rel) min: 0.11% max: 31.18% x̄: 5.19% x̃: 3.31% HURT stats (abs) min: 2 max: 20 x̄: 7.89 x̃: 6 HURT stats (rel) min: 0.70% max: 2.59% x̄: 1.63% x̃: 1.70% 95% mean confidence interval for instructions value: -9.49 -8.91 95% mean confidence interval for instructions %-change: -5.32% -4.98% Instructions are helped. total cycles in shared programs: 179077826 -> 178570196 (-0.28%) cycles in affected programs: 63205667 -> 62698037 (-0.80%) helped: 2767 HURT: 620 helped stats (abs) min: 1 max: 7531 x̄: 217.58 x̃: 88 helped stats (rel) min: <.01% max: 75.86% x̄: 9.59% x̃: 6.09% HURT stats (abs) min: 1 max: 31255 x̄: 152.27 x̃: 11 HURT stats (rel) min: <.01% max: 36.36% x̄: 2.77% x̃: 0.58% 95% mean confidence interval for cycles value: -173.94 -125.81 95% mean confidence interval for cycles %-change: -7.68% -6.97% Cycles are helped. Sandy Bridge total instructions in shared programs: 10852569 -> 10843758 (-0.08%) instructions in affected programs: 235803 -> 226992 (-3.74%) helped: 800 HURT: 0 helped stats (abs) min: 1 max: 88 x̄: 11.01 x̃: 8 helped stats (rel) min: 0.11% max: 23.08% x̄: 4.69% x̃: 3.36% 95% mean confidence interval for instructions value: -11.93 -10.10 95% mean confidence interval for instructions %-change: -4.99% -4.39% Instructions are helped. total cycles in shared programs: 154732047 -> 154608941 (-0.08%) cycles in affected programs: 4063110 -> 3940004 (-3.03%) helped: 606 HURT: 253 helped stats (abs) min: 1 max: 2524 x̄: 227.93 x̃: 62 helped stats (rel) min: 0.02% max: 39.24% x̄: 4.36% x̃: 1.81% HURT stats (abs) min: 1 max: 1966 x̄: 59.36 x̃: 11 HURT stats (rel) min: 0.02% max: 67.10% x̄: 3.22% x̃: 0.67% 95% mean confidence interval for cycles value: -170.49 -116.13 95% mean confidence interval for cycles %-change: -2.61% -1.65% Cycles are helped. No change on Iron Lake or GM45. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-17 13:47:06 -08:00
Ian Romanick	09b7e1d8e4	nir/opt_peephole_select: Don't try to remove flow control around indirect loads That flow control may be trying to avoid invalid loads. On at least some platforms, those loads can also be expensive. No shader-db changes on any Intel platform (even with the later patch "intel/compiler: More peephole select"). v2: Add a 'indirect_load_ok' flag to nir_opt_peephole_select. Suggested by Rob. See also the big comment in src/intel/compiler/brw_nir.c. v3: Use nir_deref_instr_has_indirect instead of deref_has_indirect (from nir_lower_io_arrays_to_elements.c). v4: Fix inverted condition in brw_nir.c. Noticed by Lionel. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-17 13:47:06 -08:00
Ian Romanick	4cd1a0be76	i965/vec4: Propagate conditional modifiers from more compares to other compares If there is a CMP.NZ that compares a single component (via a .zzzz swizzle, for example) with 0, it can propagate its conditional modifier back to a previous CMP that writes only that component. The specific case that I saw was: cmp.l.f0(8) g42<1>.xF g61<4>.xF (abs)g18<4>.zF ... cmp.nz.f0(8) null<1>D g42<4>.xD 0D In this case we can just delete the second CMP. No changes on Broadwell or Skylake because they do not use the vec4 backend. Also no changes on GM45 or Iron Lake. Sandy Bridge, Ivy Bridge, and Haswell had similar results. (Sandy Bridge shown) total instructions in shared programs: 10856676 -> 10852569 (-0.04%) instructions in affected programs: 228322 -> 224215 (-1.80%) helped: 1331 HURT: 0 helped stats (abs) min: 1 max: 7 x̄: 3.09 x̃: 4 helped stats (rel) min: 0.11% max: 6.67% x̄: 1.88% x̃: 1.83% 95% mean confidence interval for instructions value: -3.19 -2.99 95% mean confidence interval for instructions %-change: -1.93% -1.83% Instructions are helped. total cycles in shared programs: 154788865 -> 154732047 (-0.04%) cycles in affected programs: 2485892 -> 2429074 (-2.29%) helped: 1097 HURT: 59 helped stats (abs) min: 2 max: 168 x̄: 51.96 x̃: 64 helped stats (rel) min: 0.12% max: 12.70% x̄: 3.44% x̃: 2.22% HURT stats (abs) min: 2 max: 16 x̄: 3.02 x̃: 2 HURT stats (rel) min: 0.18% max: 0.83% x̄: 0.64% x̃: 0.71% 95% mean confidence interval for cycles value: -51.04 -47.26 95% mean confidence interval for cycles %-change: -3.40% -3.07% Cycles are helped. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-17 13:47:06 -08:00
Ian Romanick	9a83c3d3b3	i965/fs: Eliminate unary op on operand of compare-with-zero The (-abs(x) >= 0) => (x == 0) optimization is removed from the vec4 and scalar parts. In the VS part, adding the new pattern was not helpful. The pattern that is removed is really old, and it has been handled by NIR for ages. All Gen7+ platforms had similar results. (Broadwell shown) total instructions in shared programs: 14715715 -> 14715709 (<.01%) instructions in affected programs: 474 -> 468 (-1.27%) helped: 6 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 1.12% max: 1.35% x̄: 1.28% x̃: 1.35% 95% mean confidence interval for instructions value: -1.00 -1.00 95% mean confidence interval for instructions %-change: -1.40% -1.15% Instructions are helped. total cycles in shared programs: 559569911 -> 559569809 (<.01%) cycles in affected programs: 5963 -> 5861 (-1.71%) helped: 6 HURT: 0 helped stats (abs) min: 16 max: 18 x̄: 17.00 x̃: 17 helped stats (rel) min: 1.45% max: 1.88% x̄: 1.73% x̃: 1.85% 95% mean confidence interval for cycles value: -18.15 -15.85 95% mean confidence interval for cycles %-change: -1.95% -1.51% Cycles are helped. Iron Lake and Sandy Bridge had similar results. (Iron Lake shown) total instructions in shared programs: 7780915 -> 7780913 (<.01%) instructions in affected programs: 246 -> 244 (-0.81%) helped: 2 HURT: 0 total cycles in shared programs: 177876108 -> 177876106 (<.01%) cycles in affected programs: 3636 -> 3634 (-0.06%) helped: 1 HURT: 0 GM45 total instructions in shared programs: 4799152 -> 4799151 (<.01%) instructions in affected programs: 126 -> 125 (-0.79%) helped: 1 HURT: 0 total cycles in shared programs: 122052654 -> 122052652 (<.01%) cycles in affected programs: 3640 -> 3638 (-0.05%) helped: 1 HURT: 0 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-17 13:47:06 -08:00
Ian Romanick	440c051340	i965/vec4/dce: Don't narrow the write mask if the flags are used In an instruction sequence like cmp(8).ge.f0.0 vgrf17:D, vgrf2.xxxx:D, vgrf9.xxxx:D (+f0.0) sel(8) vgrf1:UD, vgrf8.xyzw:UD, vgrf1.xyzw:UD The other fields of vgrf17 may be unused, but the CMP still needs to generate the other flag bits. To my surprise, nothing in shader-db or any test suite appears to hit this. However, I have a change to brw_vec4_cmod_propagation that creates cases where this can happen. This fix prevents a couple dozen regressions in that patch. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `5df88c20` ("i965/vec4: Rewrite dead code elimination to use live in/out.")	2018-12-17 13:47:06 -08:00
Ian Romanick	111bcc8d02	i965/vec4: Silence unused parameter warnings in vec4 compiler tests src/intel/compiler/test_vec4_copy_propagation.cpp: In member function ‘virtual brw::dst_reg* copy_propagation_vec4_visitor::make_reg_for_system_value(int)’: src/intel/compiler/test_vec4_copy_propagation.cpp:57:51: warning: unused parameter ‘location’ [-Wunused-parameter] virtual dst_reg make_reg_for_system_value(int location) ^~~~~~~~ src/intel/compiler/test_vec4_copy_propagation.cpp: In member function ‘virtual void copy_propagation_vec4_visitor::emit_urb_write_header(int)’: src/intel/compiler/test_vec4_copy_propagation.cpp:77:43: warning: unused parameter ‘mrf’ [-Wunused-parameter] virtual void emit_urb_write_header(int mrf) ^~~ src/intel/compiler/test_vec4_copy_propagation.cpp: In member function ‘virtual brw::vec4_instruction copy_propagation_vec4_visitor::emit_urb_write_opcode(bool)’: src/intel/compiler/test_vec4_copy_propagation.cpp:82:57: warning: unused parameter ‘complete’ [-Wunused-parameter] virtual vec4_instruction emit_urb_write_opcode(bool complete) ^~~~~~~~ src/intel/compiler/test_vec4_register_coalesce.cpp: In member function ‘virtual brw::dst_reg register_coalesce_vec4_visitor::make_reg_for_system_value(int)’: src/intel/compiler/test_vec4_register_coalesce.cpp:60:51: warning: unused parameter ‘location’ [-Wunused-parameter] virtual dst_reg make_reg_for_system_value(int location) ^~~~~~~~ src/intel/compiler/test_vec4_register_coalesce.cpp: In member function ‘virtual void register_coalesce_vec4_visitor::emit_urb_write_header(int)’: src/intel/compiler/test_vec4_register_coalesce.cpp:80:43: warning: unused parameter ‘mrf’ [-Wunused-parameter] virtual void emit_urb_write_header(int mrf) ^~~ src/intel/compiler/test_vec4_register_coalesce.cpp: In member function ‘virtual brw::vec4_instruction register_coalesce_vec4_visitor::emit_urb_write_opcode(bool)’: src/intel/compiler/test_vec4_register_coalesce.cpp:85:57: warning: unused parameter ‘complete’ [-Wunused-parameter] virtual vec4_instruction emit_urb_write_opcode(bool complete) ^~~~~~~~ src/intel/compiler/test_vec4_cmod_propagation.cpp: In member function ‘virtual brw::dst_reg cmod_propagation_vec4_visitor::make_reg_for_system_value(int)’: src/intel/compiler/test_vec4_cmod_propagation.cpp:60:51: warning: unused parameter ‘location’ [-Wunused-parameter] virtual dst_reg make_reg_for_system_value(int location) ^~~~~~~~ src/intel/compiler/test_vec4_cmod_propagation.cpp: In member function ‘virtual void cmod_propagation_vec4_visitor::emit_urb_write_header(int)’: src/intel/compiler/test_vec4_cmod_propagation.cpp:85:43: warning: unused parameter ‘mrf’ [-Wunused-parameter] virtual void emit_urb_write_header(int mrf) ^~~ src/intel/compiler/test_vec4_cmod_propagation.cpp: In member function ‘virtual brw::vec4_instruction cmod_propagation_vec4_visitor::emit_urb_write_opcode(bool)’: src/intel/compiler/test_vec4_cmod_propagation.cpp:90:57: warning: unused parameter ‘complete’ [-Wunused-parameter] virtual vec4_instruction *emit_urb_write_opcode(bool complete) ^~~~~~~~ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-17 13:47:06 -08:00
Bas Nieuwenhuizen	f67dea5e19	radv: Fix multiview depth clears We were not using the view mask for depth clears, causing only the first view to be cleared. Fixes: `2e86f6b259` "radv: Add multiview clears." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-17 20:16:26 +00:00
Bas Nieuwenhuizen	9add63a3a5	radv: Remove redundant format check. The switch directly after the check has a default case that returns NULL too, so the effective return value is not changed. Also this check is wrong once we start dealing with formats introduced by an extension (e.g. YUV formats). Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-17 20:09:38 +00:00
Eric Anholt	708d8f4d0a	nir: Fix clamping of uints for image store lowering. I botched some copy-and-paste and clamped to signed int max instead of uint max. Fixes KHR-GL46.shader_image_load_store.multiple-uniforms on skl. Fixes: `d3e046e76c` ("nir: Pull some of intel's image load/store format conversion to nir_format.h") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-17 20:02:22 +00:00
Eric Anholt	00e2cbc049	v3d: Fix the argument type for vir_BRANCH(). Apparently this has been spewing warnings for Jason's clang, but not my gcc.	2018-12-17 09:52:23 -08:00
Eric Anholt	376054fff3	vc4: Reuse nir_format_convert.h in our blend lowering. These helpers came along after and have effectively the same implementation.	2018-12-17 09:52:23 -08:00
Samuel Pitoiset	445867c80d	radv: report Vulkan version 1.1.90 for real I thought the value was correctly propagated, but actually not. Fixes: `2ac6d55f38` ("radv: bump reported version to 1.1.90") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-17 17:51:48 +01:00
Jason Ekstrand	cae373117c	anv,radv: Re-enable VK_EXT_pci_bus_info Now at version 2 with the fixed header. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-17 10:42:35 -06:00
Jason Ekstrand	e5b59fe6f5	vulkan: Update the XML and headers to 1.1.96 Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-17 10:41:56 -06:00
Rhys Perry	ef198e8c6a	radv: switch from nir_bcsel to nir_b32csel Fixes: `191a1dce92` ('nir: Add 1-bit Boolean opcodes') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-17 14:52:39 +00:00
Rhys Perry	bba94a3d85	radv: don't set surf_index for stencil-only images Fixes: `f8d5b377c8` ('radv: set cb base tile swizzles for MRT speedups (v4)') Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108116 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-17 14:52:10 +00:00
Ian Romanick	9dc135efa1	nir: Release per-block metadata in nir_sweep nir_sweep already marks all metadata invalid, so it is safe to release the memory here too. mean soft fp64 using uint64: 1,342,759,331 => 1,010,670,475 gfxbench5 aztec ruins high 11: 63,555,571 => 61,889,811 deus ex mankind divided 148: 62,845,304 => 62,829,640 deus ex mankind divided 2890: 71,922,686 => 71,922,686 dirt showdown 676: 69,238,607 => 69,238,607 dolphin ubershaders 210: 77,822,072 => 77,822,072 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-16 14:39:56 -08:00
Ian Romanick	7adafd6e1c	nir: Fix holes in nir_instr Found using pahole. Changes in peak memory usage according to Valgrind massif: mean soft fp64 using uint64: 1,343,991,403 => 1,342,759,331 gfxbench5 aztec ruins high 11: 63,619,971 => 63,555,571 deus ex mankind divided 148: 62,887,728 => 62,845,304 deus ex mankind divided 2890: 72,399,750 => 71,922,686 dirt showdown 676: 69,464,023 => 69,238,607 dolphin ubershaders 210: 78,359,728 => 77,822,072 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-16 14:39:56 -08:00
Ian Romanick	8161a87b24	nir/phi_builder: Use per-value hash table to store [block] -> def mapping Replace the old array in each value with a hash table in each value. Changes in peak memory usage according to Valgrind massif: mean soft fp64 using uint64: 5,499,875,082 => 1,343,991,403 gfxbench5 aztec ruins high 11: 63,619,971 => 63,619,971 deus ex mankind divided 148: 62,887,728 => 62,887,728 deus ex mankind divided 2890: 72,402,222 => 72,399,750 dirt showdown 676: 74,466,431 => 69,464,023 dolphin ubershaders 210: 109,630,376 => 78,359,728 Run-time change for a full run on shader-db on my Haswell desktop (with -march=native) is 1.22245% +/- 0.463879% (n=11). This is about +2.9 seconds on a 237 second run. The first time I sent this version of this patch out, the run-time data was quite different. I had misconfigured the script that ran the test, and none of the tests from higher GLSL versions were run. These are generally more complex shaders, and they are more affected by this change. The previous version of this patch used a single hash table for the whole phi builder. The mapping was from [value, block] -> def, so a separate allocation was needed for each [value, block] tuple. There was quite a bit of per-allocation overhead (due to ralloc), so the patch was followed by a patch that added the use of the slab allocator. The results of those two patches was not quite as good: mean soft fp64 using uint64: 5,499,875,082 => 1,343,991,403 gfxbench5 aztec ruins high 11: 63,619,971 => 63,619,971 deus ex mankind divided 148: 62,887,728 => 62,887,728 deus ex mankind divided 2890: 72,402,222 => 72,402,222 * dirt showdown 676: 74,466,431 => 72,443,591 * dolphin ubershaders 210: 109,630,376 => 81,034,320 * The * denote tests that are better now. In the tests that are the same in both patches, the "after" peak memory usage was at a different location. I did not check the local peaks. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-16 14:39:56 -08:00
Ian Romanick	e3043e1276	util/hash_table: Add _mesa_hash_table_init function Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-16 14:39:56 -08:00
Jason Ekstrand	db197fdb6c	st/nir: Use nir_src_as_uint for tokens Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-16 15:07:28 -06:00
Jason Ekstrand	47e1e0692c	radv: Fix a stupid if in gather_intrinsic_info Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 15:06:07 -06:00
Jason Ekstrand	6bcd2af086	nir/algebraic: Add some optimizations for D3D-style Booleans D3D Booleans use a 32-bit 0/-1 representation. Because this previously matched NIR exactly, we didn't have to really optimize for it. Now that we have 1-bit Booleans, we need some specific optimizations to chew through the D3D12-style Booleans. Shader-db results on Kaby Lake: total instructions in shared programs: 15136811 -> 14967944 (-1.12%) instructions in affected programs: 2457021 -> 2288154 (-6.87%) helped: 8318 HURT: 10 total cycles in shared programs: 373544524 -> 359701825 (-3.71%) cycles in affected programs: 151029683 -> 137186984 (-9.17%) helped: 7749 HURT: 682 total loops in shared programs: 4431 -> 4399 (-0.72%) loops in affected programs: 32 -> 0 helped: 21 HURT: 0 total spills in shared programs: 10290 -> 10051 (-2.32%) spills in affected programs: 2532 -> 2293 (-9.44%) helped: 18 HURT: 18 total fills in shared programs: 22203 -> 21732 (-2.12%) fills in affected programs: 3319 -> 2848 (-14.19%) helped: 18 HURT: 18 Note that a large chunk of the improvement fixing regressions caused by switching to 1-bit Booleans. Previously, our ability to optimize D3D booleans was improved by using the D3D representation directly in NIR. Now that NIR does 1-bit bools, we need a few more optimizations. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	3b30814791	nir/algebraic: Optimize 1-bit Booleans Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	44227453ec	nir: Switch to using 1-bit Booleans for almost everything This is a squash of a few distinct changes: glsl,spirv: Generate 1-bit Booleans Revert "Use 32-bit opcodes in the NIR producers and optimizations" Revert "nir/builder: Generate 32-bit bool opcodes transparently" nir/builder: Generate 1-bit Booleans in nir_build_imm_bool Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	11dc130779	nir: Add a bool to int32 lowering pass We also enable it in all of the NIR drivers. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	191a1dce92	nir: Add 1-bit Boolean opcodes We also have to add support for 1-bit integers while we're here so we get 1-bit variants of iand, ior, and inot. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	615cc26b97	nir/algebraic: Generalize an optimization This just makes it nicely scale across bit sizes. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	487514ae61	nir/large_constants: Properly handle 1-bit bools Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	3191a82372	nir: Add support for 1-bit data types This commit adds support for 1-bit Booleans and integers. Booleans obviously take a value of true or false. Because we have to define the semantics of 1-bit signed and unsigned integers, we define uint1_t to take values of 0 and 1 and int1_t to take values of 0 and -1. 1-bit arithmetic is then well-defined in the usual way, just with fewer bits. The definition of int1_t and uint1_t doesn't usually matter but we do need something for purposes of constant folding. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	2fe8708ffd	nir/constant_expressions: Rework Boolean handling This commit contains three related changes. First, we define boolN_t for N = 8, 16, and 64 and move the definition of boolN_vec to the loop with the other vec definitions. Second, there's no reason why we need the != 0 on the source because that happens implicitly when it's converted to bool. Third, for destinations, we use a signed integer type and just do -(int)bool_val which will give us the 0/-1 behavior we want and neatly scales to all bit widths. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	80e8dfe9de	nir: Rename Boolean-related opcodes to include 32 in the name This is a squash of a bunch of individual changes: nir/builder: Generate 32-bit bool opcodes transparently nir/algebraic: Remap Boolean opcodes to the 32-bit variant Use 32-bit opcodes in the NIR producers and optimizations Generated with a little hand-editing and the following sed commands: sed -i 's/nir_op_ball_fequal/nir_op_b32all_fequal/g' */.c sed -i 's/nir_op_bany_fnequal/nir_op_b32any_fnequal/g' */.c sed -i 's/nir_op_ball_iequal/nir_op_b32all_iequal/g' */.c sed -i 's/nir_op_bany_inequal/nir_op_b32any_inequal/g' */.c sed -i 's/nir_op_$[fiu]lt$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]ge$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]ne$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]eq$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fi]$ne32g/nir_op_\1neg/g' */.c sed -i 's/nir_op_bcsel/nir_op_b32csel/g' */.c Use 32-bit opcodes in the NIR back-ends Generated with a little hand-editing and the following sed commands: sed -i 's/nir_op_ball_fequal/nir_op_b32all_fequal/g' */.c sed -i 's/nir_op_bany_fnequal/nir_op_b32any_fnequal/g' */.c sed -i 's/nir_op_ball_iequal/nir_op_b32all_iequal/g' */.c sed -i 's/nir_op_bany_inequal/nir_op_b32any_inequal/g' */.c sed -i 's/nir_op_$[fiu]lt$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]ge$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]ne$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]eq$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fi]$ne32g/nir_op_\1neg/g' */.c sed -i 's/nir_op_bcsel/nir_op_b32csel/g' */.c Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	b569093566	nir/algebraic: Make an optimization more specific Later in this series, bool is not going to imply 32-bit. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	517099809a	nir: Drop support for lower_b2f This was originally added for the out-of-tree Mali driver but I think we've all agreed it's easy enough for them to just do in their back-end. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	4bb1a34727	nir/algebraic: Optimize x2b(xneg(a)) -> a Shader-db results on Kaby Lake: total instructions in shared programs: 15072525 -> 15072525 (0.00%) instructions in affected programs: 0 -> 0 helped: 0 HURT: 0 This helps prevent regressions in later commits. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	3595a0abf4	nir/constant_folding: Fix source bit size logic Instead of looking at input_sizes[i] which contains the number of components for each source, we look at the bit size of input_types[i]. This fixes a regression in the 1-bit boolean series though I have no idea how we haven't seen it before now. Fixes: `35baee5dce` "nir/constant_folding: fix incorrect bit-size check" Fixes: `9076c4e289` "nir: update opcode definitions for different bit sizes" Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	9f7bd843af	nir/tgsi: Use nir_bany in ttn_kill_if Reviewed-by: Eric Anholt <eric@anholt.net>	2018-12-16 21:03:02 +00:00
Jason Ekstrand	e17426058c	nir/lower_idiv: Use ilt instead of bit twiddling The previous code was creating a boolean by doing an arithmetic right- shift by 31 which produces a boolean which is true if the argument is negative. This is the same as the expression r < 0 which is much simpler and doesn't depend on NIR's representation of booleans. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-12-16 21:03:02 +00:00
Eric Anholt	2977c77758	v3d: Use the original bit size when scalarizing uniform loads. Prevents a regression in jekstrand's 1-bit series. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-16 21:03:01 +00:00
Eric Anholt	91a0251dbc	vc4: Use the original bit size when scalarizing uniform loads. Prevents a regression in jekstrand's 1-bit series. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-16 21:03:01 +00:00
Rhys Perry	bde9f482de	ac: split 16-bit ssbo loads that may not be dword aligned Fixes: `7e7ee82698` ('ac: add support for 16bit buffer loads') Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108114 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-16 14:56:10 +00:00
Rhys Perry	12dc7cb202	ac: refactor visit_load_buffer This is so that we can split different types of loads more easily. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-16 14:56:10 +00:00
Rhys Perry	ed4020fabe	nir: fix constness in nir_intrinsic_align() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-16 14:56:10 +00:00
Jan Vesely	e4f9a37ace	clover: Fix build after clang r348827 CodeGenOptions were moved to Basic. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> Tested-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Kai Wasserbäch <kai@dev.carbon-project.org> CC: mesa-stable@lists.freedesktop.org	2018-12-16 06:38:10 -05:00
Jon Turney	d512b35b62	glx: Fix compilation with GLX_USE_WINDOWSGL Sadly, the GLX_USE_APPLEGL and GLX_USE_WINDOWSGL cases are not identical (because GLX_USE_WINDOWSGL uses vtables rather than a maze of ifdefs) Include <sys/time.h> again, as functions prototyped by it are used in the GLX_USE_WINDOWSGL path. Make the include guard around the __glxGetMscRate() definition match the one at it's declaration again, as it's referenced from dri_common.c which is built for GLX_USE_WINDOWSGL. Fixes: `a95ec138` ("glx: mandate xf86vidmode only for "drm" dri platforms") Signed-off-by: Jon Turney <jon.turney@dronecode.org.uk> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-15 13:49:24 +00:00
Eric Anholt	29927e7524	v3d: Drop in a bunch of notes about performance improvement opportunities. These have all been floating in my head, and while I've thought about encoding them in issues on gitlab once they're enabled, they also make sense to just have in the area of the code you'll need to work in.	2018-12-14 17:48:01 -08:00
Eric Anholt	248a7fb392	v3d: Do uniform pretty-printing in the QPU dump. If you're trying to trace what's going on in a QPU dump, this will definitely help you find your way.	2018-12-14 17:48:01 -08:00
Eric Anholt	a370ed76ab	v3d: Use the uniform pretty-printer in v3d_write_uniforms()'s debug code. This will be a lot easier than my usual "38400.000000? that looks like a viewport scale" decoding strategy.	2018-12-14 17:48:01 -08:00
Eric Anholt	532b6c5671	v3d: Move uniform pretty-printing to its own helper function. I want to reuse it in the QPU dump.	2018-12-14 17:48:01 -08:00
Eric Anholt	78ef05bde4	v3d: Move uinfo->data[] dereference to the top of v3d_write_uniforms(). Follows `3954331aff` ("vc4: Pull uinfo->data[i] dereference out to the top of the loop.") which showed a large performance win for vc4, but also cleans up the code a decent bit.	2018-12-14 17:48:01 -08:00
Eric Anholt	a7e15a5086	v3d: Avoid assertion failures when removing end-of-shader instructions. After generating VIR, we leave c->cursor pointing at the end of the shader. If the shader had dead code at the end (for example from preamble instructions in a shader with no side effects), we would assertion fail that we were leaving the cursor pointing at freed memory. Since anything following DCE should be setting up a new cursor anyway, just clear the cursor at the start.	2018-12-14 17:48:01 -08:00
Eric Anholt	5b2cc03852	v3d: Add support for draw indirect for GLES3.1. In trying to enable compute shaders, I found that a bunch of deqp-gles31's compute stuff wanted to interact with indirect dispatch. This was easy to do on its own.	2018-12-14 17:48:01 -08:00
Eric Anholt	ff80e58b38	v3d: Add missing flagging of SYNCB as a TSY op. Fixes: `f2e41daac5` ("broadcom/vc5: Update QPU instruction pack/unpack for v4.2.")	2018-12-14 17:48:01 -08:00
Eric Anholt	3f9bcf9136	v3d: Make sure that a thrsw doesn't split a multop from its umul24. The thrsw will invalidate rtop, just like accumulators and flags. Caught by simulator assertions in CS imulextended/umulextended tests. Fixes: `90269ba353` ("broadcom/vc5: Use THRSW to enable multi-threaded shaders.")	2018-12-14 17:48:01 -08:00
Eric Anholt	332a5cf6a5	v3d: Add safety checks for resource_create(). This should ease my debugging next time I screw it up.	2018-12-14 17:48:01 -08:00
Eric Anholt	6ad9e8690d	v3d: Add support for texturing from linear. Just like vc4, we have to support linear shared BOs for X11 on arbitrary displays. When we're faced with a request to texture from one of those, make a shadow image that we copy using the TFU at the start of the draw call.	2018-12-14 17:48:01 -08:00
Eric Anholt	976ea90bdc	v3d: Add support for using the TFU to do some blits. This will be useful in particular for blits from raster to UIF for X11.	2018-12-14 17:48:01 -08:00
Eric Anholt	e5b4d1f55f	v3d: Don't forget to bump the number of writes when doing TFU ops. generatemipmap is just filling out the rest of the mipmap that's already been written (by a mapping or a draw call), so it didn't matter. As I reuse the TFU code for linear-to-UIF conversions, it'll start mattering.	2018-12-14 17:48:01 -08:00
Eric Anholt	485df2574e	v3d: Set up the right stride for raster TFU. I didn't have any raster images in the generatemipmap path, so the pixels-vs-bytes mixup didn't matter here.	2018-12-14 17:48:01 -08:00
Eric Anholt	e731d53716	v3d: Don't forget to wait for our TFU job before rendering from it. Otherwise we may race to read old contents. This didn't show up in the CTS and piglit for me, but it did once I started using the TFU to do linear->UIF blits for X11. Fixes: `2ebca177dc` ("v3d: Use the TFU to do generatemipmap.")	2018-12-14 17:48:01 -08:00
Ilia Mirkin	153d3fc5f9	nvc0: always keep TSC slot 0 bound to fix TXF Same as on nv50, the TXF op always uses the TSC bound to slot 0, returning blank values if nothing is bound. An earlier change arranges for the TSC entries list to always have valid data at entry 0, so here we just make use of it. Fixes arb_texture_buffer_object-subdata-sync among others. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-14 20:01:31 -05:00
Ilia Mirkin	4aeaf89aa7	nvc0: replace use of explicit default_tsc with entry 0 This was used for implementing FBFETCH. However that uses TXF, which doesn't do much with a TSC. The only important bit is that sRGB-decoding works as expected, which we can achieve since all samplers we ever generate enable sRGB-decoding. Always point to entry 0 in the TSC table, and ensure that even before it ever gets initialized, the sRGB-decoding enable bit is set. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-14 20:01:31 -05:00
Rob Clark	5f9085638a	freedreno/a6xx: fix corrupted uniforms For older gen's fd_wfi() is used to conditionally insert a WFI if there hasn't already been one since last draw. But this doesn't work out well with stateobj since the order the stateobj is evaluated might not be what you expect. (Ie. stateobj might not be evaluated until a later draw if there is no geometry from the current draw in a given tile.) Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-14 15:01:30 -05:00
Alex Deucher	4db4b3447d	pci_ids: add new vega20 pci id Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> Cc: mesa-stable@lists.freedesktop.org	2018-12-14 14:48:39 -05:00
Alex Deucher	56cf25a114	pci_ids: add new vega10 pci ids Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> Cc: mesa-stable@lists.freedesktop.org	2018-12-14 14:48:18 -05:00
Rafael Antognolli	5c454661c6	i965/gen9: Add workarounds for object preemption. Gen9 hardware requires some workarounds to disable preemption depending on the type of primitive being emitted. We implement this by adding a function that checks the primitive type and number of instances right before the 3DPRIMITIVE. For now, we just ignore blorp. The only primitive it emits is 3DPRIM_RECTLIST, and since it's not listed in the workarounds, we can safely leave preemption enabled when it happens. Or it will be disabled by a previous 3DPRIMITIVE, which should be fine too. v3: - Apply missing workarounds for instanced rendering and line loop (Ken) - Move workaround code to brw_draw_single_prim() Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Cc: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-12-14 09:40:27 -08:00
Rafael Antognolli	d8b50e152a	i965/gen10+: Enable object level preemption. Set bit when initializing context. v3: - Always toggle preemption bool to false before enabling it for the first time, so the state gets emitted (Chris Wilson). - Emit end of pipe sync with PIPE_CONTROL_RENDER_TARGET_FLUSH (Ken) Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-12-14 09:40:27 -08:00
Rafael Antognolli	019a92ffa4	intel/genxml: Add register for object preemption. Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-12-14 09:40:27 -08:00
Ian Romanick	a6b7d1151c	util/slab: Rename slab_mempool typed parameters to mempool Now everything with type 'struct slab_child_pool ' is name pool, and everything with type 'struct slab_mempool ' is named mempool. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2018-12-14 07:36:05 -08:00
Ian Romanick	ba5402ec9a	nir/phi_builder: Internal users should use nir_phi_builder_value_set_block_def too Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-14 07:36:05 -08:00
Christian Gmeiner	489ffaf0c1	etnaviv: drop redundant ctx function parameter There is no need to have an extra ctx paramter as all the other parameters carry all the needed information. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de>	2018-12-14 11:23:00 +01:00
Kenneth Graunke	0b44644ca6	genxml: Consistently use a numeric "MOCS" field When we first started using genxml, we decided to represent MOCS as an actual structure, and pack values. However, in many places, it was more convenient to use a numeric value rather than treating it as a struct, so we added secondary setters in a bunch of places as well. We were not entirely consistent, either. Some places only had one. Gen6 had both kinds of setters for STATE_BASE_ADDRESS, but newer gens only had the struct-based setters. The names were sometimes "Constant Buffer Object Control State" instead of "Memory", making it harder to find. Many had prefixes like "Vertex Buffer MOCS"...in a vertex buffer packet...which is a bit redundant. On modern hardware, MOCS is simply an index into a table, but we were still carrying around the structure with an "Index to MOCS Table" field, in addition to the direct numeric setters. This is clunky - we really just want a number on new hardware. This patch eliminates the struct-based setters, and makes the numeric setters be consistently called "MOCS". We leave the struct definition around on Gen7-8 for reference purposes, but it is unused. v2: Drop bonus "Depth Buffer MOCS" fields on Gen7.5 and Gen9 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2018-12-14 00:44:54 -08:00
Timothy Arceri	a2ec78883f	nir: fix opt_if_loop_last_continue() The pass did not correctly handle loops ending in: if ssa_7 { block block_8: /* preds: block_7 / continue / succs: block_1 / } else { block block_9: / preds: block_7 / break / succs: block_11 */ } The break will get eliminated by another opt but if this pass gets called first (as it does on RADV) we ended up inserting instructions after the break. Fixes: `5921a19d4b` ("nir: add if opt opt_if_loop_last_continue()") Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-12-14 17:21:35 +11:00
Rob Clark	0ac5acaeaa	freedreno/a6xx: fix resource_copy_region() pctx->resource_copy_region() needs to fall back to sw copy for non-renderable formats. But previously for things that we could not use the blitter for, would fall back to 3d. Which won't work if 3d can't render to the dst format either. Instead rework things to fallback to fd_resource_copy_region(), which will try 3d core and then fall back to memcpy(). Fixes (for example) dEQP-GLES3.functional.texture.format.sized.2d.rgb9_e5_pot Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	4ec2f6129b	freedreno: move fd_resource_copy_region() Code-motion prep for next patch. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	57b76ee2a8	freedreno/a6xx: more blitter fixes Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	d15fc787bc	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	532f8c0043	gallium/aux: add is_unorm() helper We already had one for is_snorm() but not unorm. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	85cd4df47f	freedreno/a6xx: fix blitter crash Fixes a crash with unsupported formats in dEQP-GLES3.functional.texture.format.sized.2d.rgb9_e5_pot Also fixes gpu hangs with some formats that are supported, but which we don't know what internal-format to use for the blitter, for ex dEQP-GLES3.functional.texture.format.sized.2d_array.rgb10_a2_pot Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	cca1e9606c	freedreno/ir3: don't remove unused input components Fixes: `0d240c2214` freedreno/ir3: don't fetch unused tex components Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	c19c4bf488	freedreno/ir3: fix crash Fixes a crash in dEQP-GLES3.functional.shaders.fragdepth.compare.fragcoord_z Fixes: `0d240c2214` freedreno/ir3: don't fetch unused tex components Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	3e8e033f4c	freedreno: also set DUMP flag on shaders If we emit shader as a pointer to a GEM object, also set the RELOC_DUMP flag as a hint to kernel that this is a useful buffer to snapshot for debug dumps. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	4cd016b5d6	freedreno: debug GEM obj names With a recent enough kernel, set debug names for GEM BOs, which will show up in $debugfs/gem Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	7ef722861b	freedreno/drm: sync uapi and enable softpin Pull in updated UAPI and use kernel API version to enable softpin. Since MSM_SUBMIT_BO_DUMP flag was added at same time, use that to signal to kernel that cmdstream buffers are useful to dump for debugging/cmdstream-traces. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Eric Anholt	4407e688cd	nir: Move intel's half-float image store lowering to to nir_format.h. I needed the same function for v3d. This was originally in `d3e046e76c` ("nir: Pull some of intel's image load/store format conversion to nir_format.h") before we made am istake about simplifying the function. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-13 12:24:26 -08:00
Eric Anholt	3a417a044e	Revert "intel: Simplify the half-float packing in image load/store lowering." This reverts commit `06fbcd2cd5`. nir_pack_half_2x16_split isn't vectorizable, it's 1-component only, thus why we had this split-scalar code in the first place. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-13 12:24:24 -08:00
Eric Anholt	c2c44dba7a	nir: Print the format of image variables. This helps a lot when debugging image load/store lowering on large testcases. Unfortunately the Mesa enum name stuff is under src/mesa and we can't get at it from the compiler. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-13 12:24:12 -08:00
Eric Anholt	19ffcba161	mesa/st: Expose compute shaders when NIR support is advertised. We have a NIR path, and V3D doesn't have TGSI input for compute (only what TTN can handle for the various gallium-internal shaders). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2018-12-13 11:44:47 -08:00
Dave Airlie	b3f2b03ece	radv/xfb: fix counter buffer bounds checks. If we gave this function 0 counter buffers, we'd still try and access pCounterBuffers[0] as this check was incorrect. Fixes crash with ext_transform_feedback-pipeline-basic-primgen on zink on radv. Fixes: `677b496b6` (radv: fix begin/end transform feedback with 0 counter buffers.) Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-13 19:27:05 +00:00
Jason Ekstrand	9ebc00f32e	i965: Enable nir_opt_idiv_const for 32 and 64-bit integers The pass should work for all bit sizes but it's less clear that the extra instructions are worth it on small integers. Also, the hardware doesn't do mul_high on anything other than 32-bit integers and, absent any decent mechanism for testing the pass on 8 and 16-bit types, it's probably best to just leave it disabled for now. Shader-db results on Sky Lake: total instructions in shared programs: 15105795 -> 15111403 (0.04%) instructions in affected programs: 72774 -> 78382 (7.71%) helped: 0 HURT: 265 Note that hurt here actually means helped because we're getting rid of integer quotient operations (which are a send on some platforms!) and replacing them with fairly cheap ALU ops. Reviewed-by: Ian Romanick ian.d.romanick@intel.com	2018-12-13 17:49:48 +00:00
Jason Ekstrand	455ec7327d	i965/vec4: Implement nir_op_uadd_sat Reviewed-by: Ian Romanick ian.d.romanick@intel.com	2018-12-13 17:49:48 +00:00
Ian Romanick	e639d39faf	i965/fs: Implement nir_op_uadd_sat Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-13 17:49:48 +00:00
Jason Ekstrand	74492ebad9	nir: Add a pass for lowering integer division by constants It's a reasonably well-known fact in the world of compilers that integer divisions by constants can be replaced by a multiply, an add, and some shifts. This commit adds such an optimization to NIR for easiest case of udiv. Other division operations will be added in following commits. In order to provide some additional driver control, the pass takes a minimum bit size to optimize. Reviewed-by: Ian Romanick ian.d.romanick@intel.com	2018-12-13 17:49:48 +00:00
Ian Romanick	090e282407	nir: Add a saturated unsigned integer add opcode Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-13 17:49:48 +00:00
Jason Ekstrand	39198a1238	nir/lower_int64: Add support for [iu]mul_high Reviewed-by: Ian Romanick ian.d.romanick@intel.com	2018-12-13 17:49:48 +00:00
Jason Ekstrand	9525971e2b	nir: Allow [iu]mul_high on non-32-bit types Reviewed-by: Ian Romanick ian.d.romanick@intel.com	2018-12-13 17:49:48 +00:00
Emil Velikov	a95ec13879	glx: mandate xf86vidmode only for "drm" dri platforms Currently we have the three dri "platforms" - drm, apple and windows. Since xf86vidmode is a thing only for the drm one, adjust the preprocessor guards and correctly check for the dependency. v2: terminate the GLX_USE_WINDOWSGL hunk Cc: Jon TURNEY <jon.turney@dronecode.org.uk> Fixes: `5bc509363b` ("glx: make xf86vidmode mandatory for direct rendering") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2018-12-13 17:38:19 +00:00
Alejandro Piñeiro	c7bdcd67aa	nir: remove unused variable To avoid the following warning: ./src/compiler/nir/nir_loop_analyze.c:807:16: warning: unused variable ‘ns’ [-Wunused-variable] nir_shader *ns = impl->function->shader; Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-13 16:35:21 +01:00
Erik Faye-Lund	e888f28d1f	virgl: work around bad assumptions in virglrenderer Virglrenderer does the wrong thing when given an instance divisor; it tries to use the element-index rather than the binding-index as the argument to glVertexBindingDivisor(). This worked fine as long as there was a 1:1 relationship between elements and bindings, which was the case util `19a91841c3` "st/mesa: Use Array._DrawVAO in st_atom_array.c.". So let's detect instance divisors, and restore a 1:1 relationship in that case. This will make old versions of virglrenderer behave correctly. For newer versions, we can consider making a better interface, where the instance divisor isn't specified per element, but rather per binding. But let's save that for another day. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Fixes: `19a91841c3` "st/mesa: Use Array._DrawVAO in st_atom_array.c." Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de> Tested-By: Gert Wollny <gert.wollny@collabora.com>	2018-12-13 16:12:10 +01:00
Erik Faye-Lund	8447b64238	virgl: wrap vertex element state in a struct This just has one member for now; the handle. But this is about to change. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de> Tested-By: Gert Wollny <gert.wollny@collabora.com>	2018-12-13 16:12:10 +01:00
Erik Faye-Lund	b702ff5378	virgl: simplify virgl_hw_set_index_buffer Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de> Tested-By: Gert Wollny <gert.wollny@collabora.com>	2018-12-13 16:12:10 +01:00
Erik Faye-Lund	00143a6241	virgl: simplify virgl_hw_set_vertex_buffers Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de> Tested-By: Gert Wollny <gert.wollny@collabora.com>	2018-12-13 16:12:10 +01:00
Juan A. Suarez Romero	0991085f66	docs: update calendar, add news item and link release notes for 18.2.7 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-12-13 15:45:20 +01:00
Juan A. Suarez Romero	e0b0995dcf	docs: add sha256 checksums for 18.2.7 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `e90429cc6d`)	2018-12-13 15:42:49 +01:00
Juan A. Suarez Romero	c8a17b45ea	docs: add release notes for 18.2.7 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `419ee20097`)	2018-12-13 15:42:46 +01:00
Samuel Pitoiset	5088ba2aeb	radv: don't check if format is depth in radv_image_can_enable_hile() This is always TRUE if htile_size is not 0. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-13 09:21:21 +01:00
Samuel Pitoiset	eb0034fe28	radv: check if addrlib enabled HTILE in radv_image_can_enable_htile() When hile_size is 0, we can't enable HTILE. This doesn't change anything, except not calling radv_image_alloc_htile(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-13 09:21:19 +01:00
Samuel Pitoiset	d8325f1f07	radv: switch on EOP when primitive restart is enabled with triangle strips Otherwise, Yakuza hangs the GPU with DXVK. We don't know if linetrip and pointlist are affected, so my point is to do that only for triangle strips. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-13 09:21:16 +01:00
Samuel Pitoiset	74cf3b627c	radv: allow to skip DCC decompressions with the new predicate Feral games aren't affected because they don't decompress DCC. F1 2018 has one DCC decompression per frame, but I don't see any performance improvements. This new predicate will be probably more useful for DCC/MSAA. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-13 09:21:14 +01:00
Samuel Pitoiset	3a5adc2879	radv: add a predicate for reflecting DCC decompression state It's somehow similar to the FCE predicate. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-13 09:21:10 +01:00
Jordan Justen	c506eae53d	i965/compute: Emit GPGPU_WALKER in genX_state_upload Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-12 22:28:06 -08:00
Jordan Justen	1b85c605a6	i965/genX_state: Add register access functions Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-12 22:28:02 -08:00
Eric Anholt	06fbcd2cd5	intel: Simplify the half-float packing in image load/store lowering. This was noted by Jason in review when I tried to make a helper for the old path. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-12 16:09:48 -08:00
Eric Anholt	d3e046e76c	nir: Pull some of intel's image load/store format conversion to nir_format.h I needed the same functions for v3d. Note that the color value in the Intel lowering has already been cut down to image.chans num_components. v2: Drop the half float one, since it was a 1-liner after cleanup. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-12 16:09:43 -08:00
Eric Anholt	19c7cba2ab	nir: Add some more consts to the nir_format_convert.h helpers. Most of the bits were constant, but a few were missed. Avoids warnings from v3d's upcoming static const bits declarations. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-12 16:09:37 -08:00
Timothy Arceri	9e6b39e1d5	nir: detect more induction variables This allows loop analysis to detect inductions variables that are incremented in both branches of an if rather than in a main loop block. For example: loop { block block_1: /* preds: block_0 block_7 / vec1 32 ssa_8 = phi block_0: ssa_4, block_7: ssa_20 vec1 32 ssa_9 = phi block_0: ssa_0, block_7: ssa_4 vec1 32 ssa_10 = phi block_0: ssa_1, block_7: ssa_4 vec1 32 ssa_11 = phi block_0: ssa_2, block_7: ssa_21 vec1 32 ssa_12 = phi block_0: ssa_3, block_7: ssa_22 vec4 32 ssa_13 = vec4 ssa_12, ssa_11, ssa_10, ssa_9 vec1 32 ssa_14 = ige ssa_8, ssa_5 / succs: block_2 block_3 / if ssa_14 { block block_2: / preds: block_1 / break / succs: block_8 / } else { block block_3: / preds: block_1 / / succs: block_4 / } block block_4: / preds: block_3 / vec1 32 ssa_15 = ilt ssa_6, ssa_8 / succs: block_5 block_6 / if ssa_15 { block block_5: / preds: block_4 / vec1 32 ssa_16 = iadd ssa_8, ssa_7 vec1 32 ssa_17 = load_const (0x3f800000 / 1.000000/) / succs: block_7 / } else { block block_6: / preds: block_4 / vec1 32 ssa_18 = iadd ssa_8, ssa_7 vec1 32 ssa_19 = load_const (0x3f800000 / 1.000000/) / succs: block_7 / } block block_7: / preds: block_5 block_6 / vec1 32 ssa_20 = phi block_5: ssa_16, block_6: ssa_18 vec1 32 ssa_21 = phi block_5: ssa_17, block_6: ssa_4 vec1 32 ssa_22 = phi block_5: ssa_4, block_6: ssa_19 / succs: block_1 */ } Unfortunatly GCM could move the addition out of the if for us (making this patch unrequired) but we still cannot enable the GCM pass without regressions. This unrolls a loop in Rise of The Tomb Raider. vkpipeline-db results (VEGA): Totals from affected shaders: SGPRS: 88 -> 96 (9.09 %) VGPRS: 56 -> 52 (-7.14 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 2168 -> 4560 (110.33 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 4 -> 4 (0.00 %) Wait states: 0 -> 0 (0.00 %) Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=32211	2018-12-13 10:58:35 +11:00
Timothy Arceri	c03d6e61cc	nir: reword code comment Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-12-13 10:58:35 +11:00
Timothy Arceri	48b40380e3	nir: in loop analysis track actual control flow type This will allow us to improve analysis to find more induction variables. Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-12-13 10:58:35 +11:00
Danylo Piliaiev	5921a19d4b	nir: add if opt opt_if_loop_last_continue() Removing the last continue can allow more loops to unroll. Also inserting code into the if branch can allow the various if opts to progress further. The insertion of some loops into the if branch also reduces VGPR use in some shaders. vkpipeline-db results (VEGA): Totals from affected shaders: SGPRS: 6552 -> 6576 (0.37 %) VGPRS: 6544 -> 6532 (-0.18 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 481952 -> 478032 (-0.81 %) bytes LDS: 13 -> 13 (0.00 %) blocks Max Waves: 241 -> 242 (0.41 %) Wait states: 0 -> 0 (0.00 %) Shader-db results radeonsi (VEGA): Totals from affected shaders: SGPRS: 168 -> 168 (0.00 %) VGPRS: 144 -> 140 (-2.78 %) Spilled SGPRs: 157 -> 157 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 8524 -> 8488 (-0.42 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 7 -> 7 (0.00 %) Wait states: 0 -> 0 (0.00 %) v2: (Timothy Arceri): - allow for continues in either branch - move any trailing loops inside the if as well as blocks. - leave nir_opt_trivial_continues() to actually remove the continue. Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Signed-off-by: Timothy Arceri <tarceri@itsqueeze.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=32211	2018-12-13 10:58:35 +11:00
Timothy Arceri	721566bddb	nir: rework force_unroll_array_access() Here we rework force_unroll_array_access() so that we can reuse the induction variable detection in a following patch. Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-12-13 10:39:51 +11:00
Timothy Arceri	48135f175c	nir: factor out some of the complex loop unroll code to a helper Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-12-13 10:34:48 +11:00
Jordan Justen	7fe4e0ad5d	docs: Document GitLab merge request process (email alternative) This documents a process for using GitLab Merge Requests as an second way to submit code changes for Mesa. Only one of the two methods is allowed for each patch series. We will not require all patches to be emailed. Some code changes may be reviewed and merged without any discussion on the mesa-dev email list. v2: * No longer require email. Allow submitter to choose email or a GitLab merge request. * Various feedback from Brian, Daniel, Dylan, Eric, Erik, Jason, Matt, Michel and Rob. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Acked-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Rob Clark <robdclark@gmail.com>	2018-12-12 10:05:29 -08:00
Rhys Kidd	ff6f1dd0d3	meson: libfreedreno depends upon libdrm (for fence support) Error message building freedreno Gallium driver with meson: ../src/gallium/drivers/freedreno/freedreno_fence.c:27:21: fatal error: libsync.h: No such file or directory \#include <libsync.h> Fixes: `4aa69cc425` ("meson: build freedreno") Signed-off-by: Rhys Kidd <rhyskidd@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 09:01:06 -08:00
Jason Ekstrand	ca98902d09	nir: Document the function inlining process This has thrown a few people off recently and it's good to have the process and all the rational for it documented somewhere. A comment at the top of nir_inline_functions seems as good a place as any. Acked-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-12-12 08:32:32 -06:00
Jason Ekstrand	5749c0ebc4	intel/blorp: Assert that we don't re-layout a compressed surface Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-12 08:32:32 -06:00
Jason Ekstrand	e4fdc650f1	anv/pipeline: Set the correct binding count for compute shaders Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-12-12 08:32:25 -06:00
Samuel Pitoiset	2ac6d55f38	radv: bump reported version to 1.1.90 After going through the spec changelog, it looks like RADV is up to date. Note that ANV also reports 1.1.90. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-12 13:51:16 +01:00
Erik Faye-Lund	f856f50194	virgl: force linear texturing support When I made sure that half-float texture-filtering was required for ES3, I didn't realize that virgl doesn't report support for this correctly. This regressed the GLES version available on top of several drivers, including i965 from 3.2 to 2.0. This is going to need protocol changes to fix properly, so let's just restore the previous behavior by enabling floating-point filtering unconditionally for now. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Fixes: `fcf9fcee3c` "mesa/main: do not require float-texture filtering for es3" Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-12-12 11:44:47 +01:00
Iago Toral Quiroga	3918943211	intel/compiler: do not copy-propagate strided regions to ddx/ddy arguments The implementation of these opcodes in the generator assumes that their arguments are packed, and it generates register regions based on that assumption. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-12 08:09:45 +01:00
Jason Ekstrand	a10a450db2	anv: Advertise support for MinLod on Skylake+ These are usually used for dealing with sparse resources but there's no reason why we can't hook them up before we have sparse. We have the hardware; let's light it up. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-12-11 21:26:23 -06:00
Jason Ekstrand	cb98e0755f	intel/fs: Support min_lod parameters on texture instructions We have to lower some shadow instructions because they don't exist in hardware and we have to lower txb+offset+clamp because the message gets too big and we run into the sampler message length limit of 11 regs. Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2018-12-11 21:26:23 -06:00
Jason Ekstrand	4ef8f46fd1	nir/lower_tex: Add lowering for some min_lod cases Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-12-11 21:26:23 -06:00
Jason Ekstrand	4a691cfa7e	nir/lower_tex: Modify txd instructions instead of replacing them I don't know if one is better than the other or not but this approach has the advantage that we never forget to copy information over and we're not hard-coding quite as many assumptions. It's also a lot simpler and much less code. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-12-11 21:26:23 -06:00
Jason Ekstrand	5a968ae473	nir/lower_tex: Simplify lower_gradient logic Instead of having to call two different lower_gradient functions based on whether or not it's a cube, just make lower_gradient handle cubes. This significantly simplifies some of the logic. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-12-11 21:26:23 -06:00
Jason Ekstrand	caeffe7549	spirv: Add support for MinLod Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-12-11 21:26:23 -06:00
Jason Ekstrand	e1ef6c3c29	intel/ir: Don't allow allocating zero registers This simple check helps catch bugs early that can end up propagating into later stages of the compile and triggering strange asserts. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-12-11 21:26:23 -06:00
Roland Scheidegger	86c45fe960	gallivm: remove unused float coord wrapping for aos sampling AoS sampling tries to use integers for coord wrapping when possible, as it should be faster. However, for AVX, this was suboptimal, because only floats can use 8x32bit vectors, whereas integers have to be split into 4x32bit vectors. (I believe part of why it was slower was also that at least earlier llvm versions had trouble optimizing it properly, since you can still do simple bit ops with 8x32bit vectors, so a sequence of int add / and / int add / and with such vectors would actually end up doing 128bit inserts/extracts between the operations instead of just doing the cheap 128bit ands.) Hence, a special float coord wrapping path was added to AoS sampling. But this path was actually disabled for a long time already, since we found that just splitting everything before entering the AoS path was still sligthly faster usually, so none of this float coord wrapping code was used anymore (AoS sampling code, when avx2 isn't supported, never sees vectors with length > 4). I thought it might be useful some day again, but I'm not interested anymore in optimizing for very weird instruction sets which have support for 256bit vectors for floats but not for ints, so just drop it. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-12-12 03:50:03 +01:00
Emil Velikov	721c296bdc	docs: update calendar, add news item and link release notes for 18.3.1 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-11 21:25:18 +00:00
Emil Velikov	5391b65ed1	docs: add sha256 checksums for 18.3.1 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-11 21:21:42 +00:00
Emil Velikov	512bd8d3dd	docs: add release notes for 18.3.1 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-11 21:21:41 +00:00
Neil Roberts	8600aa35bd	freedreno: Add .dir-locals to the common directory The commit `aa0fed10d3` moved a bunch of Freedreno code to a common directory. The previous directory had a .dir-locals file for Emacs. This patch copies it to the new directory as well. Reviewed-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-12-11 13:14:08 -08:00
Rob Clark	cfe8220904	mesa/st/nir: fix missing nir_compact_varyings LinkedTransformFeedback is normally populated, which had nerf'd varying packing since the check was introduced. Fixes: `dbd52585fa` st/nir: Disable varying packing when doing transform feedback. Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-12-11 15:51:34 -05:00
Rob Clark	9e3fc0c1e0	nir: fix spelling typo Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-12-11 15:51:34 -05:00
Jason Ekstrand	8f401b0ce6	anv,radv: Disable VK_EXT_pci_bus_info The Vulkan working group recently discovered that we made a mistake in assuming that PCI domains are 16-bit even though they can potentially be 32-bit values. To fix this, the next spec update will change the types in the VK_EXT_pci_bus_info struct to be 32 bits which will be a backwards-incompatible change. Normally, Khronos tries very hard to never make backwards incompatible changes to specs. Hopefully, the extension is new enough (2 months) that there are no shipping apps which use the extension so this should be safe. This commit disables the extension for both anv and radv in mesa and should be back-ported to 18.3 ASAP so we avoid any potential issues with new apps running on old drivers. I'll send out a commit (which we can also back-port to 18.3 if we really care) to re-enable the extension in both drivers once this week's spec update ships. The one known use of this extension is internal to mesa and will continue working with the extension disabled and will naturally update when we get a new header. Cc: "18.3" <mesa-stable@lists.freedesktop.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-11 11:30:05 -06:00
Juan A. Suarez Romero	fb88dcf5ca	docs: extends 18.2 lifecycle As 18.3 was published with some delay, let's extend 18.2 life for another extra release. CC: Andres Gomez <agomez@igalia.com> CC: Dylan Baker <dylan@pnwbakers.com> CC: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Andres Gomez <agomez@igalia.com> Acked-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-11 15:20:10 +01:00
Kristian H. Kristensen	c0de7c21a3	glapi: fixup EXT_multisampled_render_to_texture dispatch There's a few missing and convoluted bits: - FramebufferTexture2DMultisampleEXT Missing sanity check, should be desktop="false" - RenderbufferStorageMultisampleEXT Missing sanity check, is aliased to RenderbufferStorageMultisample. Thus it's set only when desktop GL or GLES2 v3.0+, while the extension is GLES2 2.0+. If we flip the aliasing we'll break indirect GLX, so loosen the version to 2.0. Not perfect, yet this is the most sane thing I could think of. v2: [Emil] Fixup RenderbufferStorageMultisampleEXT, commmit message Cc: Kristian H. Kristensen <hoegsberg@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108974 Fixes: `1b331ae505` ("mesa: Add core support for EXT_multisampled_render_to_texture{,2}") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-10 15:09:07 -08:00
Kristian H. Kristensen	9578dde1c8	freedreno: Fix the Makefile.am fix Commit `b028ce29f0` fixed a typo in src/freedreno/Makefile.am, but ended up breaking the build for freedreno. The typo inadvertently made things work, as we were not supposed to link with libnir or libmesautil to begin with. Those come in through libmesagallium and the typo prevented the duplicated linkage. Fixes: `b028ce29f` ("freedreno: add the missing _la in libfreedreno_ir3_la") Cc: Emil Velikov <emil.velikov@collabora.com>	2018-12-10 14:28:09 -08:00
Matt Turner	f447a13032	i965/fs: Handle V/UV immediates in dump_instructions()	2018-12-10 10:46:56 -08:00
Sagar Ghuge	694eb342a2	intel/compiler: Always print flag subregister number While disassembling the predicate always print flag subregister number to keep grammar same across the generation for assembler tool. v2: Combine consecutive format calls (Matt Turner) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-12-10 10:07:11 -08:00
Sagar Ghuge	e7598c5a62	intel/compiler: Set swizzle to BRW_SWIZZLE_XXXX for scalar region When RepCtrl is set, the swizzle field is ignored by the hardware. In order to ensure a 1-to-1 correspondence between the human-readable disassembly and the binary instruction encoding always set the swizzle to XXXX (all zeros) when it is unused due to RepCtrl Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-12-10 10:06:55 -08:00
Dylan Baker	6d3cbbbe15	meson: Add nir_algebraic_parser_test to suites Just to make it easier to run a nir tests together. Fixes: `a0ae12ca91` ("nir/algebraic: Add unit tests for bitsize validation") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-12-10 09:14:44 -08:00
Emil Velikov	27c4fdfdf8	amd/addrlib: drop si_ci_vi_merged_enum.h from the list Fixes: `776b911365` ("amd/addrlib: update Mesa's copy of addrlib") Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-10 16:35:01 +00:00
Emil Velikov	b028ce29f0	freedreno: add the missing _la in libfreedreno_ir3_la Fixes: `aa0fed10d3` ("freedreno: move ir3 to common location") Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-10 16:35:01 +00:00
Emil Velikov	b30e37ec64	freedreno: drop duplicate MKDIR_GEN declaration Fixes: `aa0fed10d3` ("freedreno: move ir3 to common location") Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-10 16:35:01 +00:00
Rhys Kidd	05c7e726f7	travis: radeonsi and radv require LLVM 7.0 Fixes: `3fbdcd942f` ("amd: remove support for LLVM 6.0") Cc: Marek Olšák <marek.olsak@amd.com> Cc: Jan Vesely <jan.vesely@rutgers.edu> Cc: Andres Gomez <agomez@igalia.com> Cc: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Rhys Kidd <rhyskidd@gmail.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-10 16:20:12 +00:00
Kirill Burtsev	a539316485	loader: free error state, when checking the drawable type Currently we distinguish if the drawable is a window or pixmap by checking xcb_present_select_input throws an error or not. Yet, we don't always free the error state returned by xcb. Cc: Kirill Burtsev <kirill.burtsev@qt.io> Cc: Boyan Ding <boyan.j.ding@gmail.com> Fixes: `6bd9ba7d07` ("loader: Add dri3 helper") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> [Emil: add commit message, fixes tag] Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-10 16:19:55 +00:00
Timothy Arceri	032f247921	nir: make use of new nir_cf_list_clone_and_reinsert() helper Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-10 13:59:50 +11:00
Timothy Arceri	6b961eb534	nir: add a new nir_cf_list_clone_and_reinsert() helper Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-10 13:59:50 +11:00
Timothy Arceri	03d7c65ad8	nir: clarify some nit_loop_info member names Following commits will introduce additional fields such as guessed_trip_count. Renaming these will help avoid confusion as our unrolling feature set grows. Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-10 13:59:50 +11:00
Timothy Arceri	de0aee7638	nir: small tidy ups for nir_loop_analyze() Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-10 13:59:50 +11:00
Kenneth Graunke	41a4a6ba6f	i965: Flip arguments to load_register_reg helpers. load_register_imm and load_register_mem take the destination as the first argument, so I'd like load_register_reg to do the same the sake of consistency. Otherwise, reading sequences of mixed LRI/LRM/LRR is needlessly confusing. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-09 18:39:16 -08:00
Kenneth Graunke	34c9dc2537	i965: Delete dead brw_meta_resolve_color prototype. Dead since commit `09e041d61d` (May 2016).	2018-12-09 18:39:16 -08:00
Karol Herbst	77944fb2b7	nv50/ir: fix use-after-free in ConstantFolding::visit opnd() might delete the passed in instruction, but it's used through i->srcExists() later in visit v2: use continue instead return v3: use brackets for the outer if/else chain Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-09 18:19:59 +01:00
Karol Herbst	d63a133082	nouveau: use atomic operations for driver statistics multiple threads can write to those at the same time Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-09 04:43:20 +01:00
Karol Herbst	a28ff22295	nv50/ir: initialize relDegree staticly this race condition is pretty harmless, but also pretty trivial to fix Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-09 04:43:17 +01:00
Eric Anholt	cc6a5e937b	shader-packing	2018-12-07 16:51:12 -08:00
Eric Anholt	09ad0d870c	tfu	2018-12-07 16:49:41 -08:00
Eric Anholt	f1d98204c3	v3d: Fix a leak of the disassembled instruction string during debug dumps. Fixes: `ade416d023` ("broadcom: Add VC5 NIR compiler.")	2018-12-07 16:48:23 -08:00
Eric Anholt	7f8d8b7d27	vc4: Fix a leak of the transfer helper on screen destroy. Fixes: `d009463a65` ("vc4: Switch to using u_transfer_helper for MSAA maps.")	2018-12-07 16:48:23 -08:00
Eric Anholt	3bd73d31a8	v3d: Fix a leak of the transfer helper on screen destroy. Fixes: `7a30517cce` ("broadcom/vc5: Start adding support for rendering to Z32F_S8X24_UINT.")	2018-12-07 16:48:23 -08:00
Eric Anholt	bad95bb13c	v3d: Add VIR dumping of TMU config p0/p1. I had a bit of it for V3D 3.x, but didn't update it for 4.x.	2018-12-07 16:48:23 -08:00
Eric Anholt	1fc78ff3f1	v3d: Simplify VIR uniform dumping using a temporary.	2018-12-07 16:48:23 -08:00
Eric Anholt	5932575299	v3d: Garbage collect unused uniforms code.	2018-12-07 16:48:23 -08:00
Eric Anholt	62a3192112	v3d: Split most of TEXTURE_SHADER_STATE setup out of sampler views. For shader image load/store, we want most of this logic to be shared.	2018-12-07 16:48:23 -08:00
Eric Anholt	8cb1f3bab7	v3d: Avoid confusing auto-indenting in TEXTURE_SHADER_STATE packing Having "v3dx_pack() {" under each #if branch would confuse emacs's indenter.	2018-12-07 16:48:23 -08:00
Eric Anholt	ee9b758053	v3d: Fix handling of texture first_layer offsets for 3D textures. I think this bug predated adding v3d_layer_offset(). Noticed during an unrelated refactor.	2018-12-07 16:48:23 -08:00
Eric Anholt	acecee4c2d	v3d: Return the right gl_SampleMaskIn[] value. It's supposed to be the dispatched sample mask for this pixel, not the GL state's sample mask.	2018-12-07 16:48:23 -08:00
Eric Anholt	6870111051	v3d: Fix a comment typo	2018-12-07 16:48:23 -08:00
Eric Anholt	ca0e4ae4bc	v3d: Convert to using nir_src_as_uint() from const_value derefs. Follows `16870de8a0` ("nir: Use nir_src_is_const and nir_src_as_* in core code") to clean up v3d.	2018-12-07 16:48:23 -08:00
Eric Anholt	503b55c622	v3d: Don't forget to flush writes to UBOs. If someone did TF into a UBO, we might have left the TF job un-flushed at the point of reading.	2018-12-07 16:48:23 -08:00
Eric Anholt	504d06e4c1	v3d: Make an array for frag/vert texture state in the context. This simplifies a bunch of our texture handling, while introducing the slots necessary for adding new shader stages.	2018-12-07 16:48:23 -08:00
Eric Anholt	d1965344ac	v3d: Re-use the wrap mode uniform on V3D 3.3.	2018-12-07 16:48:23 -08:00
Eric Anholt	e94d034a38	v3d: Put default vertex attribute values into the state uploader as well. The default attributes are long-lived (the state struct is cached), and only 256 bytes each.	2018-12-07 16:48:23 -08:00
Eric Anholt	b38e4d313f	v3d: Create a state uploader for packing our shaders together. Shaders are usually quite short, and are private to the context. We can save memory and reduce the work the kernel needs to do at exec time by packing them together in a stream uploader for long-lived state.	2018-12-07 16:48:23 -08:00
Eric Anholt	1911888760	v3d: Update simulator cache flushing code to match the kernel better. We were missing the invalidate between bin and render (possibly relevant for SSBOs), and still trying to flush the nonexistent L2C on 3.3+.	2018-12-07 16:48:23 -08:00
Eric Anholt	2ebca177dc	v3d: Use the TFU to do generatemipmap. This is a separate, dedicated hardware unit for texture layout conversions and mipmap generation.	2018-12-07 16:48:23 -08:00
Eric Anholt	ee0549ff9a	v3d: Add the V3D TFU submit interface to the simulator. The TFU lets us format raster and SAND images into formats that can be read by the texture engine, and do mipmap generation. The UAPI comes from drm-next e69aa5f9b97f ("Merge tag 'drm-misc-next-2018-12-06' of git://anongit.freedesktop.org/drm/drm-misc into drm-next")	2018-12-07 16:48:23 -08:00
Eric Anholt	42652ea51e	v3d: Use combined input/output segments. The HW apparently has some issues (or at least a much more complicated VCM calculation) with non-combined segments, and the closed source driver also uses combined I/O. Until I get the last CTS failure resolved (which does look plausibly like some VPM stomping), let's use combined I/O too.	2018-12-07 16:48:23 -08:00
Eric Anholt	fb9bcf5602	v3d: Add missing OES_half_float_linear support. We were exposing ARB_texture_float, but apparently not the OES subset flag. Fixes regression from GLES3 support to GLES2. Fixes: `fcf9fcee3c` ("mesa/main: do not require float-texture filtering for es3")	2018-12-07 16:48:23 -08:00
Eric Anholt	90e98295a4	v3d: Add support for RGBA_SRGB along with BGRA_SRGB. This is the actual native format for the hardware, without swizzling. Noticed while debugging why GLES3 disappeared.	2018-12-07 16:48:23 -08:00
Kenneth Graunke	f0d51e81c9	intel/blorp: Expand blorp_address::offset to be 64 bits. In the softpin world, surface state base address may be a fixed 64-bit address (with no associated BO). It makes sense to store this in the offset field. But it needs to be the full size. We also update the clear color address to be consistently uint64_t everywhere so we can continue passing intel_miptree_get_clear_color a pointer to the blorp_address's offset field without type mismatches. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2018-12-07 16:35:51 -08:00
Rob Clark	d014af98b7	freedreno/drm: fix memory leak Fix an emberrasing memory leak with the non-softpin submit/rb implementation. Fixes: `f3cc0d2747` freedreno: import libdrm_freedreno + redesign submit Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 14:12:12 -05:00
Rob Clark	5c2c1f0a2d	freedreno/ir3: track max flow control depth for a5xx/a6xx Rather than just hard-coding BRANCHSTACK size. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 13:49:21 -05:00
Rob Clark	9517037bdc	freedreno/ir3: code-motion Split up ir3_compiler_nir.c a bit before starting to add new stuff for a6xx SSBO/image instructions. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 13:49:21 -05:00
Rob Clark	e37351fa57	freedreno/ir3: sync instr/disasm Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 13:49:21 -05:00
Rob Clark	0d240c2214	freedreno/ir3: don't fetch unused tex components Detect when a component of an (for example) texture fetch is unused and propagate the updated wrmask back to the parent instruction. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 13:49:21 -05:00
Rob Clark	b971afd19e	freedreno/a6xx: blitter fixes Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 13:49:21 -05:00
Rob Clark	237ae7daf2	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 13:49:21 -05:00
Rob Clark	e779725f0b	freedreno/drm: fix relocs in nested stateobjs If we have an reloc from stateobjA to stateobjB, we would previously leave stateobjB's bos out of the submit's bos table. Handle this case by copying into stateobjA's reloc_bos table. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 13:49:21 -05:00
Rob Clark	9f7c6c78bc	freedreno/a5xx+a6xx: remove unused fs/vs pvt mem copy/pasta from older gens Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 13:49:21 -05:00
Rob Clark	c500e7b747	gallium: fix typo Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 13:49:21 -05:00
Rob Clark	f6ad286c80	freedreno: remove unused fd_surface fields Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-07 13:49:21 -05:00
Nicolai Hähnle	4275cae95c	meson: link LLVM 'native' component when LLVM is available Linking against LLVM built with BUILD_SHARED_LIBS fails otherwise, as the component is required for the draw module. Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-07 16:26:14 +01:00
Connor Abbott	2845c49218	nir: Fixup algebraic test for variable-sized conversions b2i can now take any size boolean in preparation for 1-bit booleans, so the error message printed is slightly different. Fixes: `dca6cd9ce6` ("nir: Make boolean conversions sized just like the others") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108961 Cc: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-07 16:07:51 +01:00
Samuel Pitoiset	e8a383ce67	gallium: add missing PIPE_CAP_SURFACE_SAMPLE_COUNT default value Fixes: `2710c40e3c` ("gallium: Add new PIPE_CAP_SURFACE_SAMPLE_COUNT") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com>	2018-12-07 15:06:29 +01:00
Emil Velikov	96d4ecbb11	docs: update calendar, add news item and link release notes for 18.3.0 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-07 11:50:12 +00:00
Emil Velikov	0144bbdb98	docs: add sha256 checksums for 18.3.0 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `d81beab96a`)	2018-12-07 11:44:33 +00:00
Emil Velikov	b1e0336497	docs: update 18.3.0 release notes Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `d603cd9d84`)	2018-12-07 11:44:31 +00:00
Kristian H. Kristensen	3e55df4f83	freedreno: Add support for EXT_multisampled_render_to_texture There is not much to do in freedreno - tile layout and multisample state for gmem renderings is programmed based on the pfb sample count, while resolve blits take the destination sample count from the resource. Reviewed-by: Rob Clark <robdclark@gmail.com> Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-12-06 16:56:37 -08:00
Rob Clark	913eb7fa58	freedreno/a6xx: MSAA Reviewed-by: Kristian H. Kristensen <hoegsberg@chromium.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-06 16:55:59 -08:00
Kristian H. Kristensen	14ea811c67	st/mesa: Add support for EXT_multisampled_render_to_texture In gallium, we model the attachment sample count as a new nr_samples field in pipe_surface. A driver can indicate support for the extension using the new pipe cap, PIPE_CAP_MULTISAMPLED_RENDER_TO_TEXTURE. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-12-06 16:55:46 -08:00
Kristian H. Kristensen	2710c40e3c	gallium: Add new PIPE_CAP_SURFACE_SAMPLE_COUNT This new pipe cap and the new nr_samples field in pipe_surface lets a state tracker bind a render target with a different sample count than the resource. This allows for implementing EXT_multisampled_render_to_texture and EXT_multisampled_render_to_texture2. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-12-06 16:55:43 -08:00
Kristian H. Kristensen	1b331ae505	mesa: Add core support for EXT_multisampled_render_to_texture{,2} This also turns on EXT_multisampled_render_to_texture which is a subset of EXT_multisampled_render_to_texture2, allowing only COLOR_ATTACHMENT0. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-12-06 16:55:30 -08:00
Vinson Lee	b4fd59075b	nir/algebraic: Make algebraic_parser_test.sh executable. Fixes make check permission error. ../../bin/test-driver: line 107: ./nir/tests/algebraic_parser_test.sh: Permission denied FAIL nir/tests/algebraic_parser_test.sh (exit status: 126) Fixes: `a0ae12ca91` ("nir/algebraic: Add unit tests for bitsize validation") Signed-off-by: Vinson Lee <vlee@freedesktop.org>	2018-12-06 11:48:20 -08:00
Samuel Pitoiset	3fbdcd942f	amd: remove support for LLVM 6.0 User are encouraged to switch to LLVM 7.0 released in September 2018. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-06 14:02:56 +01:00
Kristian H. Kristensen	3b2ad8b290	gallium: Android build fixes A couple of simple fixes for building on Android with autotools. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-05 13:56:07 -08:00
Jason Ekstrand	dca6cd9ce6	nir: Make boolean conversions sized just like the others Instead of a single i2b and b2i, we now have i2b32 and b2iN where N is one if 8, 16, 32, or 64. This leads to having a few more opcodes but now everything is consistent and booleans aren't a weird special case anymore. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2018-12-05 15:03:07 -06:00
Jason Ekstrand	be98b1db38	nir/opt_algebraic: Add 32-bit specifiers to a bunch of booleans Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2018-12-05 15:03:03 -06:00
Jason Ekstrand	2715080d65	nir/opt_algebraic: Drop bit-size suffixes from conversions Suffixes are dropped from a bunch of conversion opcodes when it makes sense to do so. Others are kept if we really do want the bit-size restriction. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2018-12-05 15:03:01 -06:00
Jason Ekstrand	ff8e3d3b7b	nir/opt_algebraic: Simplify an optimization using the new search ops Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2018-12-05 15:02:58 -06:00
Jason Ekstrand	05af952a11	nir/algebraic: Add support for unsized conversion opcodes All conversion opcodes require a destination size but this makes constructing certain algebraic expressions rather cumbersome. This commit adds support to nir_search and nir_algebraic for writing conversion opcodes without a size. These meta-opcodes match any conversion of that type regardless of destination size and the size gets inferred from the sizes of the things being matched or from other opcodes in the expression. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2018-12-05 15:02:56 -06:00
Jason Ekstrand	4925290ab1	nir/algebraic: Refactor codegen a bit Instead of using an OrderedDict, just have a (necessarily sorted) array of transforms and a set of opcodes. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2018-12-05 15:02:54 -06:00
Jason Ekstrand	d6aac618fb	nir/algebraic: Clean up some __str__ cruft Both of these things are already handled in the Value base class so we don't need to handle them explicitly in Constant. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2018-12-05 15:02:52 -06:00
Jason Ekstrand	85f0ea9d8f	nir/opcodes: Rename tbool to tbool32 Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2018-12-05 15:02:49 -06:00
Jason Ekstrand	03571a7a6c	nir/opcodes: Pull in the type helpers from constant_expressions While we're at it, we rework them a bit to all use regular expressions and assert more. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2018-12-05 15:02:06 -06:00
Connor Abbott	a0ae12ca91	nir/algebraic: Add unit tests for bitsize validation The non-failure path can be tested by just compiling mesa and then testing it, but the failure paths won't be hit unless you make a mistake, so it's best to test them with some unit tests. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-05 17:57:40 +01:00
Connor Abbott	29a1450e28	nir/algebraic: Rewrite bit-size inference Before this commit, there were two copies of the algorithm: one in C, that we would use to figure out what bit-size to give the replacement expression, and one in Python, that emulated the C one and tried to prove that the C algorithm would never fail to correctly assign bit-sizes. That seemed pretty fragile, and likely to fall over if we make any changes. Furthermore, the C code was really just recomputing more-or-less the same thing as the Python code every time. Instead, we can just store the results of the Python algorithm in the C datastructure, and consult it to compute the bitsize of each value, moving the "brains" entirely into Python. Since the Python algorithm no longer has to match C, it's also a lot easier to change it to something more closely approximating an actual type-inference algorithm. The algorithm used is based on Hindley-Milner, although deliberately weakened a little. It's a few more lines than the old one, judging by the diffstat, but I think it's easier to verify that it's correct while being as general as possible. We could split this up into two changes, first making the C code use the results of the Python code and then rewriting the Python algorithm, but since the old algorithm never tracked which variable each equivalence class, it would mean we'd have to add some non-trivial code which would then get thrown away. I think it's better to see the final state all at once, although I could also try splitting it up. v2: - Replace instances of "== None" and "!= None" with "is None" and "is not None". - Rename first_src to first_unsized_src - Only merge the destination with the first unsized source, since the sources have already been merged. - Add a comment explaining what nir_search_value::bit_size now means. v3: - Fix one last instance to use "is not" instead of != - Don't try to be so clever when choosing which error message to print based on whether we're in the search or replace expression. - Fix trailing whitespace. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-05 17:57:40 +01:00
Samuel Pitoiset	49ef890733	radv: expose VK_EXT_scalar_block_layout Nothing to do, the compiler already handles that. All new dEQP.VK.ubo.* and dEQP.VK.ssbo.* pass, except some 16-bit tests that are quite related to fdo bug #108114. Only enable the extension on CIK+ because it might not work on SI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-05 17:38:20 +01:00
Samuel Pitoiset	c6465fec0c	spirv: add SpvCapabilityInt64Atomics Required for VK_KHR_shader_atomic_int64. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-05 14:39:55 +01:00
Michal Srb	63c0916ada	drisw: Use separate drisw_loader_funcs for shm The original code was modifying the global drisw_lf variable, which is bad when there are multiple contexts in single process, each initialized with different loader. One may support put_image_shm and the other not. Since there are currently only two possible combinations, lets create two global tables, one for each. Lets make them const, since we won't change them and they can be shared. This fixes crash in VLC. It used two GL contexts (each in different thread), one was initialized by its Qt GUI, the other by its video output plugin. The first one set the put_image_shm=drisw_put_image_shm, the second did not, but since the same structure was used, the drisw_put_image_shm was used too. Then it crashed because the second loader did not have putImageShm set. Downstream bug: https://bugzilla.opensuse.org/show_bug.cgi?id=1113533 v2: Added Fixes and described the VLC bug. Fixes: `63c427fa71` ("drisw: use putImageShm if available") Signed-off-by: Michal Srb <msrb@suse.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-05 13:16:09 +00:00
Michal Srb	c0ac038c97	gallium: Constify drisw_loader_funcs struct The content is not expected to change. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Michal Srb <msrb@suse.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-05 13:16:09 +00:00
Samuel Pitoiset	c7ada4901a	radv: wait on the high 32 bits of timestamp queries In case we are unlucky if the low part is 0xffffffff. Fixes: `5d6a560a29` ("radv: do not use the availability bit for timestamp queries") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-05 13:05:58 +01:00
Samuel Pitoiset	e899728769	radv: reset pending_reset_query when flushing caches If the driver used a compute shader for resetting a query pool, it should be completed when caches are flushed. This might reduce the number of stalls if operations are done between vkCmdResetQueryPool() and vkCmdBeginQuery() (or vkCmdWriteTimestamp()). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Alex Smith <asmith@feralinteractive.com>	2018-12-05 13:05:55 +01:00
Lionel Landwerlin	9a7b319903	anv/query: flush render target before copying results This change tracks render target writes in the pipeline and applies a render target flush before copying the query results to make sure the preceding operations have landed in memory before the command streamer initiates the copy. v2: Simplify logic in CopyQueryResults (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108909 Fixes: `37f9788e9a` ("anv: flush pipeline before query result copies") Cc: mesa-stable@lists.freedesktop.org	2018-12-05 11:43:34 +00:00
Alex Smith	c1b6cb068c	radv: Flush before vkCmdWriteTimestamp() if needed As done for vkCmdBeginQuery() already. Prevents timestamps from being overwritten by previous vkCmdResetQueryPool() calls if the shader path was used to do the reset. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108925 Fixes: `a41e2e9cf5` ("radv: allow to use a compute shader for resetting the query pool") Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-05 10:52:48 +00:00
Samuel Pitoiset	824cfc1ee5	radv: rework the TC-compat HTILE hardware bug with COND_EXEC After investigating on this, it appears that COND_WRITE doesn't work correctly in some situations. I don't know exactly why does it fail to update DB_Z_INFO.ZRANGE_PRECISION, but as AMDVLK also uses COND_EXEC I think there is a reason. Now the driver stores a new metadata value in order to reflect the last fast depth clear state. If a TC-compat HTILE is fast cleared with 0.0f, we have to update ZRANGE_PRECISION to 0 in order to work around that hardware bug. This fixes rendering issues with The Forest and DXVK and doesn't seem to introduce any regressions. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108914 Fixes: `68dead112e` ("radv: update the ZRANGE_PRECISION value for the TC-compat bug") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-05 09:26:31 +01:00
Dieter Nützel	2669dbf881	docs/features: Delete double nv50 entry and wrong enumeration trivial Fix commit `d9b2234042` Signed-off-by: Dieter Nützel <Dieter@nuetzel-hh.de> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-12-04 18:51:18 -05:00
Marek Olšák	5907412d04	st/mesa: expose EXT_render_snorm on GLES Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-04 15:33:29 -05:00
Marek Olšák	1660f3aa05	mesa: expose AMD_texture_texture4 because the closed driver exposes it. Tested by piglit. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-04 15:33:29 -05:00
Marek Olšák	908f817918	mesa: expose EXT_texture_compression_bptc in GLES tested by piglit. v2: rebase Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> (v1) Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2018-12-04 15:33:29 -05:00
Marek Olšák	34f07ddebb	mesa: expose EXT_texture_compression_rgtc on GLES The spec was modified to support GLES. Tested by piglit. v2: rebase Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> (v1) Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2018-12-04 15:33:29 -05:00
Erik Faye-Lund	91af56e383	mesa/main: fix up _mesa_has_rg_textures for gles2 rg-textures are supported in GLES 2.0 if EXT_texture_rg, so let's make sure the enums are accepted. Fixes: `510b642460` "mesa/main: do not allow rg-textures enums before gles3" Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108936 Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Tested-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-04 21:14:26 +01:00
Erik Faye-Lund	5bf38bfb64	mesa/main: correct validation for GL_RGB565 Technically speaking, this validation was incorrect, because GL_RGB565 is only supported in OpenGL ES 1.x if OES_framebuffer_object is supported. This couldn't lead to any real incorrect behavior, because all drivers support OES_framebuffer_object. But let's keep the code self-documenting, by correcting the check as per the spec. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-04 21:14:16 +01:00
Marek Olšák	4b218984d8	mesa: expose GL_EXT_texture_view as an alias of GL_OES_texture_view There are no spec changes. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-04 12:50:36 -05:00
Marek Olšák	d9b2234042	st/mesa: expose GL_OES_texture_view For format fallbacks like ETC and ASTC, switching between sRGB and linear decoding is undefined, or at least is not bit-exact. Same as EXT_texture_sRGB_decode on GLES. There are no piglit or dEQP regresssions. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-04 12:50:36 -05:00
Eric Engestrom	95d62baac5	loader: deduplicate logger function declaration Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-12-04 16:29:32 +00:00
Eric Engestrom	eade6ffeee	mesa: drop unused & deprecated lib DeprecationWarning: the imp module is deprecated in favour of importlib Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-04 16:26:21 +00:00
Eric Engestrom	919bec1c47	anv: add unreachable() for VK_EXT_fragment_density_map This silences the -Wswitch compiler warning. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-04 16:22:55 +00:00
Eric Engestrom	a0b14c1b02	meson: skip asm check when asm is disabled Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-04 16:22:51 +00:00
Andrii Simiklit	6ae873b97d	intel/tools: make sure the binary file is properly read 1. tools/i965_disasm.c:58:4: warning: ignoring return value of ‘fread’, declared with attribute warn_unused_result fread(assembly, *end, 1, fp); v2: Fixed incorrect return value check. ( Eric Engestrom <eric.engestrom@intel.com> ) v3: Zero size file check placed before fread with exit() ( Eric Engestrom <eric.engestrom@intel.com> ) v4: - Title is changed. - The 'size' variable was moved to top of a function scope. - The assertion was replaced by the proper error handling. - The error message on a caller side was fixed. ( Eric Engestrom <eric.engestrom@intel.com> ) Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-12-04 16:19:26 +00:00
Toni Lönnberg	d7b99ab947	intel/aubinator_error_decode: Get rid of warning for missing switch case ../src/intel/tools/aubinator_error_decode.c: In function ‘instdone_register_for_ring’: ../src/intel/tools/aubinator_error_decode.c:177:4: warning: enumeration value ‘I915_ENGINE_CLASS_INVALID’ not handled in switch [-Wswitch] switch (class) { ^~~~~~ Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-12-04 12:47:49 +00:00
Ilia Mirkin	bacf8471dc	nouveau: set texture upload budget It doesn't seem like the exact number has too much effect on the performaince in "teximage". However setting it to just about anything prevents some OOMs from getting hit. These values are not well-tuned, but don't seem too bad. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-03 23:11:29 -05:00
Ilia Mirkin	08c64fe7a1	nv50,nvc0: add explicit handling of PIPE_CAP_MAX_VERTEX_ELEMENT_SRC_OFFSET Since the max attrib stride is 2048, the max src offset makes sense as 2047. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-03 23:11:29 -05:00
Ilia Mirkin	de49e06507	nv50: always keep TSC slot 0 bound All TXF operations implicitly use sampler 0, and fail if it's not bound to anything. This does not happen in LINKED_TSC mode, but we don't currently use this. We ensure that TSC entry at id 0 has the SRGB conversion bit enabled (and all samplers we normally generate will too). Then when the TSC at slot 0 (not to be confused with entry 0 in the global TSC table) is unbound, we bind it to entry 0. This way, TXF operations are not dependent on there being a regular sampler bound there. Fixes arb_texture_buffer_object-subdata-sync among others. (TBO's are particularly susceptible to this as they don't bind a sampler.) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-03 23:11:29 -05:00
Dave Airlie	1363a47c9c	radv: use 3d shader for gfx9 copies if dst is 3d This fixes some crucible 3d miptree tests I've been working on when executed using the compute shader path. Fixes: `d08f267814` (radv/gfx9: fix 3d image to image transfers on compute queues.) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-04 10:42:31 +10:00
Bas Nieuwenhuizen	12e35a64c0	radv: Check for shareable images in central place. One place to put the logic makes things easier to change. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-04 01:21:38 +01:00
Bas Nieuwenhuizen	3bf48741e1	radv/android: Use buffer metadata to determine scanout compat. These days we don't always allocate scanout compatible textures anymore. That does mean we have to fix the radv android WSI though. Fixes: `b1444c9ccb` "radv: Implement VK_ANDROID_native_buffer." Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-04 01:21:38 +01:00
Bas Nieuwenhuizen	51091b3e1f	radv/android: Mark android WSI image as shareable. Fixes: `b1444c9ccb` "radv: Implement VK_ANDROID_native_buffer." Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-04 01:21:38 +01:00
Matt Turner	dd53bb7e1f	Revert "st/mesa: silenced unhanded enum warning in st_glsl_to_tgsi.cpp" This reverts commit `198c50f487`. This needs to be reverted after commit `017199d2d2` ("mesa: Revert INTEL_fragment_shader_ordering support")	2018-12-03 16:20:43 -08:00
Matt Turner	017199d2d2	mesa: Revert INTEL_fragment_shader_ordering support This extension is not properly tested (testing for GL_ARB_fragment_shader_interlock is not sufficient), and since this was noted in review on August 28th no tests have been sent. Revert "i965: Add INTEL_fragment_shader_ordering support." Revert "mesa: Add GL/GLSL plumbing for INTEL_fragment_shader_ordering" This reverts commit `03ecec9ed2`. This reverts commit `119435c877`. Cc: mesa-stable@lists.freedesktop.org Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Eric Anholt <eric@anholt.net>	2018-12-03 15:37:37 -08:00
Dave Airlie	e3f075439c	virgl: fix const warning on debug flags. Fixes: `8d4bb6e5c` (virgl: Add command and flags to initiate debugging on the host (v2))	2018-12-04 08:11:13 +10:00
Jason Ekstrand	71271e167b	vulkan: Update the XML and headers to 1.1.95 Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-03 14:27:10 -06:00
Tobias Klausmann	9401a2f2e6	amd/vulkan: meson build - use radv_deps for libvulkan_radeon Without this the build breaks with: FAILED: src/amd/vulkan/src@amd@vulkan@@vulkan_radeon@sha/radv_pipeline.c.o cc -Isrc/amd/vulkan/src@amd@vulkan@@vulkan_radeon@sha -Isrc/amd/vulkan -I../src/amd/vulkan -Isrc/../include -I../src/../include -Isrc -I../src -Isrc/mapi -I../src/mapi -Isrc/mesa -I../src/mesa -I../src/gallium/include -Isrc/gallium/auxiliary -I../src/gallium/auxiliary -Isrc/amd -I../src/amd -Isrc/amd/common -I../src/amd/common -Isrc/compiler -I../src/compiler -Isrc/vulkan/util -I../src/vulkan/util -Isrc/vulkan/wsi -I../src/vulkan/wsi -Isrc/compiler/nir -I../src/compiler/nir -I/usr/include -I/usr/include/libdrm -fdiagnostics-color=always -pipe -D_FILE_OFFSET_BITS=64 -Wall -Winvalid-pch -std=c99 -O2 -g '-DVERSION="18.3.0-rc5"' -DPACKAGE_VERSION=VERSION '-DPACKAGE_BUGREPORT="https://bugs.freedesktop.org/enter_bug.cgi?product=Mesa"' -DGLX_USE_TLS -DHAVE_ST_VDPAU -DENABLE_ST_OMX_BELLAGIO=0 -DENABLE_ST_OMX_TIZONIA=0 -DHAVE_X11_PLATFORM -DGLX_INDIRECT_RENDERING -DGLX_DIRECT_RENDERING -DGLX_USE_DRM -DHAVE_DRM_PLATFORM -DENABLE_SHADER_CACHE -DHAVE___BUILTIN_BSWAP32 -DHAVE___BUILTIN_BSWAP64 -DHAVE___BUILTIN_CLZ -DHAVE___BUILTIN_CLZLL -DHAVE___BUILTIN_CTZ -DHAVE___BUILTIN_EXPECT -DHAVE___BUILTIN_FFS -DHAVE___BUILTIN_FFSLL -DHAVE___BUILTIN_POPCOUNT -DHAVE___BUILTIN_POPCOUNTLL -DHAVE___BUILTIN_UNREACHABLE -DHAVE_FUNC_ATTRIBUTE_CONST -DHAVE_FUNC_ATTRIBUTE_FLATTEN -DHAVE_FUNC_ATTRIBUTE_MALLOC -DHAVE_FUNC_ATTRIBUTE_PURE -DHAVE_FUNC_ATTRIBUTE_UNUSED -DHAVE_FUNC_ATTRIBUTE_WARN_UNUSED_RESULT -DHAVE_FUNC_ATTRIBUTE_WEAK -DHAVE_FUNC_ATTRIBUTE_FORMAT -DHAVE_FUNC_ATTRIBUTE_PACKED -DHAVE_FUNC_ATTRIBUTE_RETURNS_NONNULL -DHAVE_FUNC_ATTRIBUTE_VISIBILITY -DHAVE_FUNC_ATTRIBUTE_ALIAS -DHAVE_FUNC_ATTRIBUTE_NORETURN -DUSE_SSE41 -DUSE_GCC_ATOMIC_BUILTINS -DUSE_X86_64_ASM -DMAJOR_IN_SYSMACROS -DHAVE_SYS_SYSCTL_H -DHAVE_LINUX_FUTEX_H -DHAVE_ENDIAN_H -DHAVE_DLFCN_H -DHAVE_STRTOF -DHAVE_MKOSTEMP -DHAVE_POSIX_MEMALIGN -DHAVE_TIMESPEC_GET -DHAVE_MEMFD_CREATE -DHAVE_STRTOD_L -DHAVE_DLADDR -DHAVE_DL_ITERATE_PHDR -DHAVE_ZLIB -DHAVE_PTHREAD -DHAVE_PTHREAD_SETAFFINITY -DHAVE_LIBDRM -DHAVE_LLVM=0x0600 -DMESA_LLVM_VERSION_PATCH=1 -DHAVE_WAYLAND_PLATFORM -DWL_HIDE_DEPRECATED -DHAVE_DRI3 -DHAVE_DRI3_MODIFIERS -Werror=implicit-function-declaration -Werror=missing-prototypes -Werror=return-type -fno-math-errno -fno-trapping-math -Wno-missing-field-initializers -Wno-format-truncation -O2 -Wall -D_FORTIFY_SOURCE=2 -fstack-protector-strong -funwind-tables -fasynchronous-unwind-tables -fstack-clash-protection -DNDEBUG -fPIC -pthread -D__STDC_FORMAT_MACROS -D_GNU_SOURCE -D__STDC_CONSTANT_MACROS -D__STDC_LIMIT_MACROS -fvisibility=hidden -Wno-override-init -DVK_USE_PLATFORM_XCB_KHR -DVK_USE_PLATFORM_XLIB_KHR -DVK_USE_PLATFORM_WAYLAND_KHR -DVK_USE_PLATFORM_DISPLAY_KHR -DVK_USE_PLATFORM_XLIB_XRANDR_EXT -MD -MQ 'src/amd/vulkan/src@amd@vulkan@@vulkan_radeon@sha/radv_pipeline.c.o' -MF 'src/amd/vulkan/src@amd@vulkan@@vulkan_radeon@sha/radv_pipeline.c.o.d' -o 'src/amd/vulkan/src@amd@vulkan@@vulkan_radeon@sha/radv_pipeline.c.o' -c ../src/amd/vulkan/radv_pipeline.c In file included from ../src/vulkan/util/vk_alloc.h:29, from ../src/amd/vulkan/radv_private.h:52, from ../src/amd/vulkan/radv_debug.h:27, from ../src/amd/vulkan/radv_pipeline.c:30: ../src/../include/vulkan/vulkan.h:54:10: fatal error: wayland-client.h: Datei oder Verzeichnis nicht gefunden #include <wayland-client.h> ^~~~~~~~~~~~~~~~~~ compilation terminated. The above command misses the include directory for wayland: -I/usr/include/wayland The missing include is contained in the (until now) unused radv_deps: if with_platform_wayland radv_deps += dep_wayland_client radv_flags += '-DVK_USE_PLATFORM_WAYLAND_KHR' libradv_files += files('radv_wsi_wayland.c') endif Fixes: `673dda8330` "meson: build "radv" vulkan driver for radeon hardware" Signed-off-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-03 09:18:48 -08:00
Erik Faye-Lund	fcf9fcee3c	mesa/main: do not require float-texture filtering for es3 The OpenGL ES 3.0 specification, table 3.13 lists half-float textures as filterable, but not float textures. So we shouldn't depend on ARB_float_texture, which requires full filtering support for both. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	43015b2a89	mesa/st: do not probe for the same texture-formats twice This should be equalent of what we did before. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	212d270b4e	mesa/main: require EXT_texture_sRGB for gles3 sRGB textures is a requirement for OpenGL ES 3.0, so let's make sure we don't incorrectly enable a too high version. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	487010a099	mesa/main: require EXT_texture_type_2_10_10_10_REV for gles3 OpenGL ES 3.0 require this functionality, so we should also test for it to avoid incorrectly exposing a too high GLES version. On desktop, this has been required since all the way back in OpenGL 1.2 anyway. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	74eab1c62f	mesa/main: split float-texture support checking in two On OpenGL ES 2.0, there's separate extensions adding support for half-float and float textures. So we need to validate the enums separately as well. This also prevents these enums from incorrectly being allowed on OpenGL ES 1.x, where there's no extension that enables this in the first place. While we're at it, remove the pointless default-case, and the seemingly stale fallthrough comment. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	c4136ed5cc	mesa/main: do not allow EXT_texture_sRGB_R8 enums before gles3 ctx->Extensions.EXT_texture_sRGB_R8 is set regardless of the API that's used, so checking for those direcly will always allow the enums from this extensions when they are supported by the driver. There's no extension adding support for this on OpenGL ES before version 3.0, so let's tighten the check. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-By: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	d972939986	mesa/main: do not allow sRGB texture enums before gles3 ctx->Extensions.EXT_texture_sRGB is set regardless of the API that's used, so checking for those direcly will always allow the enums from this extensions when they are supported by the driver. There's no extension adding support for this on OpenGL ES before version 3.0, so let's tighten the check. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	3629ee025c	mesa/main: do not allow snorm-texture enums before gles3 ctx->Extensions.EXT_texture_snorm is set regardless of the API that's used, so checking for those direcly will always allow the enums from this extensions when they are supported by the driver. There's no extension adding support for this on OpenGL ES before version 3.0, so let's tighten the check. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	52dc8b4f7b	mesa/main: do not allow floating-point texture enums on gles1 ctx->Extensions.OES_texture_float is set regardless of the API that's used, so checking for those direcly will always allow the enums from this extensions when they are supported by the driver. There's no extension enabling floating-point textures for OpenGL ES 1.x, so we shouldn't allow those enums there. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	167dcd59ae	mesa/main: do not allow type_2_10_10_10_REV enums before gles3 ctx->Extensions.EXT_texture_type_2_10_10_10_REV is set regardless of the API that's used, so checking for those direcly will always enable extensions when they are supported by the driver. There's no corresponding extension for OpenGL ES 1.x/2.0, so we shouldn't allow these enums there. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	b112e62ba4	mesa/main: do not allow MESA_ycbcr_texture enums on gles This extension requies OpenGL, and shouldn't be available on OpenGL ES. So let's not allow the enums from it either. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	1b2e9aca77	mesa/main: do not allow EXT_texture_shared_exponent enums before gles3 ctx->Extensions.EXT_texture_shared_exponent is set regardless of the API that's used, so checking for those direcly will always allow the enums from this extensions when they are supported by the driver. We also need to make sure this is enabled on OpenGL ES 3. Because the check is repeated, let's introduce a helper. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	510b642460	mesa/main: do not allow rg-textures enums before gles3 EXT_packed_float isn't supported on OpenGL ES, we shouldn't allow these enums there, before OpenGL ES 3.0 which also introduce support for these enums. Since this check is repeated a lot, let's make a helper for this. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	59690bf0a3	mesa/main: do not allow EXT_packed_float enums before gles3 EXT_packed_float isn't supported on OpenGL ES, we shouldn't allow these enums there, before OpenGL ES 3.0 which also introduce support for these enums. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	83db9d3e3a	mesa/main: do not allow ARB_depth_buffer_float enums before gles3 Floating-point depth buffers are only supported on OpenGL 3.0, OpenGL ES 3.0, or if ARB_depth_buffer_float is supported. Because we checked a driver capability rather than using an extension-check helper, we ended up incorrectly allowing this on OpenGL ES 1.x and 2.x. Since this logic is repeated, let's make a helper for it. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	3bbd543b6e	mesa/main: do not allow integer-texture enums before gles3 Integer textures shouldn't be implicitly exposed on OpenGL ES 1.x and 2.x, but because the code checked against a driver-capability rather than using an extension-check helper, we ended up accidentally allowing these enums on older versions when the driver supports it. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	b5a370dc25	mesa/main: do not allow ARB_texture_rgb10_a2ui enums before gles3 ARB_texture_rgb10_a2ui isn't supported on OpenGL ES, we shouldn't expose it there even if the driver supports it. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	76b038bee7	mesa/main: do not allow stencil-texture enums on gles1 ctx->Extensions.ARB_texture_stencil8 is set regardless of the API that's used, so checking for those direcly will always allow the enums from this extensions when they are supported by the driver. So let's instead check for both ARB_texture_stencil8 and OES_texture_stencil8, so we support depth textures on OpenGL and OpenGL ES 2.0+. There's no extension enabling stencil-textures for OpenGL ES 1.x, so we shouldn't allow those enums there. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	19eb0bf28f	mesa/main: do not allow depth-texture enums on gles1 ctx->Extensions.ARB_depth_texture is set regardless of the API that's used, so checking for those direcly will always allow the enums from this extensions when they are supported by the driver. So let's instead check for both ARB_depth_texture and OES_depth_texture, so we support depth textures on OpenGL and OpenGL ES 2.0+. There's no extension enabling depth-textures for OpenGL ES 1.x, so we shouldn't allow those enums there. This fixes oes_packed_depth_stencil-depth-stencil-texture_gles1 on i965 Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	2dfcaf7554	mesa/main: do not allow astc enums on gles1 ctx->Extensions.KHR_texture_compression_astc_ldr is set regardless of the API that's used, so checking for those direcly will always enable extensions when they are supported by the driver. But there's no extension enabling ASTC for OpenGL ES 1.x, so we shouldn't allow those enums there. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	1aa134038c	mesa/main: do not allow etc2 enums on gles1 ctx->Extensions.ARB_ES3_compatibility is set regardless of the API that's used, so checking for those direcly will always enable extensions when they are supported by the driver. But there's no extension enabling ETC2 for OpenGL ES 1.x, so we shouldn't allow those enums there. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	27ca87ccca	mesa/main: do not allow s3tc enums on gles1 There's no extension enabling S3TC formats on OpenGL ES 1.x, so we shouldn't allow these even if the driver can support it. So let's check for EXT_texture_compression_s3tc instead of ANGLE_texture_compression_dxt, which is supported on all other OpenGL variations. We also need to use _mesa_has_EXT_texture_compression_s3tc() instead of checking the driver cap directly, otherwise we end up enabling this on OpenGL ES 1.x, as the API isn't checked. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	d70cfb322a	mesa/main: use _mesa_has_FOO_bar for compressed format checks _mesa_has_FOO_bar() knows about the APIs these extensions should be supported under, so let's use that to simplify these checks a bit. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	70bfd31287	mesa/main: clean up integer texture check This makes the logic a little bit easier to follow, and reduce a bit of repetition. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	5109742e7b	mesa/main: clean up ES2_compatibility check This makes the logic a little bit easier to follow; this is either about ES2 compatibility or about gles. GL_RGB565 was added already in OpenGL ES 1.0. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	2e753b77dd	mesa/main: clean up OES_texture_float_linear check Using the _mesa_has_FOO_bar helpers is generally more safe and should generally be prefered over checking driver-caps like this code did, because the _mesa_has_FOO_bar helpers also verify the API type and version. This shouldn't have any practical effect here, as this function only gets called for OpenGL ES 3.x right now. But if this was to change in the future, this makes the function behave a lot more predictable. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	1373d117c2	mesa/main: clean up S3_s3tc check S3_s3tc is the extension that enables this functionality on desktop, so let's check for that one. The _mesa_has_S3_s3tc() helper already verifies the API according to the extension-table. As for the second hunk, we currently already only expose EXT_texture_compression_s3tc on desktop so by using the helper instead, we get rid of this detail here, and once we enable it for GLES we'll automaticall get the interaction right. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	e8b331ae13	mesa/main: rename format-check function _mesa_es3_error_check_format_and_type isn't specific to OpenGL ES 3.x, it applies to all versions of OpenGL ES. So let's rename it to reflect this. While we're at it, let's also rename a helper function it uses similarly. As the helper is static, we can also remove the namespacing-prefix from the name. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:44 +01:00
Erik Faye-Lund	ca8e2a5277	mesa/main: make _mesa_has_tessellation return bool All other _mesa_has_foo functions return bool rather than GLboolean, so let's follow that style here as well. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-03 18:16:43 +01:00
Chad Versace	3ef0ca65c9	i965: Fix -Wswitch on INTEL_COPY_STREAMING_LOAD The warning is emitted when building without INLINE_SSE41. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-12-03 13:07:56 +02:00
Karol Herbst	fc0139d283	nv50,nvc0: Fix gallium nine regression regarding sampler bindings The new approach is that samplers don't get unbound even if they won't be used in a draw and we should just leave them be as well. Fixes a regression in multiple windows games using gallium nine and nouveau. v2: adjust num_samplers to keep track of the highest sampler bound v3: rework how to set the new value of num_samplers Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106577 Fixes: `4d6fab245e` "cso: don't track the number of sampler states bound" Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-12-02 00:05:04 +01:00
Andre Heider	b6f095f7ce	d3dadapter9: use snprintf(..., "%s", ...) instead of strncpy Fixes -Wstringop-truncation compiler warnings. See `f836d799f9` "intel/decoder: use snprintf(..., "%s", ...) instead of strncpy" Signed-off-by: Andre Heider <a.heider@gmail.com> Reviewed-by: Axel Davy <davyaxel0@gmail.com>	2018-12-01 21:32:53 +01:00
Mauro Rossi	37a2072e97	android: st/mesa: fix building error due to sched_getcpu() Android has cpufeatures library but pinning of threads is not supported PIPE_OS_LINUX code path causes build error due to sched_getcpu() unavailable thus we need to avoid setting HAVE_SCHED_GETCPU for Android Fixes: `48f2160` ("st/mesa: regularly re-pin driver threads to the CCX where the app thread is") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-01 10:15:58 +01:00
Vinson Lee	4f74580d30	st/xvmc: Add X11 include path. This patch fixes this build error. CC tests/xvmc_bench.o In file included from tests/xvmc_bench.c:35: tests/testlib.h:38:10: fatal error: 'X11/Xlib.h' file not found ^~~~~~~~~~~~ Signed-off-by: Vinson Lee <vlee@freedesktop.org> Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-30 22:09:43 -08:00
Mauro Rossi	eed3f1121c	android: amd/addrlib: update Mesa's copy of addrlib Needed to fix build error in addrlib in mesa for Android Fixes: `776b911` ("amd/addrlib: update Mesa's copy of addrlib") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-12-01 01:13:53 +01:00
Gurchetan Singh	89b4798c06	virgl: don't mark buffers as unclean after a write We can mark the buffer unclean if it's ever bound as a TBO, SSBO, ABO, or image. This improves dEQP-GLES3.performance.buffer.data_upload.function_call.map_buffer_range.new_specified_buffer.flag_write_full.stream_draw from 9.58 MB/s to 451.17 MB/s. v2: Track buffer cleanliness as a function of bindings (Ilia). v3: virgl_modify_clean --> virgl_dirty_res (Erik) Tested-By: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2018-11-30 12:21:01 +01:00
Gurchetan Singh	d18492c64f	virgl: avoid large inline transfers We flush everytime the command buffer (16 kB) is full, which is quite costly. This improves dEQP-GLES3.performance.buffer.data_upload.function_call.buffer_data.new_buffer.usage_stream_draw from 111.16 MB/s to 1930.36 MB/s. In addition, I made the benchmark produce buffers from 0 --> VIRGL_MAX_CMDBUF_DWORDS * 4, and tried ((VIRGL_MAX_CMDBUF_DWORDS * 4) / 2), ((VIRGL_MAX_CMDBUF_DWORDS * 4) / 4), etc. I didn't notice any clear differences, so let's just go with the most obvious heuristic. Tested-By: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2018-11-30 12:20:41 +01:00
Gurchetan Singh	c0773315af	virgl: quadruple command buffer size Tested running WebGL aquarium on Nvidia host (10,000 fishes) This moves us from 7 fps to 9 fps. After quadrupling, performance gains diminish. v2: Remove change ID (Erik) Tested-By: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2018-11-30 12:20:06 +01:00
Lionel Landwerlin	37f9788e9a	anv: flush pipeline before query result copies Pipeline state pending bits should be taken into account when copying results. In the particular bug below, the results of the vkCmdCopyQueryPoolResults() command was being overwritten by the preceding vkCmdCopyBuffer() with a same destination buffer. This is because we copy the buffers using the 3D pipeline whereas we copy the query results using the command streamer. Those pieces of HW work in parallel and the results are somewhat undefined. v2: Unconditionally flush the pipeline before copying the results (Jason) v3: Wrap & expressions (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108894 Cc: mesa-stable@lists.freedesktop.org	2018-11-29 22:07:31 +00:00
Marek Olšák	39b20b7d4f	Revert "winsys/amdgpu: overallocate buffers for faster address translation on Gfx9" I didn't mean to push this. I don't think it makes any difference. This reverts commit `f737fe00a0`.	2018-11-29 14:46:06 -05:00
Roland Scheidegger	fbf95ce074	draw: fix infinite loop in line stippling The calculated length of a line may be infinite, if the coords we get are bogus. This leads to an infinite loop in line stippling. To prevent this test for this explicitly (although technically on at least x86 sse it would actually work without the explicit test, as long as we use the int-converted length value). While here also get rid of some always-true condition. Note this does not actually solve the root cause, which is that the coords we receive are bogus after clipping. This seems a difficult problem to solve. One issue is that due to float arithmetic, clip w may become 0 after clipping if the incoming geometry is "sufficiently degenerate", hence x/y/z ndc (and window) coords will be all inf (or nan). Even with w not quite 0, I believe it's possible we produce values which are actually outside the view volume. (Also, x=y=z=w=0 coords in clipspace would be not considered subject to clipping, and similarly result in all NaN coords.) We just hope for now other draw stages (and rasterizers) can handle those relatively safely (llvmpipe itself should be sort of robust against this, certainly converstion to fixed point will produce garbage, it might fail a couple assertions but should neither hang nor crash otherwise). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-11-29 18:39:40 +01:00
Józef Kucia	94bfb8bf38	nir: Fix assert in print_intrinsic_instr(). Signed-off-by: Józef Kucia <joseph.kucia@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-29 16:29:37 +00:00
Nicolai Hähnle	776b911365	amd/addrlib: update Mesa's copy of addrlib Update to the internal master as of 2018-11-15. This has a lot of gratuitous whitespace change, but on the plus side it's built using the same tooling that's used for AMDVLK, which should help going forward.	2018-11-29 13:18:24 +01:00
Nicolai Hähnle	621c107760	ac/surface/gfx9: let addrlib choose the preferred swizzle kind Our choices here are simply redundant as long as sin.flags is set correctly. (v2: - remove unused function parameter) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-11-29 13:18:23 +01:00
Nicolai Hähnle	729ebdf07e	radv: remove dependency on addrlib gfx9_enum.h v2: - use SI_CONTEXT_REG_OFFSET Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-29 13:18:23 +01:00
Thomas Hellstrom	058f85d41c	winsys/svga: Fix a memory leak The ioctl.cap_3d member was never freed. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-29 10:42:06 +01:00
Thomas Hellstrom	7fce3ca375	st/xa: Fix a memory leak Free the context after destruction. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-29 10:42:06 +01:00
Samuel Pitoiset	cc7deb749c	radv: drop few useless state changes when doing color/depth decompressions Viewport/scissor don't need to be updated for array textures. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:55 +01:00
Samuel Pitoiset	6d4f65deea	radv: remove unused pending_clears param in the transition path Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:53 +01:00
Samuel Pitoiset	4b9df824f7	radv: optimize CmdClear{Color,DepthStencil}Image() for layered textures If all layers are bound we can perform a fast color or depth clear instead of iterating over all layers. This has the advantage to avoid trashing the framebuffer for nothing if you we end up by doing a fast clear when calling radv_clear_image_layer(), and clearing all layers in one shot is obviously faster. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	7484bc894b	radv: refactor the fast clear path for better re-use Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	f78ee19702	radv: simplify a check in emit_fast_color_clear() Currently only true if RADV_PERFTEST=dccmsaa is set. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	eca931a726	radv: add radv_can_fast_clear_{color,depth}() helpers For further optimisations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	93f5ce8fa7	radv: add radv_image_view_can_fast_clear() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	aeaf8dbd09	radv: add radv_image_can_fast_clear() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	3e718db1ff	radv: remove useless check in emit_fast_color_clear() The driver doesn't support DCC/CMASK for mipmapped textures. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Vinson Lee	d0c7b079d0	freedreno: Fix autotools build. Fix build error. CXXLD pipe_msm.la ../../../../src/gallium/drivers/freedreno/.libs/libfreedreno.a(freedreno_batch.o): In function `batch_init': src/gallium/drivers/freedreno/freedreno_batch.c:54: undefined reference to `fd_device_version' src/gallium/drivers/freedreno/freedreno_batch.c:59: undefined reference to `fd_submit_new' src/gallium/drivers/freedreno/freedreno_batch.c:61: undefined reference to `fd_submit_new_ringbuffer' src/gallium/drivers/freedreno/freedreno_batch.c:64: undefined reference to `fd_submit_new_ringbuffer' src/gallium/drivers/freedreno/freedreno_batch.c:66: undefined reference to `fd_submit_new_ringbuffer' src/gallium/drivers/freedreno/freedreno_batch.c:70: undefined reference to `fd_submit_new_ringbuffer' Fixes: `b4476138d5` ("freedreno: move drm to common location") Fixes: `aa0fed10d3` ("freedreno: move ir3 to common location") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-11-28 22:23:52 -08:00
Marek Olšák	075fd5d8f2	radeonsi: add memory management stress tests for GDS Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	c1d3c08699	winsys/amdgpu: add support for allocating GDS and OA resources Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	d7a4fa91f0	radeonsi: allow si_cp_dma_clear_buffer to clear GDS from any IB Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	72b2b61d8c	winsys/amdgpu: use optimal VM alignment for CPU allocations Acked-by: Christian König <christian.koenig@amd.com>	2018-11-28 20:20:27 -05:00
Marek Olšák	27f9935075	winsys/amdgpu: use optimal VM alignment for imported buffers Window system buffers didn't use the optimal alignment. Acked-by: Christian König <christian.koenig@amd.com>	2018-11-28 20:20:27 -05:00
Marek Olšák	6b554d863f	winsys/amdgpu,radeon: pass vm_alignment to buffer_from_handle Acked-by: Christian König <christian.koenig@amd.com>	2018-11-28 20:20:27 -05:00
Marek Olšák	f737fe00a0	winsys/amdgpu: overallocate buffers for faster address translation on Gfx9 Sadly, the 3 games I tested (DeusEx:MD, DiRT Rally, DOTA 2) are unaffected by the overallocation, because I guess their buffers don't fall into the small range below a power-of-two size. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	8c00f778fc	winsys/amdgpu: increase the VM alignment to the MSB of the size for Gfx9 Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	a2a6b06d48	winsys/amdgpu: use >= instead of > for VM address alignment Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	98f2312b4f	winsys/amdgpu: clean up code around BO VM alignment Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	5f9ccf827e	winsys/amdgpu: optimize slab allocation for 2 MB amdgpu page tables - the slab buffer size increased from 128 KB to 2 MB (PTE fragment size) - the max suballocated buffer size increased from 64 KB to 256 KB, this increases memory usage because it wastes memory - the number of suballocators increased from 1 to 3 and they are layered on top of each other to minimize unused space in slabs The final increase in memory usage is: DeusEx:MD: 1.8% DOTA 2: 1.75% DiRT Rally: 0.2% The kernel driver will also receive fewer buffers.	2018-11-28 20:20:27 -05:00
Marek Olšák	cf6835485c	radeonsi: generalize the slab allocator code to allow layered slab allocators There is no change in behavior. It just makes it easier to change the number of slab allocators.	2018-11-28 20:20:27 -05:00
Marek Olšák	9576266a37	winsys/amdgpu: always reclaim/release slabs if there is not enough memory	2018-11-28 20:20:27 -05:00
Marek Olšák	015061beb3	radeonsi: fix is_oneway_access_only for bindless images	2018-11-28 20:20:27 -05:00
Marek Olšák	8c25ab1a23	radeonsi/nir: parse more information about bindless usage fill more tgsi_shader_info fields.	2018-11-28 20:20:27 -05:00
Marek Olšák	2a936f8afa	tgsi/scan: add more information about bindless usage radeonsi will use this.	2018-11-28 20:20:27 -05:00
Marek Olšák	fba91b5173	radeonsi: small cleanup for memory opcodes	2018-11-28 20:20:27 -05:00
Marek Olšák	709905cbb6	radeonsi: fix is_oneway_access_only for image stores We need to look at the Dst for image stores.	2018-11-28 20:20:27 -05:00
Marek Olšák	648dc52367	radeonsi: use structured buffer intrinsics for image views to stop using the workaround in si_make_buffer_descriptor.	2018-11-28 20:20:27 -05:00
Marek Olšák	442dae2693	radeonsi: clean up primitive binning enablement no change in behavior. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Dave Airlie	8eb8be3f54	virgl: fix undefined shift to use unsigned. Ported from virglrenderer. Signed-off-by: Dave Airlie <airlied@redhat.com>	2018-11-29 09:09:31 +10:00
Dave Airlie	2ddd44d941	r600: make suballocator 256-bytes align Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108311 Cc: <mesa-stable@lists.freedesktop.org>	2018-11-29 09:09:02 +10:00
Kenneth Graunke	f11780779f	intel/compiler: Use nir's info when checking uses_streams. Vulkan and Gallium don't use Mesa's gl_program data structure, so they can't poke at 'prog'. But we can simply use the copy of the shader info stored with the NIR shader, which is guaranteed to exist. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-11-28 13:35:29 -08:00
Jason Ekstrand	199a0353d6	nir/derefs: Add a nir_derefs_do_not_alias enum value This makes some of the code more clear. Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-11-28 14:29:25 -06:00
Gurchetan Singh	eb44c36cf1	egl: add missing #include <stddef.h> in egldevice.h Otherwise, I get this error: main/egldevice.h:54:13: error: ‘NULL’ undeclared (first use in this function) dev = NULL; ^~~~ with this config: ./autogen.sh --enable-gles1 --enable-gles2 --with-platforms='surfaceless' --disable-glx --with-dri-drivers="i965" --with-gallium-drivers="" --enable-gbm v3: Use stddef.h (Matt) v4: Modify commit message (Eric) Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-28 11:22:47 -08:00
Matt Turner	2d48d5116b	gallivm: Use nextafterf(0.5, 0.0) as rounding constant The common truncf(x + 0.5) fails for the floating-point value just less than 0.5 (nextafterf(0.5, 0.0)). nextafterf(0.5, 0.0) + 0.5, after rounding is 1.0, thus truncf does not produce the desired value. The solution is to add nextafterf(0.5, 0.0) instead of 0.5 before truncating. This works for all values. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-11-28 11:22:47 -08:00
Juan A. Suarez Romero	e2ad94d928	docs: update calendar, add news item and link release notes for 18.2.6 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-11-28 19:20:09 +01:00
Juan A. Suarez Romero	a53a280479	docs: add sha256 checksums for 18.2.6 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `cfd1f8b92c`)	2018-11-28 19:20:09 +01:00
Juan A. Suarez Romero	f6ab6e2867	docs: add release notes for 18.2.6 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `3e741344d7`)	2018-11-28 19:20:09 +01:00
Nicolai Hähnle	c02390f8fc	egl/wayland: rather obvious build fix Fixes: `ce74a7bb8d` ("egl/wayland: plug memory leak in drm_handle_device()") Fixes: `c59d3aa4b9` ("egl/wayland: bail out when drmGetMagic fails")	2018-11-28 18:30:36 +01:00
Nicolai Hähnle	eb94b6bd5c	winsys/amdgpu: explicitly declare whether buffer_map is permanent or not Introduce a new driver-private transfer flag RADEON_TRANSFER_TEMPORARY that specifies whether the caller will use buffer_unmap or not. The default behavior is set to permanent maps, because that's what drivers do for Gallium buffer maps. This should eliminate the need for hacks in libdrm. Assertions are added to catch when the buffer_unmap calls don't match the (temporary) buffer_map calls. I did my best to update r600 for consistency (r300 needs no changes because it never calls buffer_unmap), even though the radeon winsys ignores the new flag. As an added bonus, this should actually improve the performance of the normal fast path, because we no longer call into libdrm at all after the first map, and there's one less atomic in the winsys itself (there are now no atomics left in the UNSYNCHRONIZED fast path). Cc: Leo Liu <leo.liu@amd.com> v2: - remove comment about visible VRAM (Marek) - don't rely on amdgpu_bo_cpu_map doing an atomic write Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-11-28 18:24:14 +01:00
Nicolai Hähnle	35eb81987c	winsys/amdgpu: add amdgpu_winsys_bo::lock We'll use it in the upcoming mapping change. Sparse buffers have always had one. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-11-28 18:23:29 +01:00
Eric Engestrom	e0f1f74eda	vulkan/wsi: fix s/,/;/ typo Fixes: `59e58c348e` "vulkan/wsi: Only wait on semaphores on the first swapchain" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-28 16:44:01 +00:00
Emil Velikov	ce74a7bb8d	egl/wayland: plug memory leak in drm_handle_device() As we fail to open the node, we leak the node/device name. v2: Log and then free() (Eric) Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-28 16:12:12 +00:00
Emil Velikov	c59d3aa4b9	egl/wayland: bail out when drmGetMagic fails Currently as the function fails, we pass uninitialized data to the authentication function. Stop doing that and print an warning when the function fails. v2: Plug memory leak in error path (Eric) Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (v1) Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-28 16:11:22 +00:00
Eric Engestrom	9575cd2893	wsi/display: fix mem leak when freeing swapchains Fixes: `da997ebec9` "vulkan: Add KHR_display extension using DRM [v10]" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Keith Packard <keithp@keithp.com>	2018-11-28 12:09:54 +00:00
Gert Wollny	f08d107054	i965: Set the FBO error state INCOMPLETE_ATTACHMENT only for SRGB_R8 Originally the driver reported GL_FRAMEBUFFER_UNSUPPORTED in all cases, adding more specific error messages was not correct and broke many tests. Mostly revert this and only report GL_FRAMEBUFFER_INCOMPLETE_ATTACHMENT for MESA_FORMAT_R_SRGB8. Fixes: `ebcde34545` i965: be more specific about FBO completeness errors Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108805 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-28 10:12:47 +01:00
Gert Wollny	d8bb88d0b4	i965: Explicitely handle swizzles for MESA_FORMAT_R_SRGB8 The format is emulated by using ISL_FORMAT_L8_SRGB, therefore we need to force swizzles for the GBA channels. However, doing this only based on the data type GL_RED breaks other formats, therefore, test specifically for the format. Fixes: `c5363869d4` i965: Force zero swizzles for unused components in GL_RED and GL_RG Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-28 10:07:02 +01:00
Gert Wollny	091295d7cb	virgl: Don't try handling server fences when they are not supported vtest doesn't implement the according API and would segfault: Program received signal SIGSEGV, Segmentation fault. #0 0x0000000000000000 in ?? () #1 in virgl_fence_server_sync at src/gallium/drivers/virgl/virgl_context.c:1049 #2 in st_server_wait_sync at src/mesa/state_tracker/st_cb_syncobj.c:155 so just don't do the call when the function pointers are not set. Fixes dEQP: dEQP-GLES3.functional.fence_sync.wait_sync_smalldraw dEQP-GLES3.functional.fence_sync.wait_sync_largedraw Fixes: `d1a1c21e76` virgl: native fence fd support Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Robert Foss <robert.foss@collabora.com>	2018-11-28 10:02:31 +01:00
Gert Wollny	073fdd7382	virgl,vtest: Initialize return value Avoids: Conditional jump or move depends on uninitialised value(s) at 0x9E2B39F: virgl_vtest_winsys_resource_cache_create (virgl_vtest_winsys.c:379) by 0x9E2725F: virgl_buffer_create (virgl_buffer.c:169) by 0x9E246D5: virgl_resource_create (virgl_resource.c:60) by 0xA0C1B9F: bufferobj_data (st_cb_bufferobjects.c:344) by 0xA0C1B9F: st_bufferobj_data (st_cb_bufferobjects.c:390) by 0x9F4ACE3: vbo_use_buffer_objects (vbo_exec_api.c:1136) by 0xA0C68C3: st_create_context_priv (st_context.c:416) by 0xA0C707A: st_create_context (st_context.c:598) by 0x9F81C6B: st_api_create_context (st_manager.c:918) by 0x9BBE591: dri_create_context (dri_context.c:161) by 0x9BB6931: driCreateContextAttribs (dri_util.c:473) by 0x4E97A44: drisw_create_context_attribs (drisw_glx.c:630) by 0x4E7C591: glXCreateContextAttribsARB (create_context.c:78) Uninitialised value was created by a stack allocation at 0x9E2B249: virgl_vtest_winsys_resource_cache_create (virgl_vtest_winsys.c:342) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Robert Foss <robert.foss@collabora.com>	2018-11-28 10:02:31 +01:00
Iago Toral Quiroga	e55cbf26ea	intel/compiler: fix register allocation in opt_peephole_sel This wasn't handling 64-bit cases properly. Found by inspection. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-11-28 08:28:27 +01:00
Matt Turner	6f737b9207	glsl: Remove unused member variable Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-27 22:29:53 -08:00
Matt Turner	1a210268b8	nir: Call fflush() at the end of nir_print_shader() We normally call with stderr which is unbuffered, so this won't affect that, but it does let me call nir_print_shader(nir, fopen("log", "w+")) from gdb and actually get the whole shader in my file. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-27 22:29:53 -08:00
Eric Anholt	e113b21cb7	v3d: Add renderonly support. I've been using this with the kmsro series to test v3d on VKMS without my old KMS hack in the v3d kernel driver. KMSRO still needs some cleanup, but v3d RO support seems reasonable.	2018-11-27 15:03:02 -08:00
Eric Anholt	55edafa73e	gallium: Remove unused variable in u_tests. Fixes: `0d17b685b1` ("gallium/u_tests: add a compute shader test that clears an image") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-11-27 15:02:57 -08:00
Bas Nieuwenhuizen	6569644bb6	radv: Align large buffers to the fragment size. Improves performance in Talos by about 15% (and significant improvements in RotR and possibly other but did not bench with final patch) on kernel 4.19 and earlier. On 4.20+ a similar effect comes from 433ca054949a "drm/amdgpu: try allocating VRAM as power of two" v2: Do not impact the alignment of the physical memory. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> CC: <mesa-stable@lists.freedesktop.org>	2018-11-27 22:17:42 +01:00
Hyunjun Ko	76945e4140	freedreno: implements get_sample_position Since `1285f71d3e` landed, it needs to provide apps with proper sample position for MSAA. Currently no way to query this to hw, these are taken from blob driver. Fixes: dEQP-GLES31.functional.texture.multisample.samples_#.sample_position Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:03 -05:00
Rob Clark	5973a4d0b7	freedreno/a3xx: also set FSSUPERTHREADENABLE We set equiv bit in SP_FS_CTRL_REG0. Somehow the hw doesn't hang with this mismatched config, but does run slower. It is faster with either neither bit set, or both bits set, but both is the fastest of the three configurations. Worth a bit over 10% gain in glmark2. Spotted-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:03 -05:00
Jonathan Marek	e68cd91251	freedreno: use MSM_BO_SCANOUT with scanout buffers Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2018-11-27 15:44:03 -05:00
Jonathan Marek	3ed4aad524	freedreno: use GENERIC instead of TEXCOORD for blit program blip_fp uses GENERIC as input, so blit_vp should match for linking Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:03 -05:00
Jonathan Marek	3a273a4abc	freedreno: a2xx texture update Adds all missing texture related logic. For everything to work it also needs changes to ir2/fd2_program, which are part of the ir2 update patch. Note: it needs rnndb update Signed-off-by: Jonathan Marek <jonathan@marek.ca> [remove stray patch] Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:03 -05:00
Jonathan Marek	4887aba638	freedreno/a2xx: Compute depth base in gmem correctly Note: it needs rnndb update Signed-off-by: Marek Vasut <marex@denx.de> Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:03 -05:00
Jonathan Marek	e7114575f7	freedreno/a2xx: set VIZ_QUERY_ID on a20x Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:03 -05:00
Jonathan Marek	a50b8a0152	freedreno: add missing a20x ids 200: 256KiB GMEM A200 (imx53) 201: 128KiB GMEM A200 (imx51) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:03 -05:00
Jonathan Marek	4e6ee033ff	freedreno/a2xx: fix POINT_MINMAX_MAX overflow As it stands, it overflows to zero. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:03 -05:00
Jonathan Marek	78fede86d9	freedreno: a2xx: fd2_draw update Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Jonathan Marek	3e7186d472	nir: add fceil lowering lowers ceil(x) as -floor(-x) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	11593f9041	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	d47d77d49d	freedreno/a6xx: set guardband clip On older gens, the CLIP_ADJ bitfields were actually 3.6 fixed point. Which might make more sense. Although this formula comes up with values pretty close to what blob does for various viewport sizes (for at least a5xx and a6xx), and seems to work. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	2773919f06	freedreno/a6xx: disable LRZ for z32 `f6131d4ec7` had the side effect of enabling LRZ w/ 32b depth buffers. But there are some bugs with this, which aren't fully understood yet, so for now just skip LRZ w/ z32.. Fixes: `f6131d4ec7` freedreno/a6xx: Clear z32 and separate stencil with blitter Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Kristian H. Kristensen	9595be67a9	freedreno/a6xx: Clear gmem buffers at flush time We generate an IB to clear the gmem at flush time and jump to it before rendering each tile. This lets us get rid of the command stream patching for gmem offsets. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Kristian H. Kristensen	b5a9bb28c6	freedreno/a6xx: Move resolve blits to an IB Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Kristian H. Kristensen	5f068cf3b0	freedreno/a6xx: Move restore blits to IB Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	09300bbe03	mesa/st: better colormask check for clear fallback For RGB surfaces (for example) we don't really care that the colormask is 0x7 instead of 0xf. This should not trigger clear_with_quad() slowpath. Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-11-27 15:44:02 -05:00
Rob Clark	65cee01430	mesa/st: swap order of clear() and clear_with_quad() If we can't clear all the buffers with pctx->clear() (say, for example, because of ColorMask), push the buffers we can clear with pctx->clear() first. Tilers want to see clears coming before draws to enable fast- paths, and clearing one of the attachments with a quad-draw first confuses that logic. Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-27 15:44:02 -05:00
Rob Clark	aa0fed10d3	freedreno: move ir3 to common location Move (most of) the ir3 compiler to src/freedreno/ir3 so that it can be re-used by some future vulkan driver. The parts that are gallium specific have been refactored out and remain in the gallium driver. Getting the move done now so that it can happen before further refactoring to support a6xx specific instructions. NOTE also removes ir3_cmdline compiler tool from autotools build since that was easier than fixing it and I normally use meson build. Waiting patiently for the day that we can remove everything from the autotools build. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	556eec249d	freedreno/ir3: remove u_inlines usage Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	312eae45a3	freedreno/ir3: split up ir3_shader Split the parts that are gallium specific into ir3_gallium so the rest can move to a common location outside of gallium. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	ea4cbf601d	freedreno/ir3: remove pipe_stream_output_info dependency A bit annoying to have to copy into our own struct. But this is something the compiler really needs to know, at least on earlier generations where streamout is implemented in shader. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	030e98630d	freedreno/ir3: some header file cleanup Clean up some of the low-hanging-fruit usages of freedreno_util.h Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	2482153d52	freedreno/ir3: use env_var_as_unsigned() Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	a321f939f6	util: env_var_as_unsigned() helper So I can drop env2u() helper from freedreno_util.h and get rid of one small ir3 dependency on gallium/freedreno Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	bfd8d26372	freedreno/ir3: move disasm and optmsgs debug flags Move them to IR3_SHADER_DEBUG so we can remove ir3's dependency on fd_mesa_debug. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	424d75656f	freedreno: FD_SHADER_DEBUG -> IR3_SHADER_DEBUG Only used by ir3, so move it into ir3 to be more self contained. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	8a654f092e	freedreno: remove shader_stage_name() Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	c635703c50	freedreno: shader_t -> gl_shader_stage Just massive search/replace for the most part. Step towards removing ir3 dependency on disasm.h which is shared by a2xx. One step closer to being able to move ir3 out of gallium. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	388aac32ed	freedreno/ir3: standalone compiler updates Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	b4476138d5	freedreno: move drm to common location So that we can re-use at least parts of it for vulkan driver, and so that we can move ir3 to a common location (which uses fd_bo to allocate storage for shaders) Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Rob Clark	6cb74eb4f1	freedreno/drm: remove dependency on gallium driver Prep work to move drm to a common location. Slightly hacky, but the softpin debug flag is only temporary. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Dylan Baker	88c4680b5a	util: promote u_memory to src/util as well as os_memory* Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:02 -05:00
Eric Anholt	bade179153	gallium: Fix uninitialized variable warning in compute test. The compiler doesn't know that ny != 0, so x might be uninitialized for the printf at the end. Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-11-27 11:23:22 -08:00
Bas Nieuwenhuizen	08ea6b9d9b	radv: Clamp gfx9 image view extents to the allocated image extents. Mirrors AMDVLK. Looks like if we go over the alignment of height we actually start to change the addressing. Seems like the extra miplevels actually work with this. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108245 Fixes: `f6cc15dccd` "radv/gfx9: fix block compression texture views. (v2)" Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-11-27 10:19:52 +01:00
Iago Toral Quiroga	453570cd8c	intel/compiler: fix indentation style in opt_algebraic()	2018-11-27 09:53:09 +01:00
Anuj Phogat	16e4911972	anv/icl: Set use full ways in L3CNTLREG L3 allocation table in h/w specification recommends using 4 KB granularity for programming allocation fields in L3CNTLREG. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2018-11-26 15:11:36 -08:00
Anuj Phogat	3f55fd3814	intel/icl: Set way_size_per_bank to 4 Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2018-11-26 15:11:36 -08:00
Anuj Phogat	3ce04da5b4	i965/icl: Set use full ways in L3CNTLREG L3 allocation table in h/w specification recommends using 4 KB granularity for programming allocation fields in L3CNTLREG. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2018-11-26 15:11:36 -08:00
Anuj Phogat	3282c7be89	i965/icl: Fix L3 configurations Use L3 configuration specified in h/w specification. V2: Drop configs which do under allocation of l3 cache. Bump up the comment above table. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2018-11-26 15:11:36 -08:00
Eric Engestrom	c0c533767e	build: stop defining unused VERSION Scons and autotools don't define it, and as of last commit nothing uses it. `VERSION` is also a generic enough name that something somewhere will eventually clash, and we don't want to repeat the LLVM `DEBUG` fiasco. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-11-26 22:05:02 +00:00
Eric Engestrom	bd12e02530	vulkan/utils: s/VERSION/PACKAGE_VERSION/ Everything else uses PACKAGE_VERSION, so let's be consistent, and VERSION and PACKAGE_VERSION are currently defined to be the same in meson and android, while VERSION is undefined in autotools and scons. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-11-26 22:05:02 +00:00
Eric Engestrom	56d126f8fd	anv: correctly use vulkan 1.0 by default Per chapter 3.2 "Instances": > Providing a NULL VkInstanceCreateInfo::pApplicationInfo or providing > an apiVersion of 0 is equivalent to providing an apiVersion of > VK_MAKE_VERSION(1,0,0). Reported-by: Niklas Haas <git@haasn.xyz> Fixes: `8c048af589` "anv: Copy the appliation info into the instance" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-26 22:05:02 +00:00
Erik Faye-Lund	d6d35d87f1	mesa/main: fixup requirements for GL_PRIMITIVES_GENERATED This enum is also allowed by EXT_tessellation_shader, which is supported on older i965 HW (as opposed to OES_geometry_shader). This was missed when narrowing this code-path, leading to dEQP regressions. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108868 Fixes: `f09d94fbd1` "mesa/main: fix validation of transform-feedback queries" Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Mark Janes <mark.a.janes@intel.com>	2018-11-26 22:12:07 +01:00
Erik Faye-Lund	c120dbfe4d	mesa/main: fix incorrect depth-error If glGetTexImage or glGetnTexImage is called with a level that doesn't exist, we get an error message on this form: Mesa: User error: GL_INVALID_VALUE in glGetTexImage(depth = 0) This is clearly nonsensical, because these APIs don't even have a depth-parameter. The reason is that get_texture_image_dims() return all-zero dimensions for non-existent texture-images, and we go on to validate these dimensions as if they were user-input, because glGetTextureSubImage requires checking. So let's split this logic in two, so glGetTextureSubImage can have stricter input-validation. All arguments that are no longer validated are generated internally by mesa, so there's no use in validating them. Fixes: `42891dbaa1` "gettextsubimage: verify zoffset and depth are correct" Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-11-26 12:29:54 +01:00
Erik Faye-Lund	38af69adfa	mesa/main: check cube-completeness in common code This check is the only part of dimensions_error_check that isn't about error-checking the offset and size arguments of glGet[Compressed]TextureSubImage(), so it doesn't really belong in here. This doesn't make a difference right now, apart for changing the presedence of this error. But it will make a difference for the next patch, where we no longer call this method from the non-sub tex-image getters. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-11-26 12:29:54 +01:00
Erik Faye-Lund	42820c5727	mesa/main: factor out common error-checking This error checking is the same for teximage and texsubimage getters, so let's factor it out to its own function. This will be useful when getteximage and gettexsubimage gets their own error checking routines a bit later. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-11-26 12:29:54 +01:00
Erik Faye-Lund	5e0a84f31c	mesa/main: factor out tex-image error-checking This will be useful when we split error-checking for getteximage and gettexsubimage later. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-11-26 12:29:54 +01:00
Erik Faye-Lund	38bbb61252	mesa/main: remove bogus error for zero-sized images The explanation quotes the spec on the following wording to justify the error: "An INVALID_VALUE error is generated if xoffset + width is greater than the texture’s width, yoffset + height is greater than the texture’s height, or zoffset + depth is greater than the texture’s depth." However, this shouldn't generate an error in the case where all three of width, xoffset and the texture's width are zero. In this case, we end up generating an unspecified error. So let's remove this check, and instead make sure that we consider this as an empty texture. So let's not generate an error, there's non mandated in the spec in xoffset/yoffset/zoffset = 0 case. We already avoid doing any work in this case, because of the final, non-error generating check in this function. Fixes: `b37b35a5d2` "getteximage: assume texture image is empty for non defined levels" Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-11-26 12:29:54 +01:00
Erik Faye-Lund	f1998e15ff	mesa/main: remove ARB suffix from glGetnTexImage This function has been core since OpenGL 4.3, so naming the implementation and reporting erros using an ARB-suffix can be confusing. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-11-26 12:29:54 +01:00
Gert Wollny	f5d053702f	glsl: free or reuse memory allocated for TF varying When a shader program is de-serialized the gl_shader_program passed in may actually still hold memory allocations for the transform feedback varyings. If that is the case, free the varying names and reallocate the new storage for the names array. This fixes a memory leak: Direct leak of 48 byte(s) in 6 object(s) allocated from: in malloc (/usr/lib64/gcc/x86_64-pc-linux-gnu/7.3.0/libasan.so+0xdb880) in transform_feedback_varyings ../../samba/mesa/src/mesa/main/transformfeedback.c:875 in _mesa_TransformFeedbackVaryings ../../samba/mesa/src/mesa/main/transformfeedback.c:985 ... Indirect leak of 42 byte(s) in 6 object(s) allocated from: in __interceptor_strdup (/usr/lib64/gcc/x86_64-pc-linux-gnu/7.3.0/libasan.so+0x761c8) in transform_feedback_varyings ../../samba/mesa/src/mesa/main/transformfeedback.c:887 in _mesa_TransformFeedbackVaryings ../../samba/mesa/src/mesa/main/transformfeedback.c:985 Fixes: `ab2643e4b0` glsl: serialize data from glTransformFeedbackVaryings Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-26 09:58:25 +01:00
Bas Nieuwenhuizen	3c96a1e3a9	radv: Fix opaque metadata descriptor last layer. We used the layer count which results in an off by one error. Not sure this really affects anything. Fixes: `f4e499ec79` "radv: add initial non-conformant radv vulkan driver" Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-26 09:29:39 +01:00
Mathias Fröhlich	ff466c2d48	mesa/st: Make st_pipe_vertex_format static. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-26 07:57:10 +01:00
Mathias Fröhlich	2a3eae82a1	mesa/st: Use binding information from the VAO in feedback rendering. Use VAO binding information in feedback rendering. In theory it should reduce the amount of buffer objects scheduled for rendering. Feedback rendering is implemented in a crude way anyhow, so I do not expect much gain here. But for the sake of code reuse we should use the same code for the same task. And finally if feeback rendering may get improved the array setup is already well done there. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-26 07:57:10 +01:00
Mathias Fröhlich	a00a8fb8d1	mesa/st: Avoid extra references in the feedback draw function scope. The change removes the reference that is held on the entries of the vbuffers[] array. The new code does not do that anymore as following the code into draw_set_vertex_buffers() the draw context holds an other reference as long as it is reset down the function again. So it should be already by that argument save to remove that additional reference count. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-26 07:57:10 +01:00
Mathias Fröhlich	6705188cc5	mesa/st: Factor out array and buffer setup from st_atom_array.c. Factor out vertex array setup routines from the array state atom. The factored functions will be used in feedback rendering in the next change. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-26 07:57:09 +01:00
Mathias Fröhlich	774d585d49	mesa/st: Only unmap the uploader that was actually used. In st_atom_array, we only need to unmap the upload buffer that was actually used. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-26 07:57:09 +01:00
Mathias Fröhlich	65332aff29	mesa/st: Only care about the uploader if it was used. In st_atom_array, we only need to care for unmapping the upload buffer if we actually used it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-26 07:57:09 +01:00
Ilia Mirkin	927ce66b39	nv50/ir: remove dnz flag when converting MAD to ADD due to optimizations dnz flag only applies for multiplications (e.g. to make 0 * Infinity becomes 0 instead of NaN). Once we optimize a MAD into an ADD, the dnz flag no longer makes sense, and upsets the GM107 emitter (since it looks at the ftz and dnz flags together). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-11-24 22:15:53 -05:00
Marek Olšák	d4e7d8b7f0	winsys/amdgpu: fix a device handle leak in amdgpu_winsys_create Cc: 18.2 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-23 17:08:44 -05:00
Marek Olšák	82aa07f81f	winsys/amdgpu: fix a buffer leak in amdgpu_bo_from_handle Cc: 18.2 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-23 17:08:42 -05:00
Samuel Pitoiset	9fc1ce258c	radv: ignore subpass self-dependencies for CreateRenderPass() too We really need to refactor this... Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-23 11:59:11 +01:00
Samuel Pitoiset	2951a766bd	radv: remove useless sync before CmdClear{Color,DepthStencil}Image() We don't need to flush anything before these two commands as well. This is because they have to be externally synchronized, so the app should have called CmdPipelineBarrier() prior to that and the driver should have flushed the caches. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-23 11:59:08 +01:00
Erik Faye-Lund	a652842982	mesa/main: remove overly strict query-validation The rules encoded in this code also applies to OpenGL ES 3.0 and up, but the per-enum validation has already been taught about these rules. So let's get rid of this duplicate, narrow version of the validation. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:36 +01:00
Erik Faye-Lund	d52be6dd29	mesa/main: fix validation of GL_TIMESTAMP ctx->Extensions.ARB_timer_query is set based on the driver- capabilities, not based on the context type. We need to check against _mesa_has_ARB_timer_query(ctx) instead to figure out if the extension is really supported. We also need to check for EXT_disjoint_timer_query for GLES-support. This shouln't have any functional effect, as this entry-point is only valid on desktop GL, or on GLES with EXT_disjoint_timer_query in the first place. But if this gets added to the core of a future version of ES, this should be a step in the right direction. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:36 +01:00
Erik Faye-Lund	7a4d74c35a	mesa/main: fix validation of ARB_query_buffer_object ctx->Extensions.ARB_query_buffer_object is set based on the driver- capabilities, not based on the context type. We need to check against _mesa_has_ARB_query_buffer_object(ctx) instead to figure out if the extension is really supported. This turns attempts to read queries into buffer objects on ES 3 into errors, as required by the spec. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:36 +01:00
Erik Faye-Lund	75e39b59dc	mesa/main: fix validation of transform-feedback overflow queries ctx->Extensions.ARB_transform_feedback_overflow_query is set based on the driver-capabilities, not based on the context type. We need to check against _mesa_has_RB_transform_feedback_overflow_query(ctx) instead to figure out if the extension is really supported. This turns usage of GL_TRANSFORM_FEEDBACK_STREAM_OVERFLOW and GL_TRANSFORM_FEEDBACK_OVERFLOW into errors on ES 3, as required by the spec. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:36 +01:00
Erik Faye-Lund	f09d94fbd1	mesa/main: fix validation of transform-feedback queries ctx->Extensions.EXT_transform_feedback is set based on the driver- capabilities, not based on the context type. We need to check against _mesa_has_EXT_transform_feedback(ctx) instead to figure out if the extension is really supported. We also need to check for OES_geometry_shader. This turns usage of GL_TRANSFORM_FEEDBACK_PRIMITIVES_WRITTEN into an error on ES 2, as well as usage of GL_PRIMITIVES_GENERATED on ES 3, both as required by the spec. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:36 +01:00
Erik Faye-Lund	b551fe5fa7	mesa/main: fix validation of GL_TIME_ELAPSED ctx->Extensions.EXT_timer_query is set based on the driver- capabilities, not based on the context type. We need to check against _mesa_has_EXT_timer_query(ctx) instead to figure out if the extension is really supported. We also need to check for EXT_disjoint_timer_query, which enables the same functionality for ES. This turns usage of GL_TIME_ELAPSED into an error on ES 3, as is required by the spec. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:36 +01:00
Erik Faye-Lund	059928e114	mesa/main: fix validation of GL_ANY_SAMPLES_PASSED_CONSERVATIVE ctx->Extensions.ARB_ES3_compatibility is set based on the driver- capabilities, not based on the context type. We need to check against _mesa_has_ARB_ES3_compatibility(ctx) instead to figure out if the extension is really supported. In addition, EXT_occlusion_query_boolean should also allow this behavior. This shouldn't cause any functional change, as all drivers that support ES3_compatibility should in practice enable either ES3_compatibility or EXT_occlusion_query_boolean under all APIs that export this symbol. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:35 +01:00
Erik Faye-Lund	8ea819dd60	mesa/main: fix validation of GL_ANY_SAMPLES_PASSED ctx->Extensions.ARB_occlusion_query2 is set based on the driver- capabilities, not based on the context type. We need to check against _mesa_has_ARB_occlusion_query2(ctx) instead to figure out if the extension is really supported. In addition, EXT_occlusion_query_boolean should also allow this behavior. This shouldn't cause any functional change, as all drivers that support ARB_occlusion_query2 should in practice enable either ARB_occlusion_query2 or EXT_occlusion_query_boolean under all APIs that export this symbol. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:35 +01:00
Erik Faye-Lund	fff1738d57	mesa/main: fix validation of GL_SAMPLES_PASSED ctx->Extensions.ARB_occlusion_query is set based on the driver- capabilities, not based on the context type. We need to check against _mesa_has_ARB_occlusion_query(ctx) instead to figure out if the extension is really supported. We also need to check for ARB_occlusion_query2, as ARB_occlusion_query isn't available in core contexts. This turns usage of GL_SAMPLES_PASSED into an error on ES 3, as is required by the spec. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:35 +01:00
Erik Faye-Lund	9c13ad0ea4	mesa/main: simplify pipeline-statistics query validation The _mesa_has_ARB_pipeline_statistics_query(ctx)-helper will already check the GLES-version according to the extension-table, so if this extension would ever be back-ported to ES, we only need to update the table to support this. This shouln't have any functional effect. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:35 +01:00
Erik Faye-Lund	dd4241b34f	mesa/main: use non-prefixed enums for consistency These enums all have the same values as their non-prefixed versions, and there's several aliases for some of them. So let's switch to the non-prefixed versions for simplicity. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:35 +01:00
Erik Faye-Lund	ba4e8d3754	mesa/main: correct year for EXT_occlusion_query_boolean According to the extension spec, this was initially released in 2011, so let's set this to the correct value. The value of 2001 could be a copy-paste mistake, as ARB_occlusion_query which this is based on was released then. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:35 +01:00
Erik Faye-Lund	35555b08d7	mesa/main: correct requirement for EXT_occlusion_query_boolean EXT_occlusion_query_boolean require support for GL_ANY_SAMPLES_PASSED, which ARB_occlusion_query doesn't supply. We need ARB_occlusion_query2 for this instead. This is still not 100% accurate, as we also require support for the GL_SAMPLES_PASSED_CONSERVATIVE target, which isn't guaranteed by either ARB_occlusion_query nor ARB_occlusion_query2. But it should be trivial to implement for any driver supporting ARB_occlusion_query2, as it can simply be implemented as GL_ANY_SAMPLES_PASSED. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-23 10:48:35 +01:00
Tapani Pälli	09adaa4b89	anv: allow exporting an imported SYNC_FD semaphore type Fixes issues with following SkQP tests: unitTest_VulkanHardwareBuffer_Vulkan_EGL_Syncs unitTest_VulkanHardwareBuffer_Vulkan_Vulkan_Syncs Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-23 07:49:46 +02:00
Eric Engestrom	896c59d690	glapi: add missing visibility args Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108829 Fixes: `3218056e0e` "meson: Build i965 and dri stack" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-22 18:21:05 +00:00
Jason Ekstrand	a24654b49d	anv/nir: Rework arguments to apply_pipeline_layout Instead of taking a whole pipeline (which could be anything!), just take a physical device and robust_buffer_access boolean. This makes it easier to verify that only the things in the hash actually affect pipeline compilation. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-11-22 09:17:28 -06:00
Jason Ekstrand	617e402b3d	anv: Put robust buffer access in the pipeline hash It affects apply_pipeline_layout. Shaders compiled with the wrong value will work but they may not be robust as requested by the app. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-11-22 09:17:10 -06:00
Jason Ekstrand	a845c2bc10	anv: Expose VK_EXT_scalar_block_layout Our compile already splits UBO loads into scalars and the untyped surface read messages we use for SSBO reads and writes only require dword alignment. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-22 08:16:47 -06:00
Jason Ekstrand	2ca9a4417d	vulkan: Update the XML and headers to 1.1.93 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-22 08:16:40 -06:00
Samuel Pitoiset	4ff4af3d91	radv: remove useless sync after CmdClear{Color,DepthStencil}Image() 'post_flush' is only set to NULL for the normal clear path (ie. only vkCmdClearColorImage() and vkCmdClearDepthStencilImage() are affected commands). Because these two operations have to be externally synchronized with VK_PIPELINE_STAGE_TRANSFER_BIT and VK_ACCESS_TRANSFER_WRITE_BIT, it's useless to set those flags internallY. VK_PIPELINE_STAGE_TRANSFER_BIT will wait for compute to be idle, while VK_ACCESS_TRANSFER_WRITE_BIT will invalidate both L1 vector caches and L2. RADV_CMD_FLAG_WRITEBACK_GLOBAL_L2 will be superseded by RADV_CMD_FLAG_INV_GLOBAL_L2. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-22 08:56:36 +01:00
Bas Nieuwenhuizen	33b2f74e77	vulkan: Allow storage images in the WSI. Since apps also have to follow the ImageFormatProperties query, we can disallow formats that don't allow image stores (for AMD that would be SRGB formats). Note that this only affects anything if the app actually decides to use the flag. Had someone ask for this on IRC and at least on the AMD side we can support it. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-21 21:36:55 +01:00
Axel Davy	1f1d4d571a	st/nine: Remove thread_submit warning thread_submit can be useful even without DRI_PRIME, as it can help avoid missed pageflips. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Andre Heider <a.heider@gmail.com>	2018-11-21 19:55:28 +01:00
Axel Davy	d304f0aa31	st/nine: Allow 'triple buffering' with thread_submit The path allowing triple buffering behaviour wasn't implemented yet for thread_submit Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Andre Heider <a.heider@gmail.com>	2018-11-21 19:55:28 +01:00
Robert Foss	19af208c7d	virgl: add assert and missing function parameter Verify the pipe_fd_type to be of PIPE_FD_TYPE_NATIVE_SYNC. Fixes: `d1a1c21e76` "virgl: native fence fd support" Suggested-by: Eric Engestrom <eric.engestrom@intel.com> Signed-off-by: Robert Foss <robert.foss@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-21 15:59:00 +01:00
Gert Wollny	61b535437e	r600: clean up the GS ring buffers when the context is destroyed This fixes two memory leaks reported by ASAN: Direct leak of 248 byte(s) in 1 object(s) allocated from: in malloc (/usr/lib64/gcc/x86_64-pc-linux-gnu/7.3.0/libasan.so+0xdb880) in r600_alloc_buffer_struct ../../samba/mesa/src/gallium/drivers/r600/r600_buffer_common.c:578 in r600_buffer_create ../../samba/mesa/src/gallium/drivers/r600/r600_buffer_common.c:600 in r600_resource_create_common ../../samba/mesa/src/gallium/drivers/r600/r600_pipe_common.c:1265 in r600_resource_create ../../samba/mesa/src/gallium/drivers/r600/r600_pipe.c:725 in pipe_buffer_create ../../samba/mesa/src/gallium/auxiliary/util/u_inlines.h:291 in update_gs_block_state ../../samba/mesa/src/gallium/drivers/r600/r600_state_common.c:1482 Direct leak of 248 byte(s) in 1 object(s) allocated from: in malloc (/usr/lib64/gcc/x86_64-pc-linux-gnu/7.3.0/libasan.so+0xdb880) in r600_alloc_buffer_struct ../../samba/mesa/src/gallium/drivers/r600/r600_buffer_common.c:578 in r600_buffer_create ../../samba/mesa/src/gallium/drivers/r600/r600_buffer_common.c:600 in r600_resource_create_common ../../samba/mesa/src/gallium/drivers/r600/r600_pipe_common.c:1265 in r600_resource_create ../../samba/mesa/src/gallium/drivers/r600/r600_pipe.c:722 in pipe_buffer_create ../../samba/mesa/src/gallium/auxiliary/util/u_inlines.h:291 in update_gs_block_state ../../samba/mesa/src/gallium/drivers/r600/r600_state_common.c:1489 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Fixes: `1371d65a7f` r600g: initial support for geometry shaders on evergreen (v2) Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-11-21 10:34:17 +01:00
Samuel Pitoiset	4b9bc4791b	radv: only sync CP DMA for transfer operations or bottom pipe CP DMA can only be busy when the driver copies buffers. The only affected Vulkan commands are vkCmdCopyBuffer() and vkCmdUpdateBuffer() (because we fallback to a copy depending on a threshold). Clear operations are currently not concerned because the driver always syncs after the last DMA operation. Per the spec, these two operations have to be externally synchronized with VK_PIPELINE_STAGE_TRANSFER_BIT. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-21 10:03:01 +01:00
Samuel Pitoiset	457ac6ce1e	radv: ignore subpass self-dependencies Unnecessary as they allow the app to call vkCmdPipelineBarrier() inside the render pass. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-21 10:02:59 +01:00
Iago Toral Quiroga	8e73b57634	Revert "nir/builder: Assert that intN_t immediates fit" This reverts commit `1f29f4db1e`. For this to work the compiler must ensure that it never puts the values that arrive to this helper into unsigned variables at any point in its processing, since that would not apply sign extension to the value and it would break the expectations here. Unfortunately, we use uint64_t extensively to pass and copy things around, so some times we get to this helper with values that are not properly sign extended to 64-bit. Here is an example for an 8-bit value that comes from a switch case: (gdb) p /x x $1 = 0xffffffd6 The value seems to have been sign extended to 32-bit at some point getting proper sign extension, but then copied into a uint64_t which wont' apply sign extension, breaking the expectations of the assertion. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-21 08:12:50 +01:00
Iago Toral Quiroga	387888e3b7	nir/from_ssa: fix bit-size of temporary register Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-21 08:07:22 +01:00
Mathias Fröhlich	2d3c466add	mesa: Remove unneeded bitfield widths from the VAO. With the current VAO layout we do not need to make these fields a bitfield. We get a tight struct layout with this change for VAO attributes. v2: Change unsigned char -> GLubyte. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-21 06:27:19 +01:00
Mathias Fröhlich	0a7020b4e6	mesa: Factor out struct gl_vertex_format. Factor out struct gl_vertex_format from array attributes. The data type is supposed to describe the type of a vertex element. At this current stage the data type is only used with the VAO, but actually is useful in various other places. Due to the bitfields being used, special care needs to be taken for the glGet code paths. v2: Change unsigned char -> GLubyte. Use struct assignment for struct gl_vertex_format. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-21 06:27:19 +01:00
Mathias Fröhlich	2da7b0a2fb	tnl: Use gl_array_attribute::_ElementSize. Instead of open coding the size computation, use the already available gl_array_attribute::_ElementSize value. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-21 06:27:19 +01:00
Mathias Fröhlich	a4c01839c2	nouveau: Use gl_array_attribute::_ElementSize. Instead of open coding the size computation, use the already available gl_array_attribute::_ElementSize value. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-21 06:27:19 +01:00
Mathias Fröhlich	182ed6de8c	mesa: Unify glEdgeFlagPointer data type. Use GL_UNSIGNED_BYTE as initialization data type for the edge flag vertex attribute array. The same datatype is used in the glEdgeFlagPointer function when setting the array pointer. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-21 06:27:19 +01:00
Mathias Fröhlich	1b743e2966	mesa: Work with bitmasks when en/dis-abling VAO arrays. For enabling or disabling VAO arrays it is now possible to change a set of arrays with a single call without the need to iterate the attributes. Make use of this technique in the vao module. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-21 06:27:19 +01:00
Mathias Fröhlich	3c46fa5988	mesa: Remove gl_array_attributes::Enabled. Now that all users go via the VAO Enabled bitfield, get rid of the Enabled boolean. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-21 06:27:19 +01:00
Mathias Fröhlich	093aeb3565	mesa: Use gl_vertex_array_object::Enabled for glGet. Instead of using gl_array_attributes::Enabled use the much more compact representation stored in gl_vertex_array_object::Enabled using the corresponding bits. Keep the glGet changes in a seperate patch at least for review. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-21 06:27:19 +01:00
Mathias Fröhlich	1217a8448c	mesa: Use the gl_vertex_array_object::Enabled bitfield. Instead of using gl_array_attributes::Enabled use the much more compact representation stored in gl_vertex_array_object::Enabled using the corresponding bits. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-21 06:27:19 +01:00
Mathias Fröhlich	73d2d313e9	mesa: Rename gl_vertex_array_object::_Enabled -> Enabled. Mark the up to now derived bitfield value now as primary value by removing the underscore. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-21 06:27:19 +01:00
Marek Olšák	ea9f95e2a6	radeonsi: go back to using bottom-of-pipe for beginning of TIME_ELAPSED Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102597 Cc: 18.3 <mesa-stable@lists.freedesktop.org> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-20 21:18:48 -05:00
Marek Olšák	6c1a34d2e7	radeonsi: don't send data after write-confirm with BOTTOM_OF_PIPE_TS There are no writes. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-20 21:18:46 -05:00
Marek Olšák	bc5adc27b5	st/mesa: pin driver threads to a fixed CCX when glthread is enabled radeonsi has 3 driver threads (glthread, gallium, winsys), other drivers may have 2 (glthread, gallium), so it makes sense to pin them to a random CCX and keep that irrespective of the app thread. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-20 21:18:43 -05:00
Marek Olšák	48f2160936	st/mesa: regularly re-pin driver threads to the CCX where the app thread is This is used when glthread is disabled. Mesa pretty much chases the app thread on the CPU. The performance is the same as pinning the app thread. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-20 21:18:30 -05:00
Marek Olšák	ce7f84eb77	drirc: enable glthread for Talos Principle Ryzen 1700X, Vega 56, 1600x900, 4xAA: improvement +4.4% Immediate mode was needed. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-11-20 21:17:42 -05:00
Marek Olšák	7f1cac7ba6	mesa/glthread: enable immediate mode Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-11-20 21:17:41 -05:00
Marek Olšák	247d5a8e94	mesa/glthread: pass the function name to _mesa_glthread_restore_dispatch If you insert printf there, you'll know why glthread was disabled. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-11-20 21:17:38 -05:00
Marek Olšák	25d95ed535	gallium/u_tests: fix MSVC build by using old-style zero initializers	2018-11-20 19:06:40 -05:00
Kenneth Graunke	562448b75a	i965: Do NIR shader cloning in the caller. This moves nir_shader_clone() to the driver-specific compile function, rather than the shared src/intel/compiler code. This allows i965 to do key-specific passes before calling brw_compile_*. Vulkan should not need this cloning as it doesn't compile multiple variants. We do need to continue cloning in the compute shader code because we lower various things in NIR based on the SIMD width. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2018-11-20 15:53:46 -08:00
Kenneth Graunke	6a10dd08f4	i965: Use a 'nir' temporary rather than poking at brw_program It's shorter and will also be useful when I adjust cloning soon. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2018-11-20 15:53:46 -08:00
Marek Olšák	0d17b685b1	gallium/u_tests: add a compute shader test that clears an image	2018-11-20 18:50:48 -05:00
Dave Airlie	3486fe655a	ac: handle cast derefs Just give back the same value for now. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-21 08:54:46 +10:00
Dave Airlie	baa4bdd3a6	radv: handle loading from shared pointers We won't have a var to load from, so don't try to the processing required if we don't need it. This avoids crashes in: dEQP-VK.spirv_assembly.instruction.compute.variable_pointers.compute.workgroup_two_buffers Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-21 08:54:42 +10:00
Dave Airlie	ec9fe8abc7	ac: avoid casting pointers on bcsel and stores For variable pointers we really don't want to case the pointers to int without a good reason, just add a wrapper for bcsel loading and result storing. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-21 08:54:25 +10:00
Dylan Baker	a999798daa	meson: Add tests to suites Meson test has a concepts of suites, which allow tests to be grouped together. This allows for a subtest of tests to be run only (say only the tests for nir). A test can be added to more than one suite, but for the most part I've only added a test to a single suite, though I've added a compiler group that includes nir, glsl, and glcpp tests. To use this you'll need to invoke meson test directly, instead of ninja test (which always runs all targets). it can be invoked as: `meson test -C builddir --suite $suitename` (meson test has addition options that are pretty useful). Tested-By: Gert Wollny <gert.wollny@collabora.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-20 09:09:22 -08:00
Andrii Simiklit	b787dcf57b	i965/batch: avoid reverting batch buffer if saved state is an empty There's no point reverting to the last saved point if that save point is the empty batch, we will just repeat ourselves. v2: Merge with new commits, changes was minimized, added the 'fixes' tag v3: Added in to patch series v4: Fixed the regression which was introduced by this patch Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108630 Reported-by: Mark Janes <mark.a.janes@intel.com> The solution provided by: Jordan Justen <jordan.l.justen@intel.com> CC: Chris Wilson <chris@chris-wilson.co.uk> Fixes: `3faf56ffbd` "intel: Add an interface for saving/restoring the batchbuffer state." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107626 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108630 (fixed in v4) Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-20 06:33:43 -08:00
Emil Velikov	982e012b3a	travis: adding missing x11-xcb for meson+vulkan Required by the x11 WSI Fixes: `df82012b2c` ("travis: add meson build for vulkan drivers.") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-11-20 11:16:46 +00:00
Emil Velikov	5bc509363b	glx: make xf86vidmode mandatory for direct rendering Currently we detect the module and if missing, the glXGetMsc* API is effectively a stub, always returning false. This is what effectively has been happening with our meson build :-( Thus users have no chance of using it - they cannot even distinguish if the failure is due to a misconfigured build. There's no reason for keeping xf86vidmode optional - it has been available in all distributions for years. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Fixes: `a47c525f32` "meson: build glx"	2018-11-20 11:13:20 +00:00
Emil Velikov	84445a86d1	travis: drop unneeded x11proto-xf86vidmode-dev The only place where the package is needed is for building the DRI based libGL library. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-20 11:13:20 +00:00
Samuel Pitoiset	f4563d8f5b	ac/nir: fix intrinsic name string size in visit_image_atomic() Fixes an assertion in SoTTR. Fixes: `dd0172e865` ("radv: Use structured intrinsics instead of indexing workaround for GFX9.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-20 10:23:45 +01:00
Bas Nieuwenhuizen	dd0172e865	radv: Use structured intrinsics instead of indexing workaround for GFX9. These force the index to be used in the instruction so we don't need the workaround. Totals: SGPRS: 1321642 -> 1321802 (0.01 %) VGPRS: 943664 -> 943788 (0.01 %) Spilled SGPRs: 28468 -> 28480 (0.04 %) Spilled VGPRs: 88 -> 89 (1.14 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 80 -> 80 (0.00 %) dwords per thread Code Size: 52415292 -> 52338932 (-0.15 %) bytes LDS: 400 -> 400 (0.00 %) blocks Max Waves: 233903 -> 233803 (-0.04 %) Wait states: 0 -> 0 (0.00 %) Totals from affected shaders: SGPRS: 238344 -> 238504 (0.07 %) VGPRS: 232732 -> 232856 (0.05 %) Spilled SGPRs: 13125 -> 13137 (0.09 %) Spilled VGPRs: 88 -> 89 (1.14 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 80 -> 80 (0.00 %) dwords per thread Code Size: 15752712 -> 15676352 (-0.48 %) bytes LDS: 139 -> 139 (0.00 %) blocks Max Waves: 31680 -> 31580 (-0.32 %) Wait states: 0 -> 0 (0.00 %) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-11-19 23:36:00 +01:00
Kenneth Graunke	0990168642	i965: Allow only one slot of clip distances to be set on Gen4-5. The existing backend code assumed that if VARYING_SLOT_CLIP_DIST0 was written, then VARYING_SLOT_CLIP_DIST1 would be as well. That's true with the current lowering, but not necessary if there are 4 or fewer clip distances. Separate out the checks to allow this. The new NIR-based lowering will trigger this case, which would have caused backend validation errors (src is null) without this patch. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-19 14:33:16 -08:00
Kenneth Graunke	5b682143da	nir: Make nir_lower_clip_vs optionally work with variables. The way nir_lower_clip_vs() works with store_output intrinsics makes a ton of assumptions about the driver_location field. In i965 and iris, I'd rather do this lowering early and work with variables. v3d may want to switch to that as well, and ir3 could too, but I'm not sure exactly what would need updating. For now, handle both methods. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-19 14:33:16 -08:00
Kenneth Graunke	d0f746b645	nir: Save nir_variable pointers in nir_lower_clip_vs rather than locs. I'll want the variables in the next patch. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-19 14:33:16 -08:00
Kenneth Graunke	63c8696874	nir: Inline lower_clip_vs() into nir_lower_clip_vs(). It's now called exactly once, and there's not really any distinction. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-19 14:33:14 -08:00
Kenneth Graunke	bfa789aceb	nir: Use nir_shader_get_entrypoint in nir_lower_clip_vs(). Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-19 14:31:20 -08:00
Dave Airlie	c8a35285f0	nir: handle shared pointers in lowering indirect derefs. Check if the base ends up with no variable, and continue if we see that case outside the loop. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-20 05:36:52 +10:00
Dave Airlie	760859cac2	nir: move getting deref from var after we check deref type. I posted a load of hacks before to do this, Jason suggested this, just check the deref mode, not the variable mode and delay getting the variable until we know the type. avoids crashes when derefing shared memory pointers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-20 05:36:38 +10:00
Dave Airlie	2f4f5a5055	spirv/vtn: handle variable pointers without offset lowering Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-20 05:36:16 +10:00
Jason Ekstrand	dca35c598d	intel/fs,vec4: Fix a compiler warning ../src/intel/compiler/brw_fs_nir.cpp:3534:46: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘int’ [-Wsign-compare] assert(nir_intrinsic_write_mask(instr) == ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~ (1 << instr->num_components) - 1); ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ This was caused by `6339aba775` which added these completely valid checks. However clang likes to complain about signedness mismatches. Fixes: `6339aba775` "intel/compiler: Lower SSBO and shared..." Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2018-11-19 09:57:41 -06:00
Jason Ekstrand	060817b2fa	intel,nir: Move gl_LocalInvocationID lowering to nir_lower_system_values It's not at all intel-specific; the formula is dictated by OpenGL and Vulkan. The only intel-specific thing is that we need the lowering. As a nice side-effect, the new version is variable-group-size ready. Reviewed-by: Plamena Manolova <plamena.manolova@intel.com>	2018-11-19 09:57:41 -06:00
Eric Engestrom	486091bc00	gbm: add missing comma between strings Fixes: `d971a4230d` "loader: Factor out the common driver opening logic from each loader." Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-19 15:50:56 +00:00
Samuel Pitoiset	724107553c	radv: implement fast HTILE clears for depth or stencil only on GFX9 This allows to fast clear the depth part (or the stencil part) of a depth+stencil surface when HTILE is enabled. I didn't test on GFX8, so it's disabled currently. This gives a very nice boost, for example when clearing the depth aspect of a 4096x4096 D32_SFLOAT_S8_UINT image (18x faster). BEFORE: 235 us AFTER: 13 us Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-19 16:32:18 +01:00
Samuel Pitoiset	7dcddbe54d	radv: rewrite the condition that checks allowed depth/stencil values Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-19 16:32:16 +01:00
Samuel Pitoiset	9133bbf186	radv: check allowed fast HTILE clears a bit earlier Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-19 16:32:14 +01:00
Samuel Pitoiset	193ad4748b	radv: add radv_is_fast_clear_{depth,stencil}_allowed() helpers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-19 16:32:12 +01:00
Samuel Pitoiset	c7e142ed78	radv: add radv_get_htile_fast_clear_value() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-19 16:32:10 +01:00
Samuel Pitoiset	6f3fbcc041	radv: remove unnecessary goto in the fast clear paths Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-19 16:32:08 +01:00
Samuel Pitoiset	36006e3cec	radv/winsys: remove the max IBs per submit limit for the sysmem path This path will be eventually improved later but as it's only used on SI (or with RADV_DEBUG=noibs), I'm not sure if that matters much. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-19 16:32:06 +01:00
Samuel Pitoiset	4d30f2c6f4	radv/winsys: remove the max IBs per submit limit for the fallback path The chained submission is the fastest path and it should now be used more often than before. This removes some EOP events. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-19 16:32:04 +01:00
Lucas Stach	8ca8a6a7b1	etnaviv: use dummy RT buffer when rendering without color buffer At least GC2000 seems to push some dirt from the PE color cache into the last bound render target when drawing depth only. Newer cores seem to behave properly and don't do this, but I have found no way to fix it on GC2000. Flushes and stalls don't seem to make any difference. In order to stop the core from pushing the dirt into a precious real render target, plug in dummy buffer when rendering without a color buffer. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de>	2018-11-19 15:48:10 +01:00
Dave Airlie	8706204074	virgl: fix vtest regression since fencing changes. The in_fence_fd needs to be initialised to -1. Fixes: `d1a1c21e7` (virgl: native fence fd support) Reviewed-by: Robert Foss <robert.foss@collabora.com>	2018-11-19 15:33:19 +01:00
Samuel Pitoiset	55c75d2b49	radv: always clear the FCE predicate after DCC/FMASK/CMASK decompressions DCC and FMASK also imply a fast-clear eliminate, so it should be safe to reset the predicate unconditionally. We still only skip FMASK or CMASK decompressions for now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-19 14:05:35 +01:00
Samuel Pitoiset	483a28bfd4	radv: tidy up radv_set_dcc_need_cmask_elim_pred() This is just a small cleanup. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-19 14:05:33 +01:00
Nicolai Hähnle	46a59ce026	radeonsi: fix an out-of-bounds read reported by ASAN We read 4 values out of sample_locs_8x, so make sure the array is big enough. Fixes: `ac76aeef20` ("radeonsi: switch back to standard DX sample positions") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-11-19 11:16:35 +01:00
Gert Wollny	d174cbccfa	r600: Only set context streamout strides info from the shader that has outputs With 5d517a streamout info is only attached to the shader for which the transform feedback is actually recorded, but the driver set the context info with each state submitted, thereby always using the info data that was attached to the vertex shader. Pass the streamout stride info to the context only from the shader that actually has outputs. (Thanks to Marek Olšák for pointing me in the right direction) Fixes regresion with: dEQP-GLES31.functional.tessellation.invariance.* Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108734 Fixes: `5d517a599b` st/mesa: Don't record garbage streamout information in the non-SSO case. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-19 11:06:56 +01:00
Gert Wollny	18a8e11aea	i965:use FRAMEBUFFER_UNSUPPORTED instead of FRAMEBUFFER_INCOMPLETE_DIMENSIONS FRAMEBUFFER_INCOMPLETE_DIMENSIONS is not supported for GLES 3.0 and later and not defined for Desktop OpenGL. Instead use FRAMEBUFFER_UNSUPPORTED like it was done before. Thanks to Iago Toral and Andrey Simiklit for pointing out the problem and the details. Fixes: `ebcde34545` i965: be more specific about FBO completeness errors Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-11-19 11:06:52 +01:00
Gert Wollny	40eca7d3e1	virgl: Use file descriptor instead of un-allocated object The structure qdws is not allocated at this point, nor is the file descriptor set to it's member. Use the fd directly instead. Fixes: `d1a1c21e76` virgl: native fence fd support Signed-off-by: Gert Wollny <gert.wollny@collabora.com>	2018-11-19 11:03:56 +01:00
Gert Wollny	78fdc507a3	i965: Add support for and expose EXT_texture_sRGB_R8 Emulate MESA_FORMAT_R_SRGB8 by using L8_UNORM_SRGB. This is possible because component swizzling is handled based on the mesa format and, hence, the a r001 swizzling can be used to correct the components. Enables and makes pass (tested on Kabylake) dEQP-GLES31.functional.srgb_texture_decode.skip_decode.sr8.* dEQP-GLES31.functional.texture.filtering.cube_array.formats.sr8* Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-19 08:05:44 +01:00
Gert Wollny	c5363869d4	i965: Force zero swizzles for unused components in GL_RED and GL_RG This makes it possible to use a hardware luminance format as RED format. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-19 08:05:44 +01:00
Gert Wollny	ebcde34545	i965: be more specific about FBO completeness errors The driver was returning GL_FRAMEBUFFER_UNSUPPORTED for all cases of an incomplete fbo, be a bit more specific about this following the description of glCheckFramebufferStatus. This helps to keeps dEQP happy when adding EXT_texture_sRGB_R8 support. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-19 08:05:44 +01:00
Gert Wollny	24a02157dd	i965: Correct L8_UNORM_SRGB table entry As the name says, the format is an sRGB format. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-19 08:05:44 +01:00
Robert Foss	70692adf48	virgl: Clean up fences commit Remove a dead variable, a int->bool conversion and some whitespace changes. Signed-off-by: Robert Foss <robert.foss@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-18 12:14:55 +01:00
Kenneth Graunke	c2e3d0f163	i915: Delete swizzling detection logic. This is all leftover from the i965 split. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-17 10:26:31 -08:00
Ilia Mirkin	beb66d3747	nv50/ir/ra: enforce max register requirement, and change spill order On nv50, certain operations must happen on regs below 64, due to encoding requirements. First of all, we add infrastructure to enforce this. Secondly we change the spill order to first spill RIG nodes that are unconstrained, followed by ones that are. This makes the gamecube logo shadertoy compile properly. Curiously, if we adjust the spill order so that we first spill the constrained RIG nodes instead, the RA also succeeds. However it seems more logical to first spill the unconstrained ones. While we're at it, drop the nv50 max register to reserve r127 as the zero register of last resort (r63 is preferred). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Karol Herbst <kherbst@redhat.com>	2018-11-16 22:43:52 -05:00
Ilia Mirkin	799e021894	nv50/ir/ra: improve condition for short regs, unify with cond for 16-bit Instead of the size restriction existing in two places, and potentially being applied twice, we move this together. Ops with 16-bit register addresses can only take a short reg, and ops with immediates can only take a short reg. Of course we leave the immediate 0 in place since we know that it will be replaced by r63/r127 down the line, so don't treat zeroes as an immediate. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-11-16 20:53:33 -05:00
Ilia Mirkin	955d943c33	nv50/ir: delete MINMAX instruction that is no longer in the BB We removed the op from the BB, but it was still listed in its sources' uses. This could trip up some logic down the line which analyzes all the uses of an l-value, e.g. spilling. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-11-16 20:53:09 -05:00
Eric Anholt	7e9fc11ff8	egl: Print the actual message to the console from _eglError(). Previously we would print errors on the console like: libEGL debug: EGL user error 0x3001 (EGL_NOT_INITIALIZED) in eglInitialize When we had everything we needed for: libEGL debug: EGL user error 0x3001 (EGL_NOT_INITIALIZED) in eglInitialize: DRI2: failed to find EGLDevice (for a gbm error in my case) Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-16 17:49:31 -08:00
Eric Anholt	d971a4230d	loader: Factor out the common driver opening logic from each loader. I copied the code from egl_dri2.c, but the functionality was equivalent between all the loaders other than their particular environment variables. v2: Drop the logging function equivalent to loader_default_logger() (requested by Eric, Emil). Move the SCons workaround across. Drop the now-unused driGetDriverExtensions() declaration that was lost in a rebase. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (v1) Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (v1)	2018-11-16 17:49:17 -08:00
Eric Anholt	cc19815738	loader: Stop using a local definition for an in-tree header I need other types from the header now, and "gl.h is big" is not a good reason to duplicate definitions. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-16 15:38:18 -08:00
Eric Anholt	2bc1f5c2e7	egl: Move loader_set_logger() up to egl_dri2.c. Everyone needs to call it, and platform_x11 forgot to. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-16 15:38:18 -08:00
Eric Anholt	c2b515379b	glx: Move DRI extensions pointer loading to driOpenDriver(). The only thing you do with a dri driver handle is get the extensions pointer, so just fold it in to simplify the callers. v2: Add the declaration of driGetDriverExtensions() that got lost in a rebase. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (v1) Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (v1)	2018-11-16 15:38:18 -08:00
Eric Anholt	7076e9f116	glx: Remove an old DEFAULT_DRIVER_DIR default. You can tell by "Mesa/configs/default" how old this is. Your build system really has to provide the DEFAULT_DRIVER_DIR, or other loaders will break. v2: Move the bad (non-prefix-dependent) define to the SConscript to avoid breaking it. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (v1) Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (v1)	2018-11-16 15:37:47 -08:00
Samuel Pitoiset	d031d5c999	radv: enable primitive binning by default After doing a bunch of benchmarks, primitive binning helps some games like The Talos Principle (+5%) or Serious Sam 2017 (+3%). For other titles, either it doesn't change anything or it hurts very few (less than 1%). This only affects GFX9. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-16 17:51:15 +01:00
Samuel Pitoiset	afd834b62e	radv: add a debug option for disabling primitive binning Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-16 17:51:12 +01:00
Robert Foss	d1a1c21e76	virgl: native fence fd support Following the support for fences on the virtio driver add support for native fence on virgl. This was somewhat based on the freedeno one. Signed-off-by: Gustavo Padovan <gustavo.padovan@collabora.com> Signed-off-by: Robert Foss <robert.foss@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-16 14:41:57 +01:00
Lionel Landwerlin	0db898cef2	intel/aub_viewer: Print blend states properly Identical fix to : commit `70de31d0c1` Author: Jason Ekstrand <jason.ekstrand@intel.com> Date: Fri Aug 24 16:05:08 2018 -0500 intel/batch_decoder: Print blend states properly Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Toni Lönnberg <toni.lonnberg@intel.com>	2018-11-16 11:40:38 +00:00
Lionel Landwerlin	ac324a6809	intel/aub_viewer: fix dynamic state printing Identical fix to : commit `cbd4bc1346` Author: Jason Ekstrand <jason.ekstrand@intel.com> Date: Fri Aug 24 16:04:03 2018 -0500 intel/batch_decoder: Fix dynamic state printing Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Toni Lönnberg <toni.lonnberg@intel.com>	2018-11-16 11:40:14 +00:00
Lionel Landwerlin	59c1059528	intel/aubinator: fix ring buffer pointer We can only start parsing commands from the head pointer. This was working fine up to now because we only dealt with a "made up" ring buffer (generated by aub_write) which always had its head at 0. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Toni Lönnberg <toni.lonnberg@intel.com>	2018-11-16 11:39:54 +00:00
Lionel Landwerlin	25443cbb72	intel/decoders: read ring buffer length Use this value to limit reading the ring buffer. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Toni Lönnberg <toni.lonnberg@intel.com>	2018-11-16 11:37:08 +00:00
Lionel Landwerlin	1c56d21156	egl/dri: fix error value with unknown drm format According to the EGL_EXT_image_dma_buf_import spec, creating an EGL image with a DRM format not supported should yield the BAD_MATCH error : " * If <target> is EGL_LINUX_DMA_BUF_EXT, and the EGL_LINUX_DRM_FOURCC_EXT attribute is set to a format not supported by the EGL, EGL_BAD_MATCH is generated. " Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `20de7f9f22` ("egl/dri2: support for creating images out of dma buffers") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2018-11-16 10:28:06 +00:00
Daniel Stone	5e1fe240c4	gbm: Clarify acceptable formats for gbm_bo gbm_bo_create() was presumably meant to originally accept gbm_bo_format enums, but it's accepted GBM_FORMAT_* tokens since the dawn of time. This is good, since gbm_bo_format is rarely used and covers a lot less ground than GBM_FORMAT_*. Change the documentation to refer to both; this involves removing a 'see also' for gbm_bo_format, since we can't also use \sa to refer to a family of anonymous #defines. Signed-off-by: Daniel Stone <daniels@collabora.com> Reported-by: Pekka Paalanen <pekka.paalanen@collabora.co.uk> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-16 09:40:46 +00:00
Connor Abbott	ba94a00c7c	Revert "radv: disable VK_SUBGROUP_FEATURE_VOTE_BIT" This reverts commit `647c2b90e9`. There was one recently-introduced bug in ac for dvec3 loads, but the other test failures were actually bugs in the tests. See `9429e621c4` Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-16 10:32:03 +01:00
Eric Anholt	cc71bf529c	vc4: Don't return a vc4 BO handle on a renderonly screen. The handles exported need to be on the KMS device's fd, anything else is failure. Also, this code is assuming that the scanout resource has been created already, so assert it.	2018-11-15 21:11:44 -08:00
Eric Anholt	cc0bc76a38	vc4: Make sure we make ro scanout resources for create_with_modifiers. The DRI3 create_with_modifiers paths don't set tmpl.bind to SCANOUT or SHARED, with the theory that given that you've got modifiers, that's all you need. However, we were looking at the tmpl.bind for setting up the KMS handle in the renderonly case, so we'd end up trying to use vc4's handle on the hx8357d fd. Fixes: `84ed8b67c5` ("vc4: Set shareable BOs as T tiled if possible")	2018-11-15 21:11:44 -08:00
Danylo Piliaiev	f9fd0cf479	i965: Fix calculation of layers array length for isl_view Handle all cases in calculation of layers count for isl_view taking into account texture view and image unit. st_convert_image was taken as a reference. When u->Layered is true the whole level is taken with respect to image view. In other case only one layer is taken. v3: (Józef Kucia and Ilia Mirkin) - Rewrote patch by taking st_convert_image as a reference - Removed now unused get_image_num_layers function - Changed commit message v4: (Jason Ekstrand) - Added assert Fixes: `5a8c8903` Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107856 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-15 19:59:54 -06:00
Jason Ekstrand	6339aba775	intel/compiler: Lower SSBO and shared loads/stores in NIR We have a bunch of code to do this in the back-end compiler but it's fairly specific to typed surface messages and the way we emit them. This breaks it out into NIR were it's easier to do things a bit more generally. It also means we can easily share the code between the vec4 and FS back-ends if we wish. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-15 19:59:49 -06:00
Jason Ekstrand	d34fd81e76	nir: Add alignment parameters to SSBO, UBO, and shared access This also changes spirv_to_nir and glsl_to_nir to set them. The one place that doesn't set them is shared memory access lowering in nir_lower_io. That will have to be updated before any consumers of it can effectively use these new alignments. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Acked-by: Karol Herbst <kherbst@redhat.com>	2018-11-15 19:59:42 -06:00
Jason Ekstrand	fb127f7729	nir/lower_io: Add shared to get_io_offset_src Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-15 19:59:31 -06:00
Jason Ekstrand	b5c48271d4	nir/glsl: Force 32-bit for UBO and SSBO Booleans Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-15 19:59:30 -06:00
Jason Ekstrand	44b7005581	nir/spirv: Force 32-bit for UBO and SSBO Booleans Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-15 19:59:29 -06:00
Jason Ekstrand	f16bd8a9fe	nir/builder: Add a nir_pack/unpack/bitcast helpers The new helpers can generate any pack/unpack operation including those for which we do not have specific opcodes and they express a bitcast in terms of these pack/unpack operations. In particular, the new helpers properly handle 8-bit types. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-15 19:59:28 -06:00
Jason Ekstrand	b77d68b78e	nir/builder: Add iadd_imm and imul_imm helpers The pattern of adding or multiplying an integer by an immediate is fairly common especially in deref chain handling. This adds a helper for it and uses it a few places. The advantage to the helper is that it automatically handles bit sizes for you. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-11-15 19:59:27 -06:00
Jason Ekstrand	1f29f4db1e	nir/builder: Assert that intN_t immediates fit This assert won't catch all mistakes with this helper but it will at least ensure that the top bits are all zero or all one which should help catch bugs. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-15 19:59:26 -06:00
Jason Ekstrand	4266932c0b	nir/lower_alu_to_scalar: Don't try to lower unpack_32_2x16 It messes up when trying to lower. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-15 19:59:09 -06:00
Ian Romanick	425c133ab9	glsl: Refactor type checking for redeclarations Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-11-15 14:27:32 -08:00
Ian Romanick	61e003ce7e	glsl: Omit redundant qualifier checks on redeclarations Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-11-15 14:27:29 -08:00
Ian Romanick	9b9f3218db	glsl: prevent qualifiers modification of predeclared variables Section 3.7 (Identifiers) of the GLSL spec says: However, as noted in the specification, there are some cases where previously declared variables can be redeclared to change or add some property, and predeclared "gl_" names are allowed to be redeclared in a shader only for these specific purposes. More generally, it is an error to redeclare a variable, including those starting "gl_". This patch should fix piglit tests: clip-distance-redeclare-without-inout.frag clip-distance-redeclare-without-inout.vert However, this causes a regression in clip-distance-out-values.shader_test. A fix for that test has been sent to the piglit list for review: https://patchwork.freedesktop.org/patch/255201/ As far as I understood following mailing thread: https://lists.freedesktop.org/archives/piglit/2013-October/007935.html looks like we have accepted to remove an ability to change qualifiers but have not done it yet. Unless I missed something) v2 (idr): Move 'earlier->data.mode != var->data.mode' test much earlier in the function. Add special handling for gl_LastFragData. Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-11-15 14:27:26 -08:00
Eric Anholt	538bca78e2	v3d: Don't try to set PF flags on a LDTMU operation We need an ALU op in order to set PF. Fixes a recent assertion failure in dEQP-GLES3.functional.ubo.single_basic_type.shared.bool_vertex	2018-11-15 11:12:54 -08:00
Eric Anholt	03928dd682	v3d: Fix double-swapping of R/B on V3D 4.1 Fixes: `4018eb04e8` ("v3d: Use the TLB R/B swapping instead of recompiles when available.")	2018-11-15 11:12:54 -08:00
Eric Engestrom	2b2f790e59	egl: fix bad rebase I screwed up a rebase over a refactor and didn't notice locally because the uncommitted refactor hid the issue. Fixes: `c973364967` "egl: add missing glvnd entrypoint for EGL_ANDROID_blob_cache" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-15 17:51:40 +00:00
Sagar Ghuge	6e60ff1ea9	intel/compiler: Disassemble GEN6_SFID_DATAPORT_SAMPLER_CACHE as dp_sampler Both BRW_SFID_SAMPLER and GEN6_SFID_DATAPORT_SAMPLER_CACHE are getting disassembled as "sampler", which is misleading for assembler tool. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2018-11-15 09:36:55 -08:00
Eric Engestrom	c973364967	egl: add missing glvnd entrypoint for EGL_ANDROID_blob_cache Fixes dEQP-EGL.functional.get_proc_address.extension.egl_android_blob_cache on builds with glvnd enabled. Fixes: `6f5b57093b` "egl: add support for EGL_ANDROID_blob_cache" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 16:27:27 +00:00
Eric Engestrom	2640854399	gbm: add new entrypoint to symbols check Fixes: `6328536ff2` "gbm: Introduce a helper function for printing GBM format names." Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-15 16:25:42 +00:00
Emil Velikov	adbdfc6666	bin/get-pick-list.sh: handle reverts prior to the branchpoint Currently we detect when a breaking commit: - has landed in stable, and - is referenced by a untagged fix in master Yet we did not consider the case of breaking commit: - prior to the branchpoint, and - is referenced by a untagged fix in master Addressing the latter is extremely slow, due to the size of the lookup. That said, we can trivially use the existing is_sha_nomination() helper to catch reverts. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 16:15:15 +00:00
Emil Velikov	c0012a0708	bin/get-pick-list.sh: use test instead of [ ] Latter is rather picky wrt surrounding white space. The explicit `test` doesn't have that problem, plus the statements read a bit easier. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 15:55:51 +00:00
Emil Velikov	77ff0bfb5f	bin/get-pick-list.sh: handle unofficial "broken by" tag We have a number of cases were devs will use a tag "broken by". While it's not something officially documented or recommended, checking for it is trivial enough. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 15:55:47 +00:00
Emil Velikov	209525aafb	bin/get-pick-list.sh: handle fixes tag with missing colon Every so often, we forget to add the colon after "fixes". Trivially tweak the script to catch it. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 15:55:44 +00:00
Emil Velikov	b7418d1f3f	bin/get-pick-list.sh: flesh out is_sha_nomination Refactor is_fixes_nomination into a is_sha_nomination helper. This way we can reuse it for more than the usual "Fixes:" tag. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 15:55:40 +00:00
Emil Velikov	533fead423	bin/get-pick-list.sh: tweak the commit sha matching pattern Currently we match on: - any arbitrary length of, - any a-z A-Z and 0-9 characters At the same time, a commit sha consists of lowercase hexadecimal numbers. Any sha shorter than 8 characters is ambiguous - in some cases even 11+ are required. So change the pattern to a-f0-9 and adjust the length to 8-40. As we're here we could use a single grep, instead of the grep/sed combo. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 15:55:36 +00:00
Emil Velikov	181203f3c5	bin/get-pick-list.sh: handle the fixes tag Having a separate script to handle the fixes tag, brings a number of issues, so let's fold it in get-pick-list.sh. v2: - pass the sha as argument to the function - Keep original sed pattern Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 15:55:31 +00:00
Emil Velikov	e6b3a3b201	bin/get-pick-list.sh: handle "typod" usecase. As the comment in get-typod-pick-list.sh says, there's little point in having a duplicate file. Add the new pattern + tag to get-pick-list.sh and nuke this file. v2: - pass the sha as argument to the function - grep -q instead of using a variable (Eric) Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 15:55:24 +00:00
Emil Velikov	fac10169bb	bin/get-pick-list.sh: prefix output with "[stable] " With later commits we'll fold all the different scripts into one. Add the explicit prefix, so that we know the origin of the nomination v2: - pass the sha as argument to the function - swap $tag = none for an else statment (Juan) - grep -q instead of using a variable (Eric) - print the tag and commit oneline separately (Eric) v3: - drop unused "tag=none" assignment (Juan) - typo nomination Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (v2) Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 15:54:48 +00:00
Emil Velikov	559c32d241	bin/get-pick-list.sh: simplify git oneline printing Currently we force disable the pager via "\|cat" where --no-pager exists. Additionally we could use git show instead of git log -n1. Use those for a slightly more understandable code. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-15 15:51:24 +00:00
Emil Velikov	7d9556681d	docs: document the staging branch and add reference to it A while back we agreed that having a live/staging branch is beneficial. Sadly we forgot to document that, so here is my first attempt. Document the caveat that the branch history is not stable. CC: Andres Gomez <agomez@igalia.com> CC: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-11-15 15:48:15 +00:00
Emil Velikov	4ae749acf1	docs/submittingpatches.html: correctly handle the <p> tag As pointed out by the w3c validator. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-11-15 15:48:13 +00:00
Emil Velikov	19a081473f	docs/releasing.html: polish cherry-picking/testing text Reword slightly and highlight the important parts of the text. CC: Andres Gomez <agomez@igalia.com> CC: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-11-15 15:48:08 +00:00
Guido Günther	ab5653680e	etnaviv: Make sure rs alignment checks match etna_resource_alloc and etna_resource_from_handle currently use different checks. This leads to etna_resource_from_handle:492: target=2, format=PIPE_FORMAT_B8G8R8X8_UNORM, 1080x1920x1, array_size=1, last_level=0, nr_samples=0, usage=0, bind=8000a, flags=0 etna_resource_from_handle:541: BO stride 4320 is too small for RS engine width padding (4352, format PIPE_FORMAT_B8G8R8X8_UNORM) since etna_resource_from_handle wants to be aligned to a 16 byte boundary while the etna_resource_alloc does not. Adjust the two checks by using a common function. Broken by `baff59ebf0` Signed-off-by: Guido Günther <guido.gunther@puri.sm> Signed-off-by: Lucas Stach <l.stach@pengutronix.de>	2018-11-15 16:38:35 +01:00
Juan A. Suarez Romero	52368ef83a	docs: update calendar, add news item and link release notes for 18.2.5 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-11-15 13:08:58 +00:00
Juan A. Suarez Romero	aa7a419b8b	docs: add sha256 checksums for 18.2.5 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit 79be754f9a74a43b5748dc0934241e7701cb9581)	2018-11-15 13:06:12 +00:00
Juan A. Suarez Romero	e53ec08931	docs: add release notes for 18.2.5 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `f34bddc325`)	2018-11-15 13:06:10 +00:00
Marek Olšák	9367514524	radeonsi: fix video APIs on Raven2 This was missed when I added the new enum. Cc: 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-11-14 17:08:34 -05:00
Andrii Simiklit	e13dd70581	i965: avoid 'unused variable' warnings 1. brw_pipe_control.c:311:34: warning: unused variable ‘devinfo’ 2. brw_program_binary.c:209:19: warning: unused variable ‘gen_size’ 3. brw_program_binary.c:216:19: warning: unused variable ‘nir_size’ v2: Changes for unreproducible issues were removed Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-14 14:41:58 +00:00
Andrii Simiklit	7aca650122	compiler: avoid 'unused variable' warnings 1. nir/nir_lower_vars_to_ssa.c:691:21: warning: unused variable ‘var’ nir_variable *var = path->path[0]->var; v2: Changes for some part of 'may be used uninitialized' warnings were removed, seems like it is a compiler issue. ( Eric Engestrom <eric.engestrom@intel.com> ) Possible like this one: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=46684 This issue is flagged as duplicate but an original one is not closed yet. Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-14 13:35:38 +00:00
Andrii Simiklit	69ee49ac46	intel/tools: avoid 'unused variable' warnings 1. tools/aub_read.c:271:31: warning: unused variable ‘end’ const uint32_t p = data, end = data + data_len, next; 2. tools/aub_mem.c:292:13: warning: unused variable ‘res’ void res = mmap((uint8_t )bo.map + map_offset, 4096, PROT_READ, tools/aub_mem.c:357:13: warning: unused variable ‘res’ void res = mmap((uint8_t *)bo.map + (page - bo.addr), 4096, PROT_READ, v2: The i965_disasm.c changes was moved into a separate patch The 'end' variable declared separately with MAYBE_UNUSED to avoid effect of it to other variables. ( Eric Engestrom <eric.engestrom@intel.com> ) Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-14 13:35:28 +00:00
Thomas Hellstrom	25b48e3df9	st/xa: Bump minor Bump minor to signal support for new formats and higher precision solid pictures. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-11-14 13:12:09 +01:00
Thomas Hellstrom	c9085f6d3b	st/xa: Support Component Alpha with trivial blending Support Component Alpha for those composite operations that do not require per-channel alpha blending. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com>	2018-11-14 13:12:09 +01:00
Thomas Hellstrom	0477d17f51	st/xa: Minor renderer cleanups constify function arguments to clean up the code a bit. Reported-by: Brian Paul <brianp@vmware.com> Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com>	2018-11-14 13:12:09 +01:00
Thomas Hellstrom	56aa23b146	st/xa: Fix transformations when we have both source and mask samplers In the case when we had both source and mask samplers, transformations were typically not applied correctly. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com>	2018-11-14 13:12:09 +01:00
Thomas Hellstrom	e1298def9f	st/xa: Support a couple of new formats Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-11-14 13:12:09 +01:00
Thomas Hellstrom	258d20152a	st/xa: Support higher color precision for solid pictures The only solid fill picture type we supported only had 8 bit color channels. Add a new solid picture type that supports float channels. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-11-14 13:11:51 +01:00
Thomas Hellstrom	d86ad38205	st/xa: Render update. Better support for solid pictures Remove unused and obsolete code for gradients and component-alpha Support solid source- and mask pictures using a variable number of samplers in the composite pipeline rather than the fixed number we used before. Tested using rendercheck for XA. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-11-14 13:07:00 +01:00
Gert Wollny	4bba280937	nir: Allow to skip integer ops in nir_lower_to_source_mods Some hardware supports source mods only for float operations. Make it possible to skip lowering to source mods in these cases. v2: use option flags instead of a boolean (Jason Ekstrand) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-14 08:59:26 +01:00
Karol Herbst	b4380cb070	nir/spirv: cast shift operand to u32 v2: fix for specialization constants as well Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Karol Herbst <kherbst@redhat.com>	2018-11-14 02:09:11 +01:00
Karol Herbst	099728b115	nir: replace nir_load_system_value calls with appropiate builder functions this helps reduce the overall code changes when a bit_size parameter is added to nir_load_system_value Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Karol Herbst <kherbst@redhat.com>	2018-11-14 02:09:11 +01:00
Karol Herbst	80db331c2d	nir: add const_index parameters to system value builder function this allows to replace some nir_load_system_value calls with the specific system value constructor Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Karol Herbst <kherbst@redhat.com>	2018-11-14 02:09:11 +01:00
Timothy Arceri	95b513c937	radv: make use of nir_move_out_const_to_consumer() vkpipeline-db results: Totals from affected shaders: SGPRS: 28400 -> 28576 (0.62 %) VGPRS: 27916 -> 27692 (-0.80 %) Spilled SGPRs: 140 -> 138 (-1.43 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 1534456 -> 1520560 (-0.91 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 3541 -> 3582 (1.16 %) Wait states: 0 -> 0 (0.00 %) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-11-14 09:41:50 +11:00
Lionel Landwerlin	ea53f76d7b	anv: move helper function internally It's only used in anv_image.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-13 18:56:31 +00:00
Lionel Landwerlin	8b00d3d6eb	anv: use image aspects rather than computed ones This shouldn't make any difference but I feel uneasy to use the expanded aspects that do not represent the image in its entirety. If we ever change the implementation of the anv_image_aspect_to_plane() helper, this is safer. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-13 18:56:27 +00:00
Lionel Landwerlin	465de47bad	anv: associate vulkan formats with aspects This will make it easier to associate an aspect with a plane number. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-13 18:56:24 +00:00
Lionel Landwerlin	fe3b7fe982	anv/lower_ycbcr: make sure to set 0s on all components To play around with debugging, we might want to disable one or the other component. Having 0s as default values makes this work. Otherwise we might have NULL components, leading to crashes. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-13 18:56:21 +00:00
Lionel Landwerlin	ee8d65c25a	anv/image: remove unused parameter Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-13 18:56:13 +00:00
Lionel Landwerlin	352e297091	anv: simplify internal address offset Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-13 18:56:10 +00:00
Eric Engestrom	4fa2fb3524	meson: fix wayland-less builds Those empty variables in the !wayland case are useless and running that meson.build with them breaks the build: [287/850] Generating wayland-drm-client-protocol.h with a custom command. FAILED: src/egl/wayland/wayland-drm/wayland-drm-client-protocol.h client-header ../src/egl/wayland/wayland-drm/wayland-drm.xml src/egl/wayland/wayland-drm/wayland-drm-client-protocol.h /bin/sh: client-header: command not found ninja: build stopped: subcommand failed. Fixes: `d1992255bb` "meson: Add build Intel "anv" vulkan driver" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-11-13 17:25:02 +00:00
Eric Engestrom	7df80de6e6	gbm: remove unnecessary meson include `inc_wayland_drm` is only used if wayland is built, and it's already added in that case a few lines below. Fixes: `a29869e872` "gbm: Don't traverse backwards for includes" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-11-13 17:25:02 +00:00
Eric Engestrom	3832db275e	meson: only run vulkan's meson.build when building vulkan Fixes: `d1992255bb` "meson: Add build Intel "anv" vulkan driver" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-11-13 17:25:02 +00:00
Eric Engestrom	4f1ae271e1	xmlpool: update translation po files These files are close to 4 years out of date; a lot's changed since. Let's just check in a recently-regenerated version. Changes generated by running `ninja xmlpool-{pot,update-po,gmo}`. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-13 17:25:02 +00:00
Eric Engestrom	1e918e5bef	REVIEWERS: add Vulkan reviewer group Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Emil Velikov <emil.l.velikov@gmail.com>	2018-11-13 17:25:02 +00:00
Eric Engestrom	59b3335496	REVIEWERS: add Emil as EGL reviewer Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Emil Velikov <emil.l.velikov@gmail.com>	2018-11-13 17:25:02 +00:00
Eric Engestrom	923aca84b2	REVIEWERS: add include path for EGL Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Emil Velikov <emil.l.velikov@gmail.com>	2018-11-13 17:25:02 +00:00
Toni Lönnberg	2af4e3345f	intel/genxml: Add engine definition to render engine instructions (gen11) Instructions meant for the render engine now have a definition specifying that so that can differentiate instructions meant for different engines due to shared opcodes. v2: Divided into individual patches for each gen v3: Added additional engine definitions. v4: Added missing engine definition to MI_TOPOLOGY_FILTER. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	1921982d3e	intel/genxml: Add engine definition to render engine instructions (gen10) Instructions meant for the render engine now have a definition specifying that so that can differentiate instructions meant for different engines due to shared opcodes. v2: Divided into individual patches for each gen v3: Added additional engine definitions. v4: Added missing engine definition to MI_TOPOLOGY_FILTER. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	030fe0f981	intel/genxml: Add engine definition to render engine instructions (gen9) Instructions meant for the render engine now have a definition specifying that so that can differentiate instructions meant for different engines due to shared opcodes. v2: Divided into individual patches for each gen v3: Added additional engine definitions. v4: Added more missing engine definitions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	12e34fc7ba	intel/genxml: Add engine definition to render engine instructions (gen8) Instructions meant for the render engine now have a definition specifying that so that can differentiate instructions meant for different engines due to shared opcodes. v2: Divided into individual patches for each gen v3: Added additional engine definitions. v4: Added missing engine tag for MI_TOPOLOGY_FILTER and MI_LOAD_URB_MEM. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	a883fd2277	intel/genxml: Add engine definition to render engine instructions (gen75) Instructions meant for the render engine now have a definition specifying that so that can differentiate instructions meant for different engines due to shared opcodes. v2: Divided into individual patches for each gen v3: Added additional engine definitions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	27cf6252d3	intel/genxml: Add engine definition to render engine instructions (gen7) Instructions meant for the render engine now have a definition specifying that so that can differentiate instructions meant for different engines due to shared opcodes. v2: Divided into individual patches for each gen v3: Added additional engine definitions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	ecf62a967e	intel/genxml: Add engine definition to render engine instructions (gen6) Instructions meant for the render engine now have a definition specifying that so that can differentiate instructions meant for different engines due to shared opcodes. v2: Divided into individual patches for each gen v3: Added additional engine definitions v4: Added missing engine to MEDIA_GATEWAY_STATE Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	571d6447d8	intel/genxml: Add engine definition to render engine instructions (gen5) Instructions meant for the render engine now have a definition specifying that so that can differentiate instructions meant for different engines due to shared opcodes. v2: Divided into individual patches for each gen v3: Added additional engine definitions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	6463ceca69	intel/genxml: Add engine definition to render engine instructions (gen45) Instructions meant for the render engine now have a definition specifying that so that can differentiate instructions meant for different engines due to shared opcodes. v2: Divided into individual patches for each gen v3: Added addition engine definitions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	a4ca710c96	intel/genxml: Add engine definition to render engine instructions (gen4) Instructions meant for the render engine now have a definition specifying that so that can differentiate instructions meant for different engines due to shared opcodes. v2: Divided into individual patches for each gen v3: Added additional engine definitions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	102dadec81	intel/decoder: tools: Use engine for decoding batch instructions The engine to which the batch was sent to is now set to the decoder context when decoding the batch. This is needed so that we can distinguish between instructions as the render and video pipe share some of the instruction opcodes. v2: The engine is now in the decoder context and the batch decoder uses a local function for finding the instruction for an engine. v3: Spec uses engine_mask now instead of engine, replaced engine class enums with the definitions from UAPI. v4: Fix up aubinator_viewer (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	a6aab7e436	intel/decoder: tools: gen_engine to drm_i915_gem_engine_class Removed the gen_engine enum and changed the involved functions to use the drm_i915_gem_engine_class enum from UAPI instead. v3: Wrong engine was being used for blocks in video ring v4: Fixed aubinator_viewer.cpp Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Toni Lönnberg	b00bccd012	intel/decoder: Engine parameter for instructions Preliminary work for adding handling of different pipes to gen_decoder. Each instruction needs to have a definition describing which engine it is meant for. If left undefined, by default, the instruction is defined for all engines. v2: Changed to use the engine class definitions from UAPI v3: Changed I915_ENGINE_CLASS_TO_MASK to use BITSET_BIT, change engine to engine_mask, added check for incorrect engine and added the possibility to define an instruction to multiple engines using the "\|" as a delimiter in the engine attribute. v4: Fixed the memory leak. v5: Removed an unnecessary ralloc_free(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-13 15:10:12 +00:00
Gert Wollny	8d4bb6e5cd	virgl: Add command and flags to initiate debugging on the host (v2) On the host VREND_DEBUG=guestallow must be set to let the guest override the debug flags. v2: Send flag string instead of flags, this avoids the need to keep the flags in sync. v3: Only request host logging if the host actually understands the command Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2018-11-13 14:42:22 +01:00
Gert Wollny	caa964b422	mesa: Reference count shaders that are used by transform feedback objects Transform feedback objects may hold a pointer to a shader program, and at least in Gallium, this must be a valid pointer until ctx->Driver.EndTransformFeedback in glEndTransformFeedback has been called - which is conform with the spec that any program that is part of a current rendering state should only be flagged for deletion by glDeleteProgram. This was not handled properly for the transform feedback objects so that a call sequence glUseProgram(x) glBeginTransformFreedback(...) glPauseTransformFeedback(...) glDeleteProgram(x) glEndTransformFeedback(...) would result in a use after free bug. With this patch the transform feedback object also updates the reference count to the used program thereby keeping the program valid as long as the transform feedback objects links to it. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108713 Fixes: `654587696b` mesa: add end_transform_feedback() helper Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-13 10:57:25 +01:00
Samuel Pitoiset	90d68858ed	radv: set optimal OVERWRITE_COMBINER_WATERMARK on GFX9 Ported from RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-13 10:24:36 +01:00
Samuel Pitoiset	f70c5d31cd	radv: set PA.SC_CONSERVATIVE_RASTERIZATION.NULL_SQUAD_AA_MASK_ENABLE Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-13 10:24:33 +01:00
Samuel Pitoiset	b5f213bb1d	radv: binding streamout buffers doesn't change context regs Cc: 18.3 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-13 10:24:31 +01:00
Plamena Manolova	c5f3013cba	nir: Don't lower the local work group size if it's variable. If the local work group size is variable it won't be available at compile time so we can't lower it in nir_lower_system_values(). Signed-off-by: Plamena Manolova <plamena.n.manolova@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-11-13 10:57:04 +02:00
Matt Turner	efb1ccadca	util/ralloc: Make sizeof(linear_header) a multiple of 8 Prior to this patch sizeof(linear_header) was 20 bytes in a non-debug build on 32-bit platforms. We do some pointer arithmetic to calculate the next available location with ptr = (linear_size_chunk )((char )&latest[1] + latest->offset); in linear_alloc_child(). The &latest[1] adds 20 bytes, so an allocation would only be 4-byte aligned. On 32-bit SPARC a 'sttw' instruction (which stores a consecutive pair of 4-byte registers to memory) requires an 8-byte aligned address. Such an instruction is used to store to an 8-byte integer type, like intmax_t which is used in glcpp's expression_value_t struct. As a result of the 4-byte alignment returned by linear_alloc_child() we would generate a SIGBUS (unaligned exception) on SPARC. According to the GNU libc manual malloc() always returns memory that has at least an alignment of 8-bytes [1]. I think our allocator should do the same. So, simple fix with two parts: (1) Increase SUBALLOC_ALIGNMENT to 8 unconditionally. (2) Mark linear_header with an aligned attribute, which will cause its sizeof to be rounded up to that alignment. (We already do this for ralloc_header) With this done, all Mesa's unit tests now pass on SPARC. [1] https://www.gnu.org/software/libc/manual/html_node/Aligned-Memory-Blocks.html Fixes: `47e1758692` ("glcpp: use the linear allocator for most objects") Bug: https://bugs.gentoo.org/636326 Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-12 20:54:49 -08:00
Matt Turner	7e3748c268	util/ralloc: Switch from DEBUG to NDEBUG The debug code is all asserts, so protect it with the same thing that controls assert. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-12 20:54:49 -08:00
Timothy Arceri	34dffcf913	nir: add support for removing redundant stores to copy prop var For example the following type of thing is seen in TCS from a number of Vulkan and DXVK games: vec1 32 ssa_557 = deref_var &oPatch (shader_out float) vec1 32 ssa_558 = intrinsic load_deref (ssa_557) () vec1 32 ssa_559 = deref_var &oPatch@42 (shader_out float) vec1 32 ssa_560 = intrinsic load_deref (ssa_559) () vec1 32 ssa_561 = deref_var &oPatch@43 (shader_out float) vec1 32 ssa_562 = intrinsic load_deref (ssa_561) () intrinsic store_deref (ssa_557, ssa_558) (1) /* wrmask=x / intrinsic store_deref (ssa_559, ssa_560) (1) / wrmask=x / intrinsic store_deref (ssa_561, ssa_562) (1) / wrmask=x */ No shader-db changes on i965 (SKL). vkpipeline-db results RADV (VEGA): Totals from affected shaders: SGPRS: 7832 -> 7728 (-1.33 %) VGPRS: 6476 -> 6740 (4.08 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 469572 -> 456596 (-2.76 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 989 -> 960 (-2.93 %) Wait states: 0 -> 0 (0.00 %) The Max Waves and VGPRS changes here are misleading. What is happening is a bunch of TCS outputs are being optimised away as they are now recognised as unused. This results in more varyings being compacted via nir_compact_varyings() which can result in more register pressure when they are not packed in an optimal way. This is an existing problem independent of this patch. I've run some benchmarks and haven't noticed any performance regressions in affected games. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-13 15:19:36 +11:00
Timothy Arceri	3561108de0	anv/i965: make use of nir_link_constant_varyings() shader-db results for SLK: total instructions in shared programs: 13106498 -> 13091573 (-0.11%) instructions in affected programs: 1186244 -> 1171319 (-1.26%) helped: 6186 HURT: 0 total cycles in shared programs: 332062633 -> 331961653 (-0.03%) cycles in affected programs: 8537165 -> 8436185 (-1.18%) helped: 5371 HURT: 862 LOST: 6 GAINED: 14 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-13 14:06:32 +11:00
Eric Anholt	621b0fa892	egl: Improve the debugging of gbm format matching in DRI configs. Previously the debug would be: libEGL debug: No DRI config supports native format 0x20203852 libEGL debug: No DRI config supports native format 0x38385247 but libEGL debug: No DRI config supports native format R8 libEGL debug: No DRI config supports native format GR88 is a lot easier to understand. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Daniel Stone <daniels@collabora.com>	2018-11-12 15:20:23 -08:00
Eric Anholt	6328536ff2	gbm: Introduce a helper function for printing GBM format names. This requires that the caller make a little (stack) allocation to store the string. v2: Use gbm_format_canonicalize (suggested by Daniel) Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Daniel Stone <daniels@collabora.com>	2018-11-12 15:20:23 -08:00
Eric Anholt	ee7f848c00	gbm: Move gbm_format_canonicalize() to the core. I want it for the format name debugging code. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Daniel Stone <daniels@collabora.com>	2018-11-12 15:20:23 -08:00
Dylan Baker	4eab98b66e	meson: fix libatomic tests There are two problems: 1) the extra underscore in MISSING_64BIT_ATOMICS 2) we should link with libatomic if the previous test decided we needed it Fixes: `d1992255bb` ("meson: Add build Intel "anv" vulkan driver") Reviewed-and-Tested-by: Matt Turner <mattst88@gmail.com>	2018-11-12 13:29:00 -08:00
Marek Olšák	32a334777c	mesa: mark GL_SR8_EXT non-renderable on GLES Fixes: dEQP-GLES3.functional.fbo.completeness.renderable.texture.color0.sr8_ext Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-11-12 16:19:43 -05:00
Marek Olšák	e0c7114eb3	st/mesa: disable L3 thread pinning This implementation can have massive drawbacks. Cc: 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Edmondo Tommasina <edmondo.tommasina@gmail.com>	2018-11-12 16:18:15 -05:00
Christian Gmeiner	c6aaafa3a1	nir: add lowering for ffloor Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-12 21:57:25 +01:00
Alyssa Rosenzweig	41c8f99137	util: Fix warning in u_cpu_detect on non-x86 regs is only set and used on x86; on other platforms (like ARM), this code causes a trivial warning, solved by moving the regs declaration to the architecture-dependent usage. Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2018-11-12 10:28:04 -08:00
Dylan Baker	9c2a95b298	meson: Don't set -Wall meson does this for you with its warn levels, so we don't need to set it ourselves. Fixes: `d1992255bb` ("meson: Add build Intel "anv" vulkan driver") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-12 08:55:55 -08:00
Rob Clark	4a0c2cfdd6	freedreno/drm: fix unused 'entry' warnings Looks like importing libdrm_freedreno into mesa crossed paths with `e27902a261`. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-12 10:45:48 -05:00
Lionel Landwerlin	89785e2d56	i965: add support for sampling from AYUV Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-12 13:22:54 +00:00
Lionel Landwerlin	252ca7b43f	dri: add AYUV format v2: Add a AYUV entry android in the android backend (Tapani) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-12 13:22:54 +00:00
Lionel Landwerlin	8a15f06d19	nir/lower_tex: Add AYUV lowering support Byte ordering is : 0: V 1: U 2: Y 3: A v2: Split refactoring of alpha channel (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (v1) Acked-by: Eric Engestrom <eric.engestrom@intel.com> (v2)	2018-11-12 13:22:54 +00:00
Lionel Landwerlin	0a30c33e83	nir/lower_tex: add alpha channel parameter for yuv lowering We're about to introduce AYUV support which provides its own alpha channel. So give alpha as a parameter and set it to 1 on exising formats. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-12 13:22:54 +00:00
Samuel Pitoiset	97fb1a02fd	radv: make use of num_good_cu_per_sh in si_emit_graphics() too Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-12 09:35:46 +01:00
Samuel Pitoiset	d9d14346c2	radv: clean up setting partial_es_wave for distributed tess on VI Only needed when the pipeline actually uses tessellation. I don't think that changes anything, except improving readability. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-12 09:35:44 +01:00
Samuel Pitoiset	cc4569b733	radv: cleanup and document a Hawaii bug with offchip buffers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-12 09:35:42 +01:00
Hanno Böck	8dc2085baf	glsl/test: Fix use after free in test_optpass. The variable state is free'd and afterwards state->error is used as the return value, resulting in a use after free bug detected by memory safety tools like address sanitizer. Signed-off-by: Hanno Böck <hanno@hboeck.de> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108636 Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-12 07:42:58 +02:00
Timothy Arceri	a068958692	nir: don't pack varyings ints with floats unless flat Fixes: `1c9c42d16b` ("nir: add varying component packing helpers") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-12 15:38:56 +11:00
Timothy Arceri	9dd737bb02	nir: add glsl_type_is_integer() helper Fixes: `1c9c42d16b` ("nir: add varying component packing helpers") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-12 15:38:56 +11:00
Francisco Jerez	552642066f	intel/fs: Prevent emission of IR instructions not aligned to their own execution size. This can occur during payload setup of SIMD-split send message instructions, which can lead to the emission of header setup instructions with a non-zero channel group and fixed SIMD width. Such instructions could end up using undefined channel enable signals except they don't care since they're always marked force_writemask_all. Not known to affect correctness of any workload at this point, but it would be trivial to back-port to stable if something comes up. Reported-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Tested-by: Sagar Ghuge <sagar.ghuge@intel.com>	2018-11-09 19:39:22 -08:00
Timothy Arceri	590fcb50e7	st/mesa: make use of nir_link_constant_varyings() Shader-db results radeonsi (VEGA): Totals from affected shaders: SGPRS: 161464 -> 161368 (-0.06 %) VGPRS: 86904 -> 86292 (-0.70 %) Spilled SGPRs: 296 -> 314 (6.08 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 3618596 -> 3573852 (-1.24 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 26189 -> 26276 (0.33 %) Wait states: 0 -> 0 (0.00 %) Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-10 11:41:00 +11:00
Timothy Arceri	d40dd05553	nir: add new linking opt nir_link_constant_varyings() This pass moves constant outputs to the consuming shader stage where possible. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-11-10 11:41:00 +11:00
Andre Heider	414470854d	st/nine: clean up thead shutdown sequence a bit Just break out of the loop instead, it does the same thing. Signed-off-by: Andre Heider <a.heider@gmail.com> Reviewed-by: Axel Davy <davyaxel0@gmail.com>	2018-11-09 22:37:27 +01:00
Andre Heider	123bf9cbe7	st/nine: plug thread related leaks Signed-off-by: Andre Heider <a.heider@gmail.com> Reviewed-by: Axel Davy <davyaxel0@gmail.com>	2018-11-09 22:37:27 +01:00
Andre Heider	10598c9667	st/nine: fix stack corruption due to ABI mismatch This fixes various crashes and hangs when using nine's 'thread_submit' feature. On 64bit, the thread function's data argument would just be NULL. On 32bit, the data argument would be garbage depending on the compiler flags (in my case -march>=core2). Fixes: `f3fa7e3068` ("st/nine: Use WINE thread for threadpool") Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Andre Heider <a.heider@gmail.com> Reviewed-by: Axel Davy <davyaxel0@gmail.com>	2018-11-09 22:37:26 +01:00
Marek Olšák	d2b2364313	radeonsi: stop command submission with PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET only Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-11-09 14:55:04 -05:00
Marek Olšák	4bec5025ac	gallium: add PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-11-09 14:55:04 -05:00
Marek Olšák	9dc776f3f2	radeonsi: don't set the CB clear color registers for 0/1 clear colors on Raven2 and add has_dcc_constant_encode.	2018-11-09 14:55:04 -05:00
Marek Olšák	832ab883e2	radeonsi: use better DCC clear codes Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-11-09 14:55:04 -05:00
Marek Olšák	d059eae269	ac/surface: remove the overallocation workaround for Vega12 not needed anymore (probably since the tile_swizzle fix) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-11-09 14:55:04 -05:00
Lionel Landwerlin	959e2a5aeb	intel/aub_read: remove useless breaks Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-09 18:17:30 +00:00
Erik Faye-Lund	b55af392d9	Revert "mesa: expose NV_conditional_render on GLES" This reverts commit `5213be9fab`.	2018-11-09 17:39:25 +01:00
Erik Faye-Lund	cf8b271cbe	Revert "mesa/main: fixup make check after NV_conditional_render for gles" This reverts commit `cccd7a253f`.	2018-11-09 17:39:22 +01:00
Erik Faye-Lund	cccd7a253f	mesa/main: fixup make check after NV_conditional_render for gles It seems I missed some details when exposing NV_conditional_render on GLES; this fixes up "make check". Fixes: `5213be9fab` ("mesa: expose NV_conditional_render on GLES") Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-and-Tested-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-09 16:47:34 +01:00
Nicolai Hähnle	8c97abc066	radv: include LLVM IR in the VK_AMD_shader_info "disassembly" Helpful for debugging compiler backend problems: this allows us to easily retrieve the LLVM IR from RenderDoc. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-11-09 14:54:37 +01:00
Erik Faye-Lund	5213be9fab	mesa: expose NV_conditional_render on GLES The extension spec has been updated to include GLES 2 support, so let's enable it there. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-11-09 13:03:00 +01:00
Iago Toral Quiroga	35baee5dce	nir/constant_folding: fix incorrect bit-size check nir_alu_type_get_type_size takes a type as parameter and we were passing a bit-size instead, which did what we wanted by accident, since a bit-size of zero matches nir_type_invalid, which has a size of 0 too. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-09 08:22:15 +01:00
Iago Toral Quiroga	6c418dfa42	intel/compiler: fix node interference of simd16 instructions SIMD16 instructions need to have additional interferences to prevent source / destination hazards when the source and destination registers are off by one register. While we already have code to handle this, it was only running for SIMD16 dispatches, however, we can have SIDM16 instructions in a SIMD8 dispatch. An example of this are pull constant loads since commit `b56fa830c6`, but there are more cases. This fixes a number of CTS test failures found in work-in-progress tests that were hitting this situation for 16-wide pull constants in a SIMD8 program. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-11-09 08:22:08 +01:00
Roland Scheidegger	a3c898dc97	gallivm: fix improper clamping of vertex index when fetching gs inputs Because we only have one file_max for the (2d) gs input file, the value actually represents the max of attrib and vertex index (although I'm not entirely sure if we really want the max, since the max valid value of the vertex dimension can be easily deduced from the input primitive). Thus in cases where the number of inputs is higher than the number of vertices per prim, we did not properly clamp the vertex index, which would result in out-of-bound fetches, potentially causing segfaults (the segfaults seemed actually difficult to trigger, but valgrind certainly wasn't happy). This might have happened even if the shader did not actually try to fetch bogus vertices, if the fetching happened in non-active conditional clauses. To fix simply use the correct max vertex index value (derived from the input prim type) instead when clamping for this case. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-11-09 00:53:03 +01:00
Aditya Swarup	a5c39ed974	i965: Lift restriction in external textures for EGLImage support Fixes Skqp's unitTest_EGLImageTest test. For Intel platforms, we support external textures only for EGLImages created with EGL_EXT_image_dma_buf_import. This restriction seems to be Intel specific and not present for other platforms. While running SKQP test - unitTest_EGLImageTest, GL_INVALID is sent to the test because of this restriction. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105301 Signed-off-by: Aditya Swarup <aditya.swarup@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2018-11-08 12:33:06 -08:00
Ian Romanick	c5a4c26450	glsl: Add pragma to disable all warnings Use #pragma warning(off) and #pragma warning(on) to disable or enable all warnings. This is a big hammer. If we ever need a smaller hammer, we can enhance this functionality. There is one lame thing about this. Because we parse everything, create an AST, then convert the AST to GLSL IR, we have to treat the #pragma like a statment. This means that you can't do something like ' void ' #pragma warning(off) ' __foo ' #pragma warning(on) ' (float param0); Fixing that would, as far as I can tell, require a huge amount of work. I did try just handling the #pragma during parsing (like we do for state for the whole shader. v2: Fix the #pragma lines in the commit message that git-commit ate. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-11-08 11:00:00 -08:00
Ian Romanick	011abfc963	glsl: Add warning tests for identifiers with __ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-11-08 10:59:53 -08:00
Jason Ekstrand	d28bc35ece	intel/fs: Add an assert to optimize_frontfacing_ternary Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-08 10:09:25 -06:00
Jason Ekstrand	bcc6aab065	anv: Use nir_src_is_const and friends in lowering code Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-08 10:09:25 -06:00
Jason Ekstrand	52145070c0	intel/analyze_ubo_ranges: Use nir_src_is_const and friends Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-08 10:09:25 -06:00
Jason Ekstrand	1413512b4c	intel/vec4: Use the new nir_src_is_const and friends As of this commit, all uses of const sources either go through a nir_src_as_<type> helper which handles bit sizes correctly or else are accompanied by a nir_src_bit_size() == 32 assertion to assert that we have the size we think we have. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-08 10:09:25 -06:00
Jason Ekstrand	61e15348c4	nir: Add a read_mask helper for ALU instructions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-08 10:09:22 -06:00
Jason Ekstrand	344cfe6980	intel/fs: Use the new nir_src_is_const and friends As of this commit, all uses of const sources either go through a nir_src_as_<type> helper which handles bit sizes correctly or else are accompanied by a nir_src_bit_size() == 32 assertion to assert that we have the size we think we have. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-08 10:09:20 -06:00
Jason Ekstrand	6b2918709a	intel/fs,vec4: Clean up a repeated pattern with SSBOs Everywhere we handle SSBO intrinsics, we have exactly the same pattern for computing the index so we may as well make a helper for it. We also add a get_nir_src_imm to vec4 and use it for SSBO offsets. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-08 10:09:06 -06:00
Samuel Pitoiset	c472ad82e4	radv: fix GPU hangs when loading depth/stencil clear values on SI/CIK HTILE is supported on these chips, not sure how I missed that. This restores using PFP_SYNC_ME when LOAD_CONTEXT_REG is not used. Fixes: `f425d9ee74` ("radv: use LOAD_CONTEXT_REG when loading fast clear values") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-11-08 11:20:03 +01:00
Samuel Pitoiset	f425d9ee74	radv: use LOAD_CONTEXT_REG when loading fast clear values This avoids syncing the Micro Engine. This is only supported for VI+ currently. There is probably a way for using LOAD_CONTEXT_REG on previous chips but that could be done later. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-08 10:41:45 +01:00
Samuel Pitoiset	0dcd99c687	radv: only expose VK_SUBGROUP_FEATURE_ARITHMETIC_BIT for VI+ Inclusive and exclusives scan are missing because older chips don't have llvm.amdgcn.update.dpp. This fixes crashes with dEQP-VK.subgroups.arithmetic.*. CC: mesa-stable@lists.freedesktop.org Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-08 10:41:41 +01:00
Adam Jackson	16f1023037	glx: Demand success from CreateContext requests (v2) GLXCreate{,New}Context, like most X resource creation requests, does not emit a reply and therefore is emitted into the X stream asynchronously. However, unlike most resource creation requests, the GLXContext we return is a handle to library state instead of an XID. So if context creation fails for any reason - say, the server doesn't support indirect contexts - then we will fail in strange places for strange reasons. We could make every GLX entrypoint robust against half-created contexts, or we could just verify that context creation worked. Reuse the __glXIsDirect code to do this, as a cheap way of verifying that the XID is real. glXCreateContextAttribsARB solves this by using the _checked version of the xcb command, so effectively this change makes the classic context creation paths as robust as CreateContextAttribs. v2: Better use of Bool, check that error != NULL first (Olivier Fourdan) Signed-off-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-11-07 12:38:05 -05:00
Karol Herbst	f7fae7f64e	gm107/ir: fix compile time warning in getTEXSMask In function 'uint8_t nv50_ir::getTEXSMask(uint8_t)': warning: control reaches end of non-void function [-Wreturn-type] Reported-by: Moiman@freenode Fixes: `f821e80213` "gm107/ir: use scalar tex instructions where possible" Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-11-07 17:48:58 +01:00
Michel Dänzer	32b0eb51a3	winsys/amdgpu: Stop using amdgpu_bo_handle_type_kms_noimport It only behaves any different from amdgpu_bo_handle_type_kms with libdrm 2.4.93, and it breaks if an older version is picked up. Bugzilla: https://bugs.freedesktop.org/108096 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-11-07 17:37:47 +01:00
Lionel Landwerlin	792dde66f2	intel/dump_gpu: add platform option Got tired of remembering the PCI ids. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-07 11:27:41 +00:00
Lionel Landwerlin	e262cc0353	intel/dump_gpu: move output option together Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-07 11:27:38 +00:00
Samuel Pitoiset	0a0aa2ba6c	radv: disable conditional rendering for vkCmdCopyQueryPoolResults() VK_EXT_conditional_rendering says that copy commands should not be affected by conditional rendering. Cc: 18.2 18.3 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-07 11:31:36 +01:00
Samuel Pitoiset	1e7c3379e1	radv: allocate enough space in CS when copying query results with compute Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-07 11:31:34 +01:00
Timothy Arceri	9aa3c1915e	ac/nir_to_llvm: fix b2f for f64 Fixes: `d7e0d47b9d` ("nir: Add a bunch of b2[if] optimizations") Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-07 16:35:07 +11:00
Karol Herbst	f821e80213	gm107/ir: use scalar tex instructions where possible TEXS, TLD4 and TLD4S are variants of tex instructions which are more scalar, which gives RA more freedom and is less likely to insert silly MOVs to satisfy quad registers. shader-db changes: total instructions in shared programs : 7687265 -> 7614782 (-0.94%) total gprs used in shared programs : 803620 -> 798045 (-0.69%) total shared used in shared programs : 639636 -> 639636 (0.00%) total local used in shared programs : 24648 -> 24648 (0.00%) total bytes used in shared programs : 82103400 -> 81330696 (-0.94%) local shared gpr inst bytes helped 0 0 3648 10647 10647 hurt 0 0 464 205 205 Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-11-06 19:57:05 +01:00
Karol Herbst	edd6c41751	nv50/ir: add scalar field to TexInstructions Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-11-06 19:57:05 +01:00
Karol Herbst	8d825f78fc	nv50/ra: add condenseDef overloads for partial condenses Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-11-06 19:57:05 +01:00
Karol Herbst	a4550de434	nv50/ir: print color masks of tex instructions v2: print the mask for TXG as well make the mask to be printed more mask like Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-11-06 19:57:05 +01:00
Jason Ekstrand	610061838a	vulkan: Update the XML and headers to 1.1.91 The biggest change here is the rename of VK_NVX_ray_tracing to VK_NV_ray_tracing and the total removal of VK_KHR_mir_surface. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-06 12:21:19 -06:00
Gert Wollny	c171d76b94	r600: Add support for EXT_texture_sRGB_R8 Enables on R600 and makes pass: dEQP-GLES31.functional.srgb_texture_decode.skip_decode.sr8.* dEQP-GLES31.functional.texture.filtering.cube_array.formats.sr8* v2: remove chunk for dri/radeon (Emil) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-11-06 18:49:02 +01:00
Lionel Landwerlin	421fa01d64	anv/android: mark gralloc allocated BOs as external Allocating through Gralloc implies buffers are going to be used outside the driver. We have special MOCS settings for external BOs and we probably want to use them here too. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `a1220e7311` ("anv/android: Set the BO flags in bo_cache_import (v2)") Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-06 15:28:07 +00:00
Lionel Landwerlin	b43f955037	anv: stub internal android code This reduces the amount of #ifdef ANDROID we'll have to have inside the driver. Potentially offering better coverage of the android extensions. v2: Move anv_android.h include before anv_entrypoints.h (Tapani) Fix autotools android build (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-06 15:28:07 +00:00
Kristian H. Kristensen	f6131d4ec7	freedreno/a6xx: Clear z32 and separate stencil with blitter Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-11-06 08:56:38 -05:00
Rob Clark	3bbad81c80	freedreno/a6xx: fix VSC bug with larger # of tiles At higher resolutions with the addition of MSAA, the number of tiles can increase to the point where we use more than one VSC pipe per tile. Which would cause us to calculate an out-of-bounds offset for VSC_SIZE_ADDRESS. So don't try to be clever, just always put it at a fixed offset assuming the max 32 VSC pipes in use. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-06 08:56:21 -05:00
Rob Clark	2d9c3a5db2	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-06 08:43:27 -05:00
Olivier Fourdan	55af17ffed	wayland/egl: Resize EGL surface on update buffer for swrast After commit `a9fb331ea` ("wayland/egl: update surface size on window resize"), the surface size is updated as soon as the resize is done, and `update_buffers()` would resize only if the surface size differs from the attached size. However, in the case of swrast, there is no resize callback and the attached size is updated in `dri2_wl_swrast_commit_backbuffer()` prior to the `swrast_update_buffers()` so the attached size is always up to date when it reaches `swrast_update_buffers()` and the surface is never resized. This can be observed with "totem" using the GDK backend on Wayland (the default) when running on software rendering: $ LIBGL_ALWAYS_SOFTWARE=true CLUTTER_BACKEND=gdk totem Resizing the window would leave the EGL surface size unchanged. To avoid the issue, partially revert the part of commit `a9fb331ea` for `swrast_update_buffers()` and resize on the win size and not the attached size. Fixes: `a9fb331ea` - wayland/egl: update surface size on window resize Signed-off-by: Olivier Fourdan <ofourdan@redhat.com> CC: Daniel Stone <daniel@fooishbar.org> CC: Juan A. Suarez Romero <jasuarez@igalia.com> CC: mesa-stable@lists.freedesktop.org Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-11-06 13:59:38 +01:00
Lionel Landwerlin	b47a69ed4c	intel/decoders: fix instruction base address parsing Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `00103db04a` ("intel: Fix decoding for partial STATE_BASE_ADDRESS updates.") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-05 13:22:35 -08:00
Emil Velikov	b3ade65387	egl/glvnd: correctly report errors when vendor cannot be found If the user provides an invalid display or device the ToVendor lookup will fail. In this case, the local [Mesa vendor] error code will be set. Thus on sequential eglGetError(), the error will be EGL_SUCCESS. To be more specific, GLVND remembers the last vendor and calls back into it's eglGetError, although there's no guarantee to ever have had one. v2: - Add _eglError call, so the debug callback is executed (Kyle) - Drop XXX comment. Piglit: tests/egl/spec/egl_ext_device_query Fixes: `ce562f9e3f` ("EGL: Implement the libglvnd interface for EGL (v3)") Cc: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Kyle Brenneman <kbrenneman@nvidia.com>	2018-11-05 20:53:05 +00:00
Emil Velikov	2a8fefdeb0	egl: add EGL_EXT_device_base entrypoints eglQueryDevicesEXT (unlike the other three functions) does not depend on the display. It is implemented in GLVND, which calls into each driver collecting the list of devices and presenting it to the user. For the other entrypoints, GLVND acts as pass through stub calling into the vendor library. The vendor implementation calls back into GLVND to get the vendor dispatch. Then the driver proceeds to call itself via the said dispatch. This design makes is possible to keep using "old" GLVND with newer vendor drivers. Since effectively all the extension code is within the latter itself. Without said entrypoints, any user will outright crash - as reported in the bug report. Note: there's a follow-up fix needed to our GLVND code, to make piglit happy. v2: add some beefy documentation in the commit message. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108635 Fixes: `7552fcb7b9` ("egl: add base EGL_EXT_device_base implementation") Reported-by: kyle.devir@mykolab.com Cc: kyle.devir@mykolab.com Acked-by: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Tested-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-05 20:53:05 +00:00
Emil Velikov	7e169cf2a0	docs: mention EXT_shader_implicit_conversions Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-05 20:53:05 +00:00
Marek Olšák	04298a2f24	st/va: fix incorrect use of resource_destroy Fixes: `4373dd3215` ("st/va: Support YUV formats in vaCreateSurfaces") Cc: Drew Davenport <ddavenport@chromium.org> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-11-05 15:47:50 -05:00
Sergii Romantsov	5aeee1ab15	i965/batch/debug: Allow log be dumped before assert Message that may show the culprit of assert now will be dumped before that for debug purposes. Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Lionel G Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-05 09:24:55 -08:00
Lionel Landwerlin	4fd0ff75f3	intel/sanitize_gpu: add debug message on mmap fail Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-05 15:45:08 +00:00
Lionel Landwerlin	e400ac52e4	intel/sanitize_gpu: deal with non page multiple buffer sizes We can only map at page aligned offsets. We got that wrong with buffer size where (size % 4096) != 0 (anv has a WA buffer of 1024). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-05 15:45:07 +00:00
Lionel Landwerlin	c5fca35af1	intel/sanitize_gpu: add help/gdb options to wrapper Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-05 15:45:07 +00:00
Lionel Landwerlin	9ab5089150	intel/dump_gpu: add missing gdb option Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-05 15:43:34 +00:00
Eric Engestrom	d515ded4d9	wsi/wayland: only finish() a successfully init()ed display Fixes: `4369102498` "vulkan/wsi/wayland: Stop caching Wayland displays" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de>	2018-11-05 15:29:21 +00:00
Eric Engestrom	dcee22afed	wsi/wayland: use proper VkResult type Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-05 14:55:05 +00:00
Sergii Romantsov	ce837a5372	autotools: library-dependency when no sse and 32-bit Building of 32bit Mesa may fail if __SSE__ is not specified. Added missed dependency from libm. v2: avoided dependecy on any flag, just link v3: meson doesn't fail, but have added dependency on libm CC: Dylan Baker <dylan@pnwbakers.com> CC: Lionel G Landwerlin <lionel.g.landwerlin@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108560 Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-11-05 13:21:49 +01:00
Samuel Pitoiset	f7fd0d86a9	radv: more use of radv_cp_wait_mem() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-05 09:48:50 +01:00
Samuel Pitoiset	c571ca7a08	radv: replace si_emit_wait_fence() with radv_cp_wait_mem() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-05 09:48:50 +01:00
Samuel Pitoiset	b1b2dd06a7	radv: add missing TFB queries support to CmdCopyQueryPoolsResults() Cc: 18.3 <mesa-stable@lists.freedesktop.org> Fixes: `b4eb029062` ("radv: implement VK_EXT_transform_feedback") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-05 09:48:43 +01:00
Samuel Pitoiset	dc3419195c	radv: remove useless sync after copying query results with compute The spec says: "vkCmdCopyQueryPoolResults is considered to be a transfer operation, and its writes to buffer memory must be synchronized using VK_PIPELINE_STAGE_TRANSFER_BIT and VK_ACCESS_TRANSFER_WRITE_BIT before using the results." VK_PIPELINE_STAGE_TRANSFER_BIT will wait for compute to be idle, while VK_ACCESS_TRANSFER_WRITE_BIT will invalidate both L1 vector caches and L2. So, it's useless to set those flags internally. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-05 09:47:55 +01:00
Vinson Lee	64a9ed8848	r600/sb: Fix constant logical operand in assert. Fixes: `da977ad907` ("r600/sb: start adding GDS support") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2018-11-04 21:09:55 -08:00
Kenneth Graunke	5d517a599b	st/mesa: Don't record garbage streamout information in the non-SSO case. In the non-SSO case, where multiple shader stages are linked together, we were recording garbage pipe_stream_output_info structures for all but the last enabled geometry-processing stage. Specifically, we were using the gl_transform_feedback_info from shader_program->last_vert_prog (the stage whose outputs will be recorded)...but were pairing it with the output varying mappings from the current shader stage. For example, a program with a VS and GS, the VS's pipe_shader_state would have a pipe_stream_output_info based on the GS transform feedback info, but the VS output mapping. This generally worked out okay because only the pipe_stream_output_info for the last stage really matters - the others can be ignored. However, we'd like to avoid confusing the pipe driver. In particular, my new driver translates the stream out information to hardware packets at bind_{vs,tes,gs}_state() time...and was hitting asserts about garbage varyings that didn't exist. This patch changes st/mesa to record a blank pipe_stream_output_info with num_outputs = 0 for all stages prior to last_vert_prog. The last one is captured as normal. (In the fully-SSO case, nothing should change - each program contains a single shader stage, so last_vert_prog is the current shader.) Tested with llvmpipe (piglit's gpu profile), and freedreno (a3xx, gpu profile with -t transform.feedback). Fixes several hundred CTS tests on my new driver. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-11-03 23:34:36 -07:00
Kenneth Graunke	b6410a2d22	st/nir: Drop unused parameter from st_nir_assign_uniform_locations(). ARB programs won't have one of these, and we don't use it anyway. Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-11-03 23:34:36 -07:00
Kenneth Graunke	5294d65011	st/mesa: Pull nir_lower_wpos_ytransform work into a helper function. This will let me use it in the ARB program code as well. Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-11-03 23:34:34 -07:00
Kenneth Graunke	424a6052df	intel: Use a URB start offset of 0 for disabled stages. There are some cases where the VS is the only stage enabled, it uses the entire URB, and the URB is large enough that placing later stages after the VS exceeds the number of bits for "URB Starting Address". For example, on Icelake GT2, "varying-packing-simple mat2x4 array" from Piglit is getting a starting offset of 128 for the GS/HS/DS. But the field is only large enough to hold an offset of 127. i965 doesn't hit any genxml assertions because it's still using the old OUT_BATCH mechanism. 128 << GEN7_URB_STARTING_ADDRESS_SHIFT (57) == 0, with the extra bit falling off the end. So we place the disabled stage at the beginning of the URB (overlapping with push constants). This is likely okay since it's a zero size region (0 entries). It seems like the Vulkan driver might hit this assertion, however, and the situation seems harmless. To work around this, always place disabled stages at the start of the URB, so the last enabled stage can fill the remaining space without overflowing the field. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2018-11-03 23:25:57 -07:00
Mauro Rossi	5c0cff868a	android: radv: add libmesa_git_sha1 static dependency libmesa_git_sha1 whole static dependency is added to get git_sha1.h header and avoid following building error: external/mesa/src/amd/vulkan/radv_device.c:46:10: fatal error: 'git_sha1.h' file not found ^ 1 error generated. Fixes: `9d40ec2cf6` ("radv: Add support for VK_KHR_driver_properties.") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-03 10:48:45 +01:00
Eric Anholt	0d78c6af0d	vc4: Use the normal simulator ioctl path for CL submit as well. The simulator no longer needs to look back into the gallium structs.	2018-11-02 14:26:38 -07:00
Eric Anholt	c80e267a0a	vc4: Maintain a separate GEM mapping of BOs in the simulator. This will let us avoid looking back into the gallium driver's vc4_bo.	2018-11-02 14:26:38 -07:00
Eric Anholt	645ca269d2	vc4: Take advantage of _mesa_hash_table_remove_key() in the simulator.	2018-11-02 14:26:38 -07:00
Eric Anholt	f32ba7abd7	v3d: Remove the special path for simulaton of the submit ioctl. Now that it doesn't need to find the struct v3d_bos, it can just take the normal v3d_ioctl() path.	2018-11-02 14:26:38 -07:00
Eric Anholt	df9f574c13	v3d: Maintain a mapping of the GEM buffer in the simulator. This way we don't need to reach back into the gallium driver code to get the mapping.	2018-11-02 14:26:38 -07:00
Dylan Baker	7652931d33	meson: link gallium nine with pthreads In some cases (not building with llvm, which automatically pulls in pthreads) nine needs to be directly linked with pthreads. Fixes building on x86 (32 bit) without llvm. Distro bug: https://bugs.gentoo.org/670094 Fixes: `6b4c7047d5` ("meson: build gallium nine state_tracker") Tested-by: Rafal Lalik <rafallalik@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-11-02 13:10:33 -07:00
Anuj Phogat	1c140470ef	anv/icl: Disable prefetching of sampler state entries WA_1606682166: Incorrect TDL's SSP address shift in SARB for 16:6 & 18:8 modes. Disable the Sampler state prefetch functionality in the SARB by programming 0xB000[30] to '1'. This is to be done at boot time and the feature must remain disabled permanently. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-02 08:34:33 -07:00
Topi Pohjolainen	9a41a10f8a	i965/icl: Disable prefetching of sampler state entries In the same spirit as commit `a5889d70f2` "i965/icl: Disable binding table prefetching". Fixes some 110+ intermittent piglit failures with tex-miplevel-selection variants. WA_1606682166: Incorrect TDL's SSP address shift in SARB for 16:6 & 18:8 modes. Disable the Sampler state prefetch functionality in the SARB by programming 0xB000[30] to '1'. This is to be done at boot time and the feature must remain disabled permanently. Anuj: Set SamplerCount = 0 for vs, gs, hs, ds and wm units as well. Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-11-02 08:34:33 -07:00
Jan Vesely	9cab8ccd6c	amd: Make vgpr-spilling depend on llvm version The option was removed in LLVM r345763 Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-11-02 10:32:47 -04:00
Timothy Arceri	769ae9fb7f	nir: fix condition propagation when src has a swizzle We cannot use nir_build_alu() to create the new alu as it has no way to know how many components of the src we will use. This results in it guessing the max number of components from one of its inputs. Fixes the following CTS tests: dEQP-VK.spirv_assembly.instruction.graphics.selection_block_order.out_of_order_frag dEQP-VK.spirv_assembly.instruction.graphics.selection_block_order.out_of_order_geom dEQP-VK.spirv_assembly.instruction.graphics.selection_block_order.out_of_order_tessc dEQP-VK.spirv_assembly.instruction.graphics.selection_block_order.out_of_order_vert Fixes: `2975422ceb` ("nir: propagates if condition evaluation down some alu chains") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-03 00:44:01 +11:00
Mauro Rossi	b9dec214f5	android: gallium/auxiliary: add include to get u_debug.h header To avoid build error in u_debug_stack_android.cpp due to now missing u_debug.h header: external/mesa/src/gallium/auxiliary/util/u_debug_stack_android.cpp:26:10: fatal error: 'u_debug.h' file not found #include "u_debug.h" ^ 1 error generated. Fixes: `37db383abb` ("util: Move u_debug to utils") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-11-02 13:31:37 +01:00
Gert Wollny	b710680093	virgl/vtest-winsys: Use virgl version of bind flags The bind flags defined by mesa/gallium might not always be in sync with the ones copied to virglrenderer/gallium. Therefore, use the flags defined in virgl like it is done for all the other calls to create resources. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-02 11:53:09 +01:00
Gert Wollny	acd2968005	mesa/st: Add support for EXT_texture_sRGB_R8 This only adds support on the Gallium core level, for the drivers it is likely that additional changes are needed to support the new texture format and thereby enabling the extension. Enables on softpipe and makes pass: dEQP-GLES31.functional.srgb_texture_decode.skip_decode.sr8.* v2: - add include for getting GL_SR8_EXT v4: - since the extension is not required don't bother providing a fallback (Ilia Mirkin) - split patch (2/2) to separate Gallium and mesa/st parts (Roland Scheidegger) - trim commit message to only contain the history of the patch relevant to this part v5: - don't include GLES headers (required enum has been added to glheader.h) (Ilia Mirkin) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-11-02 11:52:44 +01:00
Gert Wollny	29f0ab2c30	Gallium: Add format PIPE_FORMAT_R8_SRGB This format is needed to support EXT_texture_sRGB_R8. THe patch adds a new format enum, the format entries in Gallium and and svga, the mapping between sRGB and linear formats, and tests. v2: - add mapping to linear format for PIPE_FORMATR_R8_SRGB v3: - Add texture format to svga format table since otherwise building mesa will fail when this driver is enabled. It was not tested whether the extension actually works. v4: - svga: remove the SVGA specific format definitions and table entries and only add correct the location of PIPE_FORMAT_R8_SRGB in the format_conversion_table (Ilia Mirkin) - Split patch (1/2) to separate Gallium part and mesa/st part. (Roland Scheidegger) - Trim the commit message to only contain the relevant parts from the split. v5: - svga: correct location of PIPE_FORMAT_SRGB_R8 (Ilia Mirkin) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-11-02 11:52:44 +01:00
Gert Wollny	b8e9c6522d	mesa/core: Add definitions and translations for EXT_texture_sRGB_R8 v2: - fix format definition line - disable for desktop GL - don't add GL_R8_EXT to glext.h since it is already in GLES2/gl2ext.h in glext.h and include this header where needed (all Emil) v3: - swrast: Fill the function table for sRGB_R8 The size of the function table is checked at compile time and must correspond to the number of mesa texture formats. dri/swrast being gles-2.0 doesn't support the extension though v4: - correct format layout comment (Ilia Mirkin) - correct logic for accepting GL_RED only textures (in part Ilia Mirkin) EXT_texture_sRGB_R8 requires OpenGL ES 3.0 which includes ARB_texture_rg/EXT_texture_rg, so one only must check for the first when SR8_EXT is really requested. v5: - add define for GL_ES8_XT to glheader.h and don't include GLES headers (Ilia Mirkin) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-11-02 11:52:44 +01:00
Erik Faye-Lund	742dace825	glsl: do not allow implicit casts of unsized array initializers The GLSL 4.6 specification (section 4.1.14. "Implicit Conversions") says: "There are no implicit array or structure conversions. For example, an array of int cannot be implicitly converted to an array of float." So let's add a check in place when assigning array initializers to implicitly sized arrays, to avoid incorrectly allowing code on the form: int[] foo = float[](1.0, 2.0, 3.0) This fixes the following dEQP test-cases: - dEQP-GLES31.functional.shaders.implicit_conversions.es31.invalid.arrays.int_to_float_vertex - dEQP-GLES31.functional.shaders.implicit_conversions.es31.invalid.arrays.int_to_float_fragment - dEQP-GLES31.functional.shaders.implicit_conversions.es31.invalid.arrays.int_to_uint_vertex - dEQP-GLES31.functional.shaders.implicit_conversions.es31.invalid.arrays.int_to_uint_fragment - dEQP-GLES31.functional.shaders.implicit_conversions.es31.invalid.arrays.uint_to_float_vertex - dEQP-GLES31.functional.shaders.implicit_conversions.es31.invalid.arrays.uint_to_float_fragment - dEQP-GLES31.functional.shaders.implicit_conversions.es32.invalid.arrays.int_to_float_vertex - dEQP-GLES31.functional.shaders.implicit_conversions.es32.invalid.arrays.int_to_float_fragment - dEQP-GLES31.functional.shaders.implicit_conversions.es32.invalid.arrays.int_to_uint_vertex - dEQP-GLES31.functional.shaders.implicit_conversions.es32.invalid.arrays.int_to_uint_fragment - dEQP-GLES31.functional.shaders.implicit_conversions.es32.invalid.arrays.uint_to_float_vertex - dEQP-GLES31.functional.shaders.implicit_conversions.es32.invalid.arrays.uint_to_float_fragment Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-02 11:10:36 +01:00
Erik Faye-Lund	6df922f438	mesa/glsl: add support for EXT_shader_implicit_conversions EXT_shader_implicit_conversions adds support for implicit conversions for GLES 3.1 and above. This is essentially a subset of ARB_gpu_shader5, and augments OES_gpu_shader5. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-02 11:10:36 +01:00
Erik Faye-Lund	ecab2d6f14	glsl: fall back to inexact function-match In GLES, we currently either need an exact match with a local function, or an exact match with a builtin. However, if we add support for implicit conversions for GLES shaders, we also need to fall back to a non-exact match in the case where there were no builtin match either. Luckily, we already have a variable ready with this, so let's just return it if the builtin-search failed. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-02 11:10:36 +01:00
Erik Faye-Lund	e975c5b785	glsl: add has_implicit_uint_to_int_conversion()-helper This makes the code a bit easier to read, as well as reduces repetition, especially when we add support for EXT_shader_implicit_conversions. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-02 11:10:36 +01:00
Erik Faye-Lund	12f001f013	glsl: add has_implicit_conversions()-helper This makes the code a bit easier to read, as well as will reduce repetition when we add support for EXT_shader_implicit_conversions. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-11-02 11:10:36 +01:00
Mathias Fröhlich	9f009c1a8f	mesa: Remove needless indirection in some draw functions. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-02 08:42:03 +01:00
Timothy Arceri	c7bdda8aa5	nir: allow propagation of if evaluation for bcsel Shader-db results Skylake: total instructions in shared programs: 13109035 -> 13109024 (<.01%) instructions in affected programs: 4777 -> 4766 (-0.23%) helped: 11 HURT: 0 total cycles in shared programs: 332090418 -> 332090443 (<.01%) cycles in affected programs: 19474 -> 19499 (0.13%) helped: 6 HURT: 4 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-02 15:56:34 +11:00
Dave Airlie	677b496b6b	radv: fix begin/end transform feedback with 0 counter buffers. If the user gives 0 counterBuffers then the driver should still enable transform feedback on all targets. This changes the driver to always enable xfb, and use counter buffers where one is defined for the target in question. Fixes: `b4eb029062` (radv: implement VK_EXT_transform_feedback) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-11-02 04:15:07 +00:00
Dave Airlie	7f37a52a21	radv: apply xfb buffer offset at buffer binding time not later. (v2) In order to handle pause/resume properly, the offset should be added to the buffer binding not to the begin/end paths. v2: don't add offset to size Fixes ext_transform_feedback-alignment* under zink Fixes: `b4eb029062` (radv: implement VK_EXT_transform_feedback) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-11-02 04:13:31 +00:00
Mark Janes	5f312e95f8	Revert "i965/batch: avoid reverting batch buffer if saved state is an empty" This reverts commit `a9031bf9b5`. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108630	2018-11-01 16:28:05 -07:00
Eric Anholt	43a397c580	vc4: Drop the winsys_stride relayout in the simluator Since `0c1dd9dee0` ("broadcom/vc4: Allow importing linear BOs with arbitrary offset/stride."), we have the vc4-side BO properly laid out (assuming it's linear) in the winsys BO so that we can skip this extra copy.	2018-11-01 14:34:02 -07:00
Eric Anholt	4e1b163eed	v3d: Update the TLB config for depth writes on V3D 4.2. Fixes 311 piglit cases on the simulator.	2018-11-01 13:56:30 -07:00
Eric Anholt	4018eb04e8	v3d: Use the TLB R/B swapping instead of recompiles when available. The recompile reduction is nice, but this also makes it so that a straight texture copy could get optimized some day to not unpack/repack the f16 values.	2018-11-01 13:56:30 -07:00
Eric Anholt	3923cf626d	v3d: Take advantage of _mesa_hash_table_remove_key() in the simulator.	2018-11-01 13:54:36 -07:00
Eric Anholt	47586ab569	v3d: Respect user-passed strides for BO imports. If the caller has passed in a stride for (linear) BO import, we should use that stride when rendering to the BO (or, if we some day support texturing from linear-imported BOs, when doing the linear-to-UIF shadow copy). This lets us remove the extra stride-changing relayout in the simulator.	2018-11-01 13:54:36 -07:00
Eric Anholt	5313fb8abd	v3d: Drop #if 0-ed out v3d_dump_to_file(). This came from vc4, where we had a file format for GPU hangs. I don't have one of those for V3D, and I probably won't ever have the simulator side produce dumps even if I do.	2018-11-01 13:54:36 -07:00
Eric Anholt	d3f66c385b	v3d: Fix a typo in a comment in job handling.	2018-11-01 13:54:36 -07:00
Eric Anholt	b93fc160f4	v3d: Fix a copy-and-paste comment in the simulator code.	2018-11-01 13:54:36 -07:00
Anuj Phogat	13c955182f	anv/icl: Set Error Detection Behavior Control Bit in L3CNTLREG The default setting of this bit is not the desirable behavior. WA_1406697149 Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-01 12:00:23 -07:00
Anuj Phogat	b3d6937fb0	i965/icl: Set Error Detection Behavior Control Bit in L3CNTLREG The default setting of this bit is not the desirable behavior. WA_1406697149 Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-01 12:00:23 -07:00
Emil Velikov	ac95a0e024	docs: add 19.0.0-devel release notes template Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-01 18:56:54 +00:00
Emil Velikov	97c73c9174	mesa: bump version to 19.1.0-devel Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-01 18:54:02 +00:00
Dylan Baker	1f41104b9b	meson: don't install translation files Tested-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Fixes: `7834926a4f` ("meson: add support for generating translation mo files")	2018-11-01 10:49:16 -07:00
Eric Engestrom	4da169d368	egl: use the LC_ALL hammer instead of LANG Some environment (like Travis apparently) set LC_* vars, messing up the sort ordering, so let's use envvar with the highest priority to make sure this is actually sorted in ASCII order. Suggested-by: Michel Dänzer <michel@daenzer.net> Fixes: `b42dc50a5f` "egl: fix entrypoint sorting test" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-11-01 17:25:08 +00:00
Eric Engestrom	b42dc50a5f	egl: fix entrypoint sorting test Fixes: `68dc591af1` "egl: Fix eglentrypoint.h sort order." Cc: Emil Velikov <emil.l.velikov@gmail.com> Cc: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 15:45:26 +00:00
Andrii Simiklit	fc3cecda8c	intel/tools: fix resource leak Some memory and file descriptors are not freed/closed. v2: fixed case where we skipped the 'aub' variable initialization Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-01 13:21:07 +00:00
Jonathan Gray	ae8e81b0e3	intel/tools: include stdarg.h in error2aub Include stdarg.h in error2aub.c otherwise it fails to build on OpenBSD due to not finding definitions for va_list va_start va_end. Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-01 10:27:26 +00:00
Mathias Fröhlich	68dc591af1	egl: Fix eglentrypoint.h sort order. Fixes a make check failure. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108617 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 10:56:21 +01:00
Samuel Pitoiset	9cbdcc86b7	radv: set PA_SU_PRIM_FILTER_CNTL optimally Ported from RadeonSI. It's always TRUE for CIK+ because RADV doesn't support 16 samples. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-01 08:49:15 +01:00
Samuel Pitoiset	85010585cd	radv: only enable gl_SampleMask if MSAA is enabled too Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-01 08:49:11 +01:00
Samuel Pitoiset	0c08074cef	radv: use radeon_info::num_good_cu_per_sh Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-01 08:49:08 +01:00
Samuel Pitoiset	9278089d05	ac/nir: make use of i1false in few more places Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-01 08:49:05 +01:00
Samuel Pitoiset	79410b1e87	radv: add support for Raven2 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-01 08:48:52 +01:00
Mathias Fröhlich	ad52e19408	mesa: Collect all the draw functions in draw.{h,c}. Some of these functions were distributed across different implementation and header files. Put them at a central place. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	3d64f3c795	mesa/vbo: Move _vbo_draw_indirect -> _mesa_draw_indirect Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	f726c61cc1	mesa/vbo: Move src/mesa/vbo/vbo_exec_array.c -> src/mesa/main/draw.c The array type draw is no longer directly dependent on the vbo module. Thus move array type draws into mesa/main/draw.c. Rename symbols starting with vbo_* to _mesa_* and apply some reindenting to make it consistent. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	952a5da584	vbo: Pull the _mesa_set_draw_vao calls out of the if clauses. These calls are just the same in each if branch. So pull that before the if. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	b00cb994ef	vbo: Preserve vbo_save::no_current_update on primitive restart. With this change we preserve the no_current_update property when we observe a glPrimitiveRestart call. That means that we now also get the no_current_update optimization for display lists that are made out of indexed draws using primitive restart. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	f2a52b3c25	vbo: Make no_current_update an argument to vbo_save_NotifyBegin. Instead of coding additional information into the primitive mode, make the only remaining flag there a direct argument to vbo_save_NotifyBegin. v2: Fix incorrect no_current_update in glRectf. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	b899f5e59c	vbo: Move no_current_update out of _mesa_prim. The _mesa_prim::no_current_update flag should tell the compiled display list if the current attributes that are placed in the dlists vbo shall take a defined state past replay of a display list. Immediate mode draws compiled into display lists should set the current values. Array draws may leave the current values in undefined state. So finally this flag is not a property of every primitive but it is a property of the compiled display list and there it is a property of the last primitive compiled into the list. So move the flag out of _mesa_prim into vbo_save. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	eae4ee9419	vbo: Remove the now unused VBO_SAVE_PRIM_WEAK define. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	873adb06fa	vbo: Remove the always false branch dlist replay. The previous patch left a constant if (0) in the code. Clean that up now. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	1387b4d533	vbo: Test for VBO_SAVE_PRIM_WEAK in _mesa_prim::mode is false. When setting the _mesa_prim::mode field we always filter out all non OpenGL primitive mode bits. So this tested bit cannot be there anymore and the test evaluates to zero. The zero is removed with the next patch to ease review. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	cee0dd8d5a	vbo: Remove VBO_SAVE_PRIM_WEAK from vbo_save_NotifyBegin calls. Now looking at the implementation of vbo_save_NotifyBegin. The VBO_SAVE_PRIM_WEAK flag, delivered in the primitive mode argument to vbo_save_NotifyBegin, is not evaluated anymore. The two users of the mode argument are the primitive mode itself, where the VBO_SAVE_PRIM_WEAK bit is masked out to retrieve the underlying OpenGL primitive mode. The other user is to check for the VBO_SAVE_PRIM_NO_CURRENT_UPDATE bit which is different from VBO_SAVE_PRIM_WEAK. So, since vbo_save_NotifyBegin does not care about VBO_SAVE_PRIM_WEAK, we can savely remove it from the call arguments of vbo_save_NotifyBegin. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	b632c072b2	vbo: Remove set but not used weak field from _mesa_prim. The only reader of the weak field in _mesa_prim is pretty console printing. By that, remove the weak field from _mesa_prim. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	2dc951b7c3	vbo: Remove the VBO_SAVE_FALLBACK flag. On finishing a display list playback the VBO_SAVE_FALLBACK bit is still kept in vbo_save_context::replay_flags. But examining replay_flags and the display list flags that feed this value the corresponding bit is never set these days anymore. So, since it is nowhere set or checked, we can safely remove it. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Mathias Fröhlich	5b41504f66	vbo: Remove unused vbo_save_fallback function. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 06:08:49 +01:00
Emil Velikov	075f92b2b7	docs/relnotes: add the EGL Device extensions Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-01 00:05:43 +00:00
Emil Velikov	83c7fbb4e4	meson: egl: group dri2 bits separately from haiku One cannot have haiku and dri2 - surfaceless,x11,etc. Group things up, which will make the addition of platform_device a bit easier. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-11-01 00:05:43 +00:00
Emil Velikov	c7cc135e23	egl: enable EGL_EXT_device_{base,enumeration,query} Now that we support the extensions, fully, enabled them. The specs mandate that we always have at least one device and each dpy has a device associated with it. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 00:05:43 +00:00
Emil Velikov	00992700c9	egl: set the EGLDevice when creating a display This is the final requirement from the base EGLDevice spec. v2: - split from another patch - move wayland hunk after we have the fd Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 00:05:43 +00:00
Emil Velikov	dbb4457d98	egl: add EGL_EXT_device_drm support Add implementation based around the drmDevice API. As such it's only available only when building with libdrm. With the latter already a requirement when using !SW code paths in the platform code. Note: the current code will work if a device is hot-plugged. Yet hot-unplugged is not implemented, since I have no ways of testing it. v2: - ddd some _eglDeviceSupports checks - require DRM_NODE_RENDER - add _eglGetDRMDeviceRenderNode helper v3: - flip inverted asserts (Mathias) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 00:05:43 +00:00
Emil Velikov	f73c5d27c1	egl: add EGL_MESA_device_software support Add a plain software device, which is always available. We can safely assign it as the first/initial device in _eglGlobals, although we ensure that's the case with a handful of _eglDeviceSupports checks throughout the code. v2: - s/_eglFindDevice/_eglAddDevice/ (Eric) - s/_eglLookupAllDevices/_eglRefreshDeviceList/ (Eric) - move ^^ helpers into a earlier patch (Eric, Mathias) - set the SW device on _eglGlobal init. (Eric) - add a number of _eglDeviceSupports checks (Mathias) - split Device/Display attach to a separate patch v3: - flip inverted asserts (Mathias) - s/on-stack/static/ (Mathias) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 00:05:43 +00:00
Adam Jackson	3f08e500c4	specs: Add EGL_MESA_device_software The device extension string is expected to contain the name of the extension defining what kind of device it is, so the caller can know what kinds of operations it can perform with it. So that string had better be non-empty, hence this trivial extension. v2: - drop "fallback", update history and update contributor list Signed-off-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 00:05:43 +00:00
Emil Velikov	7552fcb7b9	egl: add base EGL_EXT_device_base implementation Introduce the API for device query and enumeration. Those at the moment produce nothing useful since zero devices are actually available. That contradicts with the spec, so the extension isn't advertised just yet. With later commits we'll add support for software (always) and hardware devices. Each one exposing the respective extension string. v2: - fold API boilerplate into this patch - move _eglAddDevice, _eglDeviceSupports, _eglRefreshDeviceList to this patch (Eric, Mathias) - make _eglFiniDevice the one called last v3: - comment on the dummy _egl_device_extension enum entry (Eric) - annotate dev as MAYBE_UNUSED (Mathias) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-11-01 00:05:43 +00:00
Emil Velikov	e55c1bcb08	glx: be explicit about when mapping X <> GLX visuals Write down both X and GLX visual types when mapping from one to the other. Makes grepping through the code a tiny bit easier. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-11-01 00:05:43 +00:00
Emil Velikov	833e3cad19	glx: remove unused __glXPreferEGL() declaration The function definition is no longer around, drop the useless declaration. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-11-01 00:05:43 +00:00
Emil Velikov	4428eed896	travis: use mako for python2 Earlier commit flipped the default to python2 but forgot to update the travis file. Props to pip caching things "worked" for a little while. Fixes: `f22ad5ef18` ("travis: use python3 for the autoconf builds") Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-01 00:05:43 +00:00
Dave Airlie	fcf15a007d	radv/xfb: don't increase offset by component mask start. This is incorrect, the offset is into the buffer, and it's legal to write loc 0,0 -> buffer0, offset 0 loc 0,1 -> buffer1, offset 0 This fixes a bunch of piglits running on my zink xfb code on radv. Fixes: `6c21645046` (radv: emit stream outputs for vertex and tessellation stages) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-31 23:48:10 +00:00
Dylan Baker	d25179469b	util/gen_xmlpool: Make use of python's foreach loop Instead of using a while loop with indexing. This is much cleaner. This requires some other small changes. Acked-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 16:37:46 -07:00
Dylan Baker	465cfcb266	util/gen_xmlpool: Don't use len to test for container emptiness This is a very common python anti-pattern. Not using length allows us to go through faster C paths, but has the same meaning. Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 16:37:46 -07:00
Dylan Baker	b9cd81ea31	util/gen_xmlpool: Don't write via shell redirection Using shell redirection to write to a file is more complicated than necessary, and has the potential to run into unicode encoding problems. It's also less code. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108530 v2: - update commit message to say less about LANG=C - use flags instead of positional arguments for the script (Emil) Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 16:37:46 -07:00
Dylan Baker	1df086662a	util/gen_xmlpool: use with statement to open file Which ensures it is closed at the end of the scope. Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 16:37:12 -07:00
Dylan Baker	bc4a7645e4	util/gen_xmlpool: use a main function Again, just good style Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 16:37:12 -07:00
Dylan Baker	187fad5c0b	util/gen_xmlpool: Use print function instad of sys.stderr.write This ensures that stderr is flushed, unlike writing Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 16:37:12 -07:00
Dylan Baker	2c2aa98ee7	util/gen_xmlpool: Use more standard style gen_xmlpool uses a style unlike the rest of mesa, spaces between function/method calls and the parens, strange whitespace to force lining up method calls, and some other whitespace stuff. Since I'm going to be doing some work in the file, I'm going to start cleaning those up. Acked-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 16:37:12 -07:00
Dylan Baker	a8004ef03e	docs/meson: Add note about update translations Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 16:37:12 -07:00
Dylan Baker	0621e91a8c	util/xmlpool: Update for meson generation Meson won't put the .gmo files in the layout that python's gettext.translation() expects, it puts them in the build directory in a flat layout. This modifies android and autotools to do the same (scons doesn't work with translations at all) v3: - Squash 4 patches into this patch Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 16:37:12 -07:00
Dylan Baker	7834926a4f	meson: add support for generating translation mo files Meson has handy a handy built-in module for handling gettext called i18n, this module works a bit differently than our autotools build does, namely it doesn't automatically generate translations instead it creates 3 new top level targets to run. These are: xmlpool-pot xmlpool-update-po xmlpool-gmo v2: - Add new files to autotools dist tarball Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 16:37:12 -07:00
Dylan Baker	2857b18991	util/gen_xmlpool: use argparse for argument handling This is a little cleaner than just looking at sys.argv, but it's also going to allow us to handle the differences in the way meson and autotools handle translations more cleanly. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-31 16:37:12 -07:00
Timothy Arceri	5b757b4097	nir: fix if condition propagation for alu use We need to update the cursor before we check if the alu use is dominated by the if condition. Previously we were checking if the current location of the alu instruction was dominated by the if condition which would miss some optimisation opportunities. Fixes: `a3b4cb3458` ("nir/opt_if: Rework condition propagation") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-11-01 09:22:55 +11:00
Vinson Lee	802ae533ab	freedreno: Do not link ir3_compiler with valgrind libraries. This patch fixes this freedreno autotools build error. CXXLD ir3_compiler /usr/lib/valgrind/libcoregrind-amd64-linux.a(libcoregrind_amd64_linux_a-m_main.o): In function `_start': (.text+0x0): multiple definition of `_start' /usr/lib/gcc/x86_64-linux-gnu/7/../../../x86_64-linux-gnu/Scrt1.o:(.text+0x0): first defined here /usr/bin/ld: /usr/lib/valgrind/libcoregrind-amd64-linux.a(libcoregrind_amd64_linux_a-m_main.o): relocation R_X86_64_32S against undefined symbol `vgPlain_interim_stack' can not be used when making a PIE object; recompile with -fPIC /usr/bin/ld: /usr/lib/valgrind/libcoregrind-amd64-linux.a(libcoregrind_amd64_linux_a-m_trampoline.o): relocation R_X86_64_32 against `.text' can not be used when making a PIE object; recompile with -fPIC /usr/bin/ld: /usr/lib/valgrind/libcoregrind-amd64-linux.a(libcoregrind_amd64_linux_a-dispatch-amd64-linux.o): relocation R_X86_64_32S against symbol `vgPlain_stats__n_xindirs_32' can not be used when making a PIE object; recompile with -fPIC /usr/bin/ld: final link failed: Nonrepresentable section on output collect2: error: ld returned 1 exit status Fixes: `f3cc0d2747` ("freedreno: import libdrm_freedreno + redesign submit") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108595 Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-10-31 15:05:28 -07:00
Emil Velikov	f22ad5ef18	travis: use python3 for the autoconf builds Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-31 19:16:00 +00:00
Emil Velikov	986033a275	configure: allow building with python3 Pretty much all of the scripts are python2+3 compatible. Check and allow using python3, while adjusting the PYTHON2 refs. Note: - python3.4 is used as it's the earliest supported version - python2 chosen prior to python3 v2: use python2 by default Cc: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-31 19:15:50 +00:00
Juan A. Suarez Romero	6d7d3dbda5	docs: update calendar, add new item and link release notes for 18.2.4 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-10-31 19:58:00 +01:00
Juan A. Suarez Romero	5b074c756e	docs: add sha256 checksums for 18.2.4 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `624e384ea8`)	2018-10-31 19:55:28 +01:00
Juan A. Suarez Romero	7c2239aa55	docs: add release notes for 18.2.4 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `1cdef5e70c`)	2018-10-31 19:55:25 +01:00
Eric Engestrom	091da79bb0	meson: hide warnings from external project `gtest` gtest is an external project that is copied in this tree for technical reasons, but isn't maintained by us, so its warnings are irrelevant. Cc: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-10-31 18:20:25 +00:00
Eric Engestrom	455a3cd515	tools/imgui: disable all warnings This is an external project we have no control over, and will not be fixing (other than by sometimes pulling the latest sources), so warnings serve no purpose here. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-31 16:28:33 +00:00
Alejandro Piñeiro	95b8da22cf	glspirv: no need to force entrypoint name to "main" Since commit "intel/compiler: Stop assuming the entrypoint is called "main"" there is no need to force the entrypoint name to be "main". Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-31 15:57:23 +01:00
Tapani Pälli	27f1298b9d	glsl/linker: validate attribute aliasing before optimizations Patch does a 'dry run' of assign_attribute_or_color_locations before optimizations to catch cases where we have aliasing of unused attributes which is forbidden by the GLSL ES 3.x specifications. We need to run this pass before unused attributes may be removed and with attribute binding information from program, therefore we re-use existing pass in linker rather than attempt to write another one. This fixes WebGL2 test 'gl-bindAttribLocation-aliasing-inactive' and Piglit test 'gles-3.0-attribute-aliasing'. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106833 Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-31 14:53:47 +02:00
Eric Engestrom	a96749b13c	egl: drop EGL driver `name` This is a revert of Marek's `2cb9ab53dd` revert. It was needed to revert the previous commit, and didn't have any issue itself. -- The "DRI2" name was reported as confusing when printing EGL infos (one user reported thinking DRI3 was not working on his X server), and the only alternative is Haiku, which can only be used on a Haiku machine. The name therefore doesn't add any information that the user wouldn't know already, so let's just drop it. Suggested-by: Emil Velikov <emil.l.velikov@gmail.com> Related-to: `b174a1ae72` ("egl: Simplify the "driver" interface") Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 11:01:54 +00:00
Eric Engestrom	cb0980e69a	egl: move alloc & init out of _eglBuiltInDriver{DRI2,Haiku} This is a revert of Marek's `84f3afc2e1` revert, with a missing line added back. I failed a rebase and dropped that crucial line, and didn't do a runtime test after my rebase, and as a result broke EGL for everyone. This commit has been tested by Intel's CI and I re-read it once more, so it should be good this time. -- Note: dropping the EGL_BAD_ALLOC in egl_haiku because it's overwritten by the EGL_NOT_INITIALIZED in eglInitialize(). Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-31 11:01:54 +00:00
Christian Gmeiner	21d9b78289	Revert "imx: make use of loader_open_render_node(..) helper" This reverts commit `773d6ea6e7`. Since kernel 4.17 (drm/etnaviv: remove the need for a gpu-subsystem DT node) the etnaviv DRM driver doesn't have an associated DT node anymore. This is technically correct, as the etnaviv device is a virtual device driving multiple hardware devices. Before 4.17 the userspace had access to the following information: DRIVER=etnaviv OF_NAME=gpu-subsystem OF_FULLNAME=/gpu-subsystem OF_COMPATIBLE_0=fsl,imx-gpu-subsystem OF_COMPATIBLE_N=1 MODALIAS=of:Ngpu-subsystemT<NULL>Cfsl,imx-gpu-subsystem DRIVER=imx-drm OF_NAME=display-subsystem OF_FULLNAME=/display-subsystem OF_COMPATIBLE_0=fsl,imx-display-subsystem OF_COMPATIBLE_N=1 Afer 4.17: DRIVER=etnaviv MODALIAS=platform:etnaviv The OF node has never been part of the etnaviv UABI, simply due to the fact that it's still possible to instantiate the etnaviv driver from a platform file, instead of a devicetree node. A patch set to fix this problem was send out [1] but it looks like that a proper solution needs more time to bake. [1] https://lists.freedesktop.org/archives/dri-devel/2018-October/194651.html Suggested-by: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2018-10-31 09:41:26 +01:00
Samuel Pitoiset	9ef8ea1451	radv: use WAIT_REG_MEM_GREATER_OR_EQUAL instead of a magic value Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-31 09:21:28 +01:00
Samuel Pitoiset	a9a56f47f8	radv: use pool->stride when calling radv_query_shader() Not needed to recompute the stride. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-31 09:21:28 +01:00
Samuel Pitoiset	e60ab66e33	radv: rename some parameters in Cmd{Begin,End}TransformFeedbackEXT() To match latest spec. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-31 09:21:28 +01:00
Samuel Pitoiset	57982b683b	radv/winsys: do not assign last submission when chained path failed I don't think we want to wait for something that hasn't been correctly submitted. This is similar to the fallback path. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-31 09:21:28 +01:00
Samuel Pitoiset	ae3aecd07f	radv/winsys: fix buffer deletion in the sysmem path In case we failed to submit the CS correctly. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-31 09:21:28 +01:00
Samuel Pitoiset	72877865d9	radv/winsys: cleanup the chained submission path Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-31 09:21:28 +01:00
Samuel Pitoiset	d12dd16a97	radv/winsys: remove unused surface_best() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-31 09:21:28 +01:00
Jason Ekstrand	d3a0d8b750	intel/compiler: Stop assuming the entrypoint is called "main" This isn't true for Vulkan so we have to whack it to "main" in anv which is silly. Instead of walking the list of functions and asserting that everything is named "main" and hoping there's only one function named "main", just use the nir_shader_get_entrypoint() helper which has better assertions anyway. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-10-30 20:14:52 -05:00
Timothy Arceri	31596836fc	st/glsl_to_nir: fix next_stage gathering ffs() just returns the bit that is set, we need to know what stage that bit represents so use u_bit_scan() instead. Fixes: `2ca5d9548f` ("st/glsl_to_nir: gather next_stage in shader_info") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-31 09:33:17 +11:00
Timothy Arceri	9ec4a5ef29	st/mesa: calculate buffer size correctly for packed uniforms Fixes: `edded12376` ("mesa: rework ParameterList to allow packing") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-31 09:32:41 +11:00
Dylan Baker	fb02bd3d1c	util: move u_cpu_detect to util CC: vlee@freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107870 Fixes: `80825abb5d` ("move u_math to src/util") Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Dylan Baker	37db383abb	util: Move u_debug to utils Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Dylan Baker	2fd5dff7e7	util: Move os_misc to util this is needed by u_debug Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Dylan Baker	f1f104e548	gallium/util: remove u_inlines.h from u_debug.c It's not used, and I'm not pulling u_inlines into src/util. Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Dylan Baker	59d494c1cc	gallium/util: remove p_format.h from u_debug.h Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Dylan Baker	314777e86a	gallium/util: move memory debug declarations into u_debug_gallium Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Dylan Baker	68074dfa0e	gallium/util: move debug_print_tranfer_flags to u_debug_galilum This also appears to be unused. Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Dylan Baker	fc39dc9841	gallium/util: move debug_print_bind_flags to u_debug_gallium This also appears to be unused. Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Dylan Baker	e4f1fea821	gallium/util: move debug_print_usage_enum to the u_debug_gallium This isn't used in mesa, maybe vmware uses this in a closed source state tracker? Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Dylan Baker	078b3cdb34	gallium/util: start splitting u_debug into generic and gallium specific components In order to pull u_debug into src/util we need to break the generically useful bits from the bits that are tightly coupled to gallium. Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Dylan Baker	389d59c72a	gallium: split u_prim_name out of u_debug.h This allows us to pull u_prim.h out of u_debug.h Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00
Andre Heider	25a3ce97d5	gallium/hud: fix power sensor readings for amdgpu users amdgpu doesn't use the INPUT but the AVERAGE subfeature: $ sensors -u amdgpu-pci-0100 Adapter: PCI adapter power1: power1_average: 17.233 power1_cap: 180.000 Signed-off-by: Andre Heider <a.heider@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 16:30:32 -04:00
Rhys Perry	5172eb231d	glsl_to_tgsi: don't create 64-bit integer MAD/FMA TGSI has no I64MAD/U64MAD opcode. Fixes: `278580729a` ('st/glsl_to_tgsi: add support for 64-bit integers') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 20:27:12 +00:00
Marek Olšák	26cb93e229	radeonsi: add support for Raven2 (v2) v2: fix enabling primitive binning Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-30 16:03:02 -04:00
Marek Olšák	0dea85928e	radeonsi: clean up decompress flags in fast color clear	2018-10-30 16:03:02 -04:00
Marek Olšák	99835fff08	radeonsi/gfx9: set optimal OVERWRITE_COMBINER_WATERMARK	2018-10-30 16:03:02 -04:00
Marek Olšák	8ad12c8bec	gallium: rework PIPE_HANDLE_USAGE_* flags Only radeonsi uses them, so adjust them to match its needs.	2018-10-30 16:03:02 -04:00
Danylo Piliaiev	00fc56a68d	anv: Disable dual source blending when shader doesn't support it on gen8+ Dual source blending behaviour is undefined when shader doesn't have second color output. "If SRC1 is included in a src/dst blend factor and a DualSource RT Write message is not used, results are UNDEFINED. (This reflects the same restriction in DX APIs, where undefined results are produced if “o1” is not written by a PS – there are no default values defined)." Dismissing fragment in such situation leads to a hang on gen8+ if depth test in enabled. Since blending cannot be gracefully fixed in such case and the result is undefined - blending is simply disabled. v2 (Jason Ekstrand): - Apply the workaround to each individual entry - Emit a warning through debug_report Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-10-30 12:59:53 -07:00
Danylo Piliaiev	eca4a6548d	i965: Disable dual source blending when shader doesn't support it on gen8+ Dual source blending behaviour is undefined when shader doesn't have second color output, dismissing fragment in such situation leads to a hang on gen8+ if depth test in enabled. Since blending cannot be gracefully fixed in such case and the result is undefined - blending is simply disabled. v2 (Kenneth Graunke): - Listen to BRW_NEW_FS_PROG_DATA in 3DSTATE_PS_BLEND - Also whack BLEND_STATE[] to keep the two in sync, since we're not sure exactly which copy of the redundant info the hardware will use. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107088 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-10-30 12:59:53 -07:00
Kenneth Graunke	337a808062	i965: Respect GL_TEXTURE_SRGB_DECODE_EXT in GenerateMipmaps() Apparently, we're supposed to look at the texture object's built-in sampler object's sRGB decode setting in order to decide whether to decode/downsample/re-encode, or simply downsample as-is. Previously, I had always done the decoding/encoding. Fixes SKQP's Skia_Unit_Tests.SRGBMipMaps test. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-10-30 12:59:53 -07:00
Andrii Simiklit	e4e0fd5ffe	i965/batch: don't ignore the 'brw_new_batch' call for a 'new batch' If we restore the 'new batch' using 'intel_batchbuffer_reset_to_saved' function we must restore the default state of the batch using 'brw_new_batch' function because the 'intel_batchbuffer_flush' function will not do it for the 'new batch' again. At least the following fields of the batch 'state_base_address_emitted','aperture_space', 'state_used' should be restored to default values to avoid: 1. the aperture_space overflow 2. the missed STATE_BASE_ADDRESS commad in the batch 3. the memory overconsumption of the 'statebuffer' due to uncleared 'state_used' field. etc. v2: merge with new commits, changes was minimized, added the 'fixes' tag v3: added in to patch series Fixes: `3faf56ffbd` "intel: Add an interface for saving/restoring the batchbuffer state." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107626 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-10-30 12:09:17 -07:00
Andrii Simiklit	a9031bf9b5	i965/batch: avoid reverting batch buffer if saved state is an empty There's no point reverting to the last saved point if that save point is the empty batch, we will just repeat ourselves. CC: Chris Wilson <chris@chris-wilson.co.uk> Fixes: `3faf56ffbd` "intel: Add an interface for saving/restoring the batchbuffer state." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107626 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-10-30 12:09:09 -07:00
Eric Engestrom	ea738a91de	egl: add messages to a few assert() and turn a couple into unreachable() Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-30 18:10:59 +00:00
Eric Engestrom	d0d6ec549d	util: s/0/NULL/ for pointer Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-30 18:10:59 +00:00
Eric Engestrom	5c64847322	i965: add missing case to fix -Wswitch While at it, turn "unreachable" assert() into unreachable(). Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-30 18:10:59 +00:00
Eric Engestrom	2894e278cf	mesa: fix struct/class mismatch Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-30 18:10:59 +00:00
Eric Engestrom	6000895e2d	mesa: fix memcpy() and memset(0) of non-trivial structs Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-30 18:10:59 +00:00
Eric Engestrom	69eb6d58e8	nouveau: remove unused class member Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-10-30 18:10:59 +00:00
Eric Engestrom	6f9309d5d4	scons: drop unused HAVE_STDINT_H macro This was required back when MSVC didn't support C99 and was missing this header, but since MSVC 2013 (or maybe earlier?) this isn't it does and this code isn't doing anything anymore. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-30 18:10:59 +00:00
Eric Engestrom	a18d726621	aub_viewer: show vertex buffer pitch Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-30 18:10:59 +00:00
Eric Engestrom	0bbee28a3b	meson: add note about intel tools build options Fixes: `ea83a1d304` "intel: tools: import ImGui" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-10-30 18:10:59 +00:00
Eric Engestrom	4a266d01a7	vl: drop left-over variable Fixes: `6ccc435e7a` "pipe-loader: move dup(fd) within pipe_loader_drm_probe_fd" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-30 18:10:59 +00:00
Eric Anholt	68657d76b9	vc4: Fix unused variable warning. Fixes: `bb84fa146f` ("util: use C99 declaration in the for-loop hash_table_foreach() macro")	2018-10-30 10:46:52 -07:00
Eric Anholt	cc54e1acf9	v3d: Use nir_remove_unused_io_vars to handle binner shader output DCE We were doing this late after nir_lower_io, but we can just reuse the core code. By doing it at this stage, we won't even set up the VS attributes as inputs, reducing our VPM size.	2018-10-30 10:46:52 -07:00
Eric Anholt	c152c79d5e	v3d: Only add output slot tracking for the current varying slot. We always emit 4 slots per slot because things like color output and position processing in the epilogue will potentially look up more values than the variable declaration had. However, when we get a .location_frac != 0, we don't want to overwrite components of the following .driver_location.	2018-10-30 10:46:52 -07:00
Eric Anholt	17c8198952	v3d: Use nir_lower_io_to_scalar_early to DCE unused VS input components. This lets us trim unused trailing components in the vertex attributes, reducing the size of our VPM allocations.	2018-10-30 10:46:52 -07:00
Eric Anholt	fc85f7cfdc	v3d: Don't rely on sorting input vars for VPM read setup. For supporting scalar VPM i/o at the NIR level, we need to do a pass over the vars to figure out how big each attribute is after DCE. Once we've done that, we can just walk over c->vattr_sizes[] instead of bothering with vars.	2018-10-30 10:46:52 -07:00
Eric Anholt	cc78676030	v3d: Split out NIR input setup between FS and VPM. They don't share much code, and I'm about to rewrite the remaining shared code for the VPM case.	2018-10-30 10:46:52 -07:00
Eric Anholt	8265dfaa87	nir: Allow using nir_lower_io_to_scalar_early on VS input vars. This will be used on V3D to cut down the size of the VS inputs in the VPM (memory area for sharing data between shader stages). Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-30 10:46:52 -07:00
Jason Ekstrand	f48b742289	anv: Bump the advertised patch version to 90 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-30 11:43:43 -05:00
Emil Velikov	29283921b7	m4: add Werror when checking for compiler flags Seemingly that at some point clang started accepting _any_ flags, whereas previously it would error out. These days, you can give it -Whamsandwich and it will succeed, while at the same time throwing an annoying warning. Add -Werror so that everything gets flagged and set accordingly. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108082 Cc: Vinson Lee <vlee@freedesktop.org> Repored-by: Vinson Lee <vlee@freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-30 16:41:05 +00:00
Dylan Baker	a8bed38b54	docs/calendar: Add 18.3 plan and expand 18.2 Emil will be helping out with 18.3, while Juan finalises 18.2 v2: [Emil] add Emil for 18.3, fix typos CC: Emil Velikov <emil.velikov@collabora.com> CC: Juan A. Romero Suarez <jasuarez@igalia.com> Cc: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-10-30 16:35:58 +00:00
Emil Velikov	c210d0c3b7	vulkan/wsi: use the drmGetDevice2() API On older kernels, the drmGetDevice() call will wake up all the GPUs on the system, while fetching the PCI revision. Use the 2 version of the API and pass flags == 0, so we don't fetch the device PCI revision, since we don't need that information. Fixes: `baa38c144f` ("vulkan/wsi: Use VK_EXT_pci_bus_info for DRM fd matching") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-30 16:35:50 +00:00
Jason Ekstrand	a45b6fb452	spirv: Pass SSA values through functions Previously, we would create temporary variables and fill them out. Instead, we create as many function parameters as we need and pass them through as SSA defs. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-10-30 11:22:44 -05:00
Mauro Rossi	bfe0e32913	android: i965/tiled_memcpy: fix build for x86 generic target x86 32 bit generic target does not enable ARCH_X86_HAVE_SSE4_1 for this reason all Android library modules using SSE4_1 in mesa are built conditionally to ARCH_X86_HAVE_SSE4_1 The same approach is now applied to libmesa_intel_tiled_memcpy_sse41 in order to avoid the following building errors: external/mesa/src/mesa/drivers/dri/i965/intel_tiled_memcpy.c:574:15: error: initializing '__m128i' (vector of 2 'long long' values) with an expression of incompatible type 'int' __m128i val = _mm_stream_load_si128((__m128i )src); ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ external/mesa/src/mesa/drivers/dri/i965/intel_tiled_memcpy.c:578:15: error: initializing '__m128i' (vector of 2 'long long' values) with an expression of incompatible type 'int' __m128i val0 = _mm_stream_load_si128(((__m128i )src) + 0); ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ external/mesa/src/mesa/drivers/dri/i965/intel_tiled_memcpy.c:579:15: error: initializing '__m128i' (vector of 2 'long long' values) with an expression of incompatible type 'int' __m128i val1 = _mm_stream_load_si128(((__m128i )src) + 1); ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ external/mesa/src/mesa/drivers/dri/i965/intel_tiled_memcpy.c:580:15: error: initializing '__m128i' (vector of 2 'long long' values) with an expression of incompatible type 'int' __m128i val2 = _mm_stream_load_si128(((__m128i )src) + 2); ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ external/mesa/src/mesa/drivers/dri/i965/intel_tiled_memcpy.c:581:15: error: initializing '__m128i' (vector of 2 'long long' values) with an expression of incompatible type 'int' __m128i val3 = _mm_stream_load_si128(((__m128i *)src) + 3); ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 5 errors generated. Fixes: `11b1afdc92` ("i965/tiled_memcpy: inline movntdqa loads in tiled_to_linear") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-10-30 14:45:16 +02:00
Toni Lönnberg	50e952840f	intel: tools: Add handling for video pipe Preliminary work for adding handling of different pipes to gen_decoder. We need to be able to distinguish between different pipes in order to decode the packets correctly due to opcode re-use. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-30 12:43:00 +00:00
Toni Lönnberg	d5a938c58d	intel/decoder: Use 'DWord Length' and 'bias' fields for packet length. Use the 'DWord Length' and 'bias' fields from the instruction definition to parse the packet length from the command stream when possible. The hardcoded mechanism is used whenever an instruction doesn't have this field. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-30 12:43:00 +00:00
Marek Olšák	a09cbaffbf	mesa: expose EXT_texture_compression_s3tc on GLES The spec was modified to support GLES. Tested-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-10-30 13:31:00 +01:00
Michał Janiszewski	2734baa9e2	mesa: Add missing include guards Signed-off-by: Michał Janiszewski <janisozaur+signed@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-10-30 06:19:10 -06:00
Michał Janiszewski	ec994ca0fc	glx: Add missing include guards Signed-off-by: Michał Janiszewski <janisozaur+signed@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-10-30 06:19:10 -06:00
Michał Janiszewski	8ebd7039c4	svga: Add missing include guards Signed-off-by: Michał Janiszewski <janisozaur+signed@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-10-30 06:19:09 -06:00
Michał Janiszewski	0654450911	glsl: Add missing include guards Signed-off-by: Michał Janiszewski <janisozaur+signed@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-10-30 06:19:09 -06:00
Eric Engestrom	fddf384d1d	intel/batch-decoder: remove never-used function This function was there when the file was introduced in commit `38f10d5a03` "intel: tools: add aubinator viewer", but was never actually used. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-30 10:59:43 +00:00
Eric Engestrom	e9fb81375a	st/dri: remove leftover local variable Left over from the cleanup in `6ccc435e7a` "pipe-loader: move dup(fd) within pipe_loader_drm_probe_fd" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-30 10:20:58 +00:00
Vadym Shovkoplias	7d66eddbbd	glsl/linker: Fix out variables linking during single stage Since out variables are copied from shader objects instruction streams to linked shader instruction steam it should be cloned at first to keep source instruction steam unaltered. Fixes: `966a797e43` ("glsl/linker: Link all out vars from a shader objects on a single stage") Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105731	2018-10-30 10:19:17 +11:00
Marek Olšák	8676af12c8	ac: fix ac_build_fdiv for f64 trivial Fixes: `a5f35aa742`	2018-10-29 17:24:21 -04:00
Brian Paul	9007c0ed26	nir: fix yet another MSVC build break Trivial.	2018-10-29 11:15:12 -06:00
Eric Engestrom	f3a5757eba	vulkan/wsi: simplify meson file tracking Meson already automatically tracks included headers, so there's no need to add them everywhere; cleans up the code a bit. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-10-29 16:39:47 +00:00
Eric Engestrom	1df0c1e8fb	clover: add missing meson build dependency Fixes: `42ea0631f1` "meson: build clover" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-10-29 16:39:42 +00:00
Eric Engestrom	98e7c3e7a7	svga: add missing meson build dependency Fixes: `a537231b22` "meson: build svga driver on linux" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-10-29 16:39:38 +00:00
Eric Engestrom	912cd0ce3b	radv: add missing meson build dependency Fixes: `9d40ec2cf6` "radv: Add support for VK_KHR_driver_properties." Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-10-29 16:39:34 +00:00
Eric Engestrom	2be1f9ceba	anv: add missing meson build dependency Fixes: `e4538b93f5` "anv: Implement VK_KHR_driver_properties" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-10-29 16:39:07 +00:00
Samuel Pitoiset	b4eb029062	radv: implement VK_EXT_transform_feedback This implementation should work and potential bugs can be fixed during the release candidates window anyway. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:10:58 +01:00
Samuel Pitoiset	f8d0337299	radv: add multiple streams support for the GS copy shader Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Samuel Pitoiset	6c21645046	radv: emit stream outputs for vertex and tessellation stages Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Samuel Pitoiset	19f1b49236	radv: declare streamout SGPRs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Samuel Pitoiset	f4fa8de794	radv: gather stream output info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Samuel Pitoiset	fe551ec122	radv: allow to emit a vertex to a specified stream This is required for GS multiple streams support. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Samuel Pitoiset	a59f1b06ef	radv: allow to use up to 4 GSVS ring buffers For all streams. We basically just need to update the base address and compute a stride for every stream. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Samuel Pitoiset	98c09c3fcd	radv: adjust the number of output components per stream Same as the previous patch, except that is only the number of components. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Samuel Pitoiset	4649471a9e	radv: adjust the GSVS ring sizes based on the number of components For multiple streams support we have to set the different ring buffer sizes correctly. This relies on the number of output components per stream. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Samuel Pitoiset	8e428e24a8	radv: gather which GS stream is used for every outputs To only emit outputs for the given stream. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Samuel Pitoiset	dd996d1885	radv: gather the number of output components per stream This will be also used for splitting the GS->VS ring buffer. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Samuel Pitoiset	87e6866b04	radv: gather the number of streams used by geometry shaders This will be used for splitting the GS->VS ring buffer. The stream ID is always 0 for now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 17:09:08 +01:00
Jason Ekstrand	19064b8c3a	nir: Add a pass for gathering transform feedback info This is different from the GL_ARB_spirv pass because it generates a much simpler data structure that isn't tied to OpenGL and mtypes.h. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-29 17:09:08 +01:00
Jason Ekstrand	e8a5fa054d	vulkan: Update the XML and headers to 1.1.90 This doesn't include any new features but it does include an XML and header typo fix for modifiers. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-29 10:17:19 -05:00
Samuel Pitoiset	9e56ffb0b4	radv: remove wrong comment in calculate_gs_ring_sizes() about streams The computation seems correct compared to RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-29 12:33:58 +01:00
Rob Clark	a61952e737	freedreno: don't flush when new and old pfb is identical In the 'inorder' case (ie. FD_MESA_DEBUG=inorder, or old kernel), if the u_blitter clear path is used (a3xx, a4xx, and some fallback cases on newer gens), util_blitter_restore_fb_state() will set_framebuffer_state() to something that is identical to the current fb state, which triggers an unnecessary flush, and then eventually an assert: (gdb) bt #0 0x0000007fbf24a078 in kill () from /lib64/libc.so.6 #1 0x0000007fbe061278 in _debug_assert_fail (expr=0x7fbe93a820 "!batch->flushed", file=0x7fbe93a628 "../src/gallium/drivers/freedreno/freedreno_batch.c", line=491, function=0x7fbe93a990 <__func__.17380> "fd_batch_check_size") at ../src/gallium/auxiliary/util/u_debug.c:322 #2 0x0000007fbe1ccb8c in fd_batch_check_size (batch=0x55556d5a70) at ../src/gallium/drivers/freedreno/freedreno_batch.c:491 #3 0x0000007fbe1d0e08 in fd_clear (pctx=0x55555c61e0, buffers=5, color=0x55556e388c, depth=1, stencil=0) at ../src/gallium/drivers/freedreno/freedreno_draw.c:463 #4 0x0000007fbe57afa4 in st_Clear (ctx=0x55556e17b0, mask=18) at ../src/mesa/state_tracker/st_cb_clear.c:452 The assert was introduced in `4b847b38ae`, so from a functionality standpoint this patch fixes that commit. But it should also avoid an unnecessary flush in the 'inorder' case, fixing a performance bug. Fixes: `4b847b38ae` freedreno: make fd_batch a one-shot thing Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-28 14:03:38 -04:00
Rob Clark	32dd75b927	freedreno: dependency tracking for z/s depends on ZSA state ZSA state can change whether depth or stencil is enabled This plus previous patch fix stk, and various things w/ FD_MESA_DEBUG=inorder Fixes: `ec717fc629` freedreno: reduce resource dependency tracking overhead Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-28 14:03:38 -04:00
Rob Clark	05e868925c	freedreno: mark all state dirty after switching batch The problem isn't directly with `ec717fc629` but rather that commit exposes the problem. When we switch batch we cannot assume previous state is clean so we should mark all state dirty. Fixes: `ec717fc629` freedreno: reduce resource dependency tracking overhead Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-28 14:03:38 -04:00
Jason Ekstrand	1bd4f8fefc	anv: Use absolute timeouts in wait_for_bo_fences We were previously using relative timeouts and decrementing the user-provided timeout as we waited. Instead, this commit refactors things to use absolute timeouts throughout. This should fix a subtle bug in the waitAll case where we aren't decrementing the timeout after a successful GPU wait. Since pthread_cond_timedwait already takes an absolute timeout, it's also significantly simpler. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-27 16:18:33 -05:00
Jason Ekstrand	cbd4468695	anv: Flag semaphore BOs as external It probably doesn't actually break anything but it does cause some assertions in debug builds. Fixes: `7a89a0d9ed` "anv: Use separate MOCS settings for external BOs" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-27 00:02:32 -05:00
Jason Ekstrand	663a113700	anv: Improve the asserts in anv_buffer_get_range Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-27 00:02:32 -05:00
Rob Clark	c41772d17a	freedreno/a6xx: inline draw_impl() Now that it is just called once per draw (instead of once for binning and once for draw), let's just inline it. If nothing else, it makes perf-annotate easier to look at. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-26 18:10:00 -04:00
Rob Clark	604b5f1dca	freedreno/a6xx: small cleanup Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-26 18:10:00 -04:00
Rob Clark	2a74d9ae8d	freedreno/a6xx: move where we handle dirty vbo state Historically this wasn't in fdN_emit_state(), because prior to addition of blitter in a5xx, fdN_emit_state() was also used in the clear path. These days that is only true for a2xx (a3xx and a4xx use u_blitter). So the reason for it not to be in fd6_emit_state() no longer exists. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-26 18:10:00 -04:00
Rob Clark	ddb7fadaf8	freedreno: avoid no-op flushes by re-using last-fence Noticed that with webgl (in chromium, at least) we end up generating a lot of no-op submits just to get a fence. Tracking the last fence and returning that if there is no rendering since last flush avoids this. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-26 18:10:00 -04:00
Kristian H. Kristensen	01194cd582	freedreno/a6xx: Move stencil/depth/alpha state to IB Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-10-26 18:10:00 -04:00
Kristian H. Kristensen	a664dc2d59	freedreno/a6xx: Move stencil mask emit to FD_DIRTY_ZSA group Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-10-26 18:10:00 -04:00
Kristian H. Kristensen	3073926512	freedreno/a6xx: Rename FD6_GROUP_ZSA ro FD6_GROUP_LRZ Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-10-26 18:10:00 -04:00
Kristian H. Kristensen	edc0f1b10f	freedreno/a6xx: Move rasterizer state to state object Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-10-26 18:10:00 -04:00
Kristian H. Kristensen	3264eb691a	freedreno/a6xx: Fix set_blit_scissor helper The scissor maxx/maxy are non-inclusive, so don't subtract one from framebuffer width and height. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-10-26 18:10:00 -04:00
Kristian H. Kristensen	4222fe8af2	freedreno/a2xx: Squash a compiler warning We get a warning here for assigning a const char * pointer to char *swizzle in struct ir2_src_register. The constructor strdups a 4 byte string here, so just memcpy to that instead. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-10-26 18:10:00 -04:00
Kristian H. Kristensen	4fd6265f42	freedreno/a6xx: Use fd6_emit_ib from a6xx Move it to a header and use it where possible to avoid vfunc call. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-10-26 18:10:00 -04:00
Rob Clark	f3cc0d2747	freedreno: import libdrm_freedreno + redesign submit In the pursuit of lowering driver overhead, it became clear that some amount of redesign of how libdrm_freedreno constructs the submit ioctl would be needed. In particular, as the gallium driver is starting to make heavier use of CP_SET_DRAW_STATE state groups/objects, the over- head of tracking cmd buffers and relocs becomes too much. And for "streaming" state, which isn't ever reused (like uniform uploads) the overhead of allocating/freeing ringbuffer[1] objects is too high. This redesign makes two main changes: 1) Introduces a fd_submit object for tracking bos and cmds table for the submit ioctl, making ringbuffer objects more light- weight. This was previously done in the ringbuffer. But we have many ringbuffer instances involved in a submit (gmem + draw + potentially 1000's of state-group rbs), and only need a single bos and cmds table. (Reloc table is still per-rb) The submit is also a convenient place for a slab allocator for ringbuffer objects. Other options would have required locking because, while we can guarantee allocations will only happen on a single thread, free's could happen either on the application thread or the flush_queue thread. With the slab allocator in the submit object, any frees that happen on the flush_queue thread happen after we know that the application thread is done with the submit. 2) Introduce a new "softpin" msm_ringbuffer_sp implementation that does not use relocs and only has cmds table entries for IB1 (ie. the cmdstream buffers that kernel needs to CP_INDIRECT_BUFFER to from the RB). To do this properly will require some updates on the kernel side, so whether you get the softpin or legacy submit/ringbuffer implementation at runtime depends on your kernel version. To make all these changes in libdrm would basically require adding a libdrm_freedreno2, so this is a good point to just pull the libdrm code into mesa. Plus it allows for using mesa's hashtable, slab allocator, etc. And it lets us have asserts enabled for debug mesa buids but omitted for release builds. And it makes life easier if further API changes become necessary. At this point I haven't tried to pull in the kgsl backend. Although I left the level of vfunc indirection which would make it possible to have other backends. (And this was convenient to keep to allow for the "softpin" ringbuffer to coexist.) NOTE: if bisecting a build error takes you here, try a clean build. There are a bunch of ways things can go wrong if you still have libdrm_freedreno cflags. [1] "ringbuffer" is probably a bad name, the only level of cmdstream buffer that is actually a ring is RB managed by kernel. User- space cmdstream is all IB1/IB2 and state-groups. Reviewed-by: Kristian H. Kristensen <hoegsberg@chromium.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-26 18:10:00 -04:00
Jason Ekstrand	aa02d7e878	Revert "anv/skylake: disable ForceThreadDispatchEnable" This reverts commit `0fa9e6d7b3`. The real issue appears to have been that HiZ ops don't like having WM thread dispatch force-enabled. The previous commit fixes that problem so we can go back to using the ForceThreadDispatchEnable bit even on SKL+. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-10-26 16:39:47 -05:00
Jason Ekstrand	b6b2b27809	blorp: Emit a dummy 3DSTATE_WM prior to 3DSTATE_WM_HZ_OP Cc: mesa-stable@lists.freedesktop.org Suggested-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-10-26 16:39:35 -05:00
Axel Davy	2318ca68bb	st/nine: Handle window resize when a presentation buffer is used Usually when a window is resized, the app calls d3d to resize the back buffer to the window size. In some cases, it is not done, and it expects the output resizes to the window size, even if the back buffer size is unchanged. This patch introduces the behaviour when a presentation buffer is used. ID3DPresent_GetWindowInfo is a function available with D3DPresent v1.0, and thus we don't need to check if the function is available. The function had been introduced to implement this very feature. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	e50d374b61	d3dadapter: Fix wrong naming in header file GetWindowInfo used to be GetWindowSize before gallium nine was merged. A left-over remained... Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	3d975e98e4	st/nine: Reduce MaxSimultaneousTextures to 8 Windows drivers don't set this flag (which affects ff) to more than 8. Do the same in case some games check for 8. v2: Remove any dependence on MaxSimultaneousTextures. For non-ff the number of textures is 16 when the device is able of vs/ps3. Add this requirement of 16 textures to the driver requirements. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	739c700950	st/nine: Enable shadow mapping for ps 1.X We didn't implement shadow textures for ps 1.X, assuming the case couldn't happen... Well it does. Fixes: https://github.com/iXit/Mesa-3D/issues/261 Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	847861aab4	st/nine: Do not set unused states for stateblocks A lot of these states are used only for the context, and are unused for stateblocks (which just uses the changed.* fields instead for a lot of them). Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	6f373b9b74	st/nine: Fix aliasing states for stateblocks If NINE_STATE_FF_MATERIAL is set, the stateblock will upload its recorded materials matrix. If NINE_STATE_FF_LIGHTING is set, the lighting set is uploaded. These flags could be set by a NineDevice9_SetTransform call or by setting some states related to ff, but that shouldn't trigger these stateblock behaviours. We don't need to follow the context states dirtied by render states. NINE_STATE_FF_VSTRANSF is exactly the state controlling stateblock updates of transformation matrices, NINE_STATE_FF is too broad. These two changes avoid setting the two mentionned states when we shouldn't. Fixes: https://github.com/iXit/Mesa-3D/issues/320 Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	454201b452	st/nine: Never update device changed.* fields The device state changed.* field are never used. These fields are used only for stateblocks. Avoid setting them at all for clarity. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	2594b2efdc	st/nine: Capture also default matrices for D3DSBT_ALL We avoid allocating space for never unused matrices. However we must do as if we had captured them. Thus when a D3DSBT_ALL stateblock apply has fewer matrices than device state, allocate the default matrices for the stateblock before applying. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	bbeddb801e	st/nine: Mark transform matrices dirty for D3DSBT_ALL D3DSBT_ALL stateblocks capture the transform matrices. Fixes some d3d test programs not displaying properly. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	a4e9bbb8f8	st/nine: Don't update unused world matrices While to the application we have to track accurately all 256 world matrices (including in stateblocks), hw vertex processing enables to set a limit to the number of world matrices the hardware can access to in the advertised caps, which is 8 for nine. Thus don't bother in the stateblock code to send the updated values for the unreachable matrices. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	2e51c4c7cc	st/nine: Remove two unused states. NINE_STATE_MATERIAL was used incorrectly at one location. Replace it with the correct state. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Axel Davy	cb8ea21e1c	st/nine: Remove commented nine_context_apply_stateblock At some point the project was to adapt the commented version to csmt. The csmt rework enabled to fix some state aliasing issues between stateblocks and internal state updates. The commented version needs a lot of work to work with that. Just drop it. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-10-26 22:16:16 +02:00
Brian Paul	7e64e39f8b	nir: Fix array initializer Empty initializer is not standard C. This fixes MSVC build. Trivial.	2018-10-26 12:35:48 -06:00
Jason Ekstrand	07eb8e7466	anv: Return VK_ERROR_DEVICE_LOST from anv_device_set_lost This lets us get rid of a bunch of duplicated error messages. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-26 13:27:21 -05:00
Jason Ekstrand	ade22ae1ac	anv/util: Split a vk_errorv helper out of vk_errorf Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-26 13:27:21 -05:00
Brian Paul	d6be0b5556	scons/svga: remove opt from the list of valid build types This reverts commit `a5fd54f8bf`. The whole point was to add a way to pass -DVMX86_STATS to the build, but we can do that with a command line argument when we invoke scons. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2018-10-26 12:09:00 -06:00
Nanley Chery	5bcf479524	intel/blorp: Define the clear value bounds for HiZ clears Follow the restriction of making sure the clear value is between the min and max values defined in CC_VIEWPORT. Avoids a simulator warning for some piglit tests, one of them being: ./bin/depthstencil-render-miplevels 146 d=z32f_s8 Jason found this to fix incorrect clearing on SKL. Fixes: `09948151ab` ("intel/blorp: Add the BDW+ optimized HZ_OP sequence to BLORP") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-26 10:34:07 -07:00
Eric Engestrom	285ebc84c7	radv: remove duplicate brackets in version string MESA_GIT_SHA1 resolves to either an empty "" string if not build from git, or " (git-DEADBEEF)" if it is. No need to wrap it in additional "()". Fixes: `9d40ec2cf6` "radv: Add support for VK_KHR_driver_properties." Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-26 18:33:11 +01:00
Eric Engestrom	738f0f789b	vulkan: drop always-true param Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-26 18:33:11 +01:00
Boyuan Zhang	f4126cfaab	radeon/vcn: use util function to get h264 profile idc Use utility function for converting h264 pipe video profile to profile idc, instead of using array. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Acked-by: Christian König <christian.koenig at amd.com>	2018-10-26 13:23:06 -04:00
Boyuan Zhang	55cf565698	radeon/vce: use util function to get h264 profile idc Use utility function for converting h264 pipe video profile to profile idc, instead of using array. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Acked-by: Christian König <christian.koenig at amd.com>	2018-10-26 13:23:06 -04:00
Boyuan Zhang	b15d0200a9	vl: get h264 profile idc Adding a function for converting h264 pipe video profile to profile idc Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Acked-by: Christian König <christian.koenig at amd.com>	2018-10-26 13:23:06 -04:00
Jason Ekstrand	5cdeefe057	intel/nir: Use the OPT macro for more passes Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	18fb2c5d92	spirv: Initialize subgroup destinations with the destination type Instead of initializing them manually, just use the type that we already have sitting there. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	8fa70cfcfd	spirv: Use the right bit-size for spec constant ops Previously, we would always pull the bit size from the destination which is wrong for opcodes like nir_ilt where the sources are variable-sized but the destination is a fixed size. We were getting lucky before because nir_op_ilt returns a 32-bit value and basically everyone who uses spec constants uses 32-bit ones. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	1d2ed694c1	nir/prog: Use nir_bany in kill handling We have a helper that does exactly what the bany_inequal was doing. It emits the same code but is a bit higher level and is designed to operate on a bvec4. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	2fe3031440	glsl/nir: Use i2b instead of ine for fixing UBO/SSBO Booleans They do the same thing in the end but i2b is a bit simpler. Also, let's clean up the mess of code for SSBO handling with one line of builder. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	5bfce5fcc2	nir/system_values: Use the bit size from the load_deref This isn't a great solution for bit-sizes but we don't have a particularly convenient way to get a bit size from the system value enum and this keeps the lowering pass from changing it. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	a3b4cb3458	nir/opt_if: Rework condition propagation Instead of doing our own constant folding, we just emit instructions and let constant folding happen. This is substantially simpler and lets us use the nir_imm_bool helper instead of dealing with the const_value's ourselves. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	4cd8a58595	nir/search: Use the nir_imm_* helpers from nir_builder This requires that we rework the interface a bit to use nir_builder but that's a nice little modernization anyway. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	6e32115bd6	nir/builder: Handle 16-bit floats in nir_imm_floatN_t Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	ff45649bc2	nir/builder: Add a nir_imm_true/false helpers Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	249e32ab17	nir/constant_folding: Use nir_src_as_bool for discard_if Missed one while converting to the nir_src_as_* helpers. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	6de1869e86	nir/constant_folding: Add an unreachable to a switch Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	28bb6abd1d	nir/validate: Print when the validation failed Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-26 11:45:29 -05:00
Jason Ekstrand	292ebdbf98	anv: Handle the device loss abort in anv_device_set_lost Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-26 08:40:23 -05:00
Jason Ekstrand	cd0960b430	anv: Add helpers for setting/checking device lost Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-26 08:40:21 -05:00
Jason Ekstrand	319ff6f1ad	anv: Provide a error message with a DEVICE_LOST Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-26 08:40:10 -05:00
Alex Smith	3bd239f71d	anv: Fix sanitization of stencil state when the depth test is disabled When depth testing is disabled, we shouldn't pay attention to the specified depthCompareOp, and just treat it as always passing. Before, if the depth test is disabled, but depthCompareOp is VK_COMPARE_OP_NEVER (e.g. from the app having zero-initialized the structure), then sanitize_stencil_face() would have incorrectly changed passOp to VK_STENCIL_OP_KEEP. v2: Roll the depthTestEnable check into the ds_aspect check below since they now both do the same thing. Fixes: `028e1137e6` "anv/pipeline: Be smarter about depth/stencil state" Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-26 10:25:40 +01:00
Samuel Pitoiset	79bbdf8e45	radv: implement image to image operations for R32G32B32 This should address the remaining failures in Batman Arkhman City. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107765 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-26 10:50:08 +02:00
Samuel Pitoiset	6198245775	radv: fix a comment in radv_meta_buffer_to_image_cs_r32g32b32() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-26 10:50:05 +02:00
Samuel Pitoiset	02ccef7874	radv: add get_image_stride_for_r32g32b32() helper For the special R32G32B32 paths. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-26 10:50:03 +02:00
Samuel Pitoiset	468c33e2f7	radv: add create_bview_for_r32g32b32() helper For the special R32G32B32 paths. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-26 10:50:00 +02:00
Samuel Pitoiset	e60e3e1b3f	radv: add create_buffer_from_image() helper For the special R32G32B32 paths. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-26 10:49:58 +02:00
Sagar Ghuge	416abe809a	intel/compiler: Print message descriptor as immediate source While disassembling send(c) instruction print message descriptor as immediate source operand along with message descriptor. This allows assembler to read immediate source operand and set bits accordingly. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-10-26 06:42:14 +02:00
Sagar Ghuge	d15fa24860	intel/compiler: Print hex representation along with floating point value While encoding the immediate floating point values in instruction we use values upto precision 9, but while disassembling, we print precision to 6 places, which round up the value and gives wrong interpretation for encoded immediate constant. To avoid misinterpretation of encoded immediate values in instruction and disassembled output, print hex representation along with floating point value which can be used by assembler in future. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-10-26 06:41:08 +02:00
David McFarland	07a00a8729	util: Change remaining uint32 cache ids to sha1 After discussion with Timothy Arceri. disk_cache_get_function_identifier was using only the first byte of the sha1 build-id. Replace disk_cache_get_function_identifier with implementation from radv_get_build_id. Instead of writing a uint32_t it now writes to a mesa_sha1. All drivers using disk_cache_get_function_identifier are updated accordingly. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Fixes: `83ea8dd99b` ("util: add disk_cache_get_function_identifier()")	2018-10-26 14:49:22 +11:00
Hyunjun Ko	3d198926a4	freedreno: use fd_bc_alloc_batch instead of fd_batch_create. Following the commit `2385d7b066` and `8e798e28f7`, for resource dependancy tracking. Fixes: dEQP-GLES31.functional.image_load_store.early_fragment_tests.no_early_fragment_tests_depth_fbo with FD_MESA_DEBUG=inorder Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-25 18:46:19 -04:00
Hyunjun Ko	703271c22a	freedreno/ir3: take reg->num out of union in ir3_register To avoid wrong result when identifying the type of register. Ie. If the reg is an array, it might be identified as address or predicate register. Fixes: dEQP-GLES31.functional.ssbo.layout.random.arrays_of_arrays.6 Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-25 18:45:45 -04:00
Rob Clark	3c402d0dc2	freedreno/a6xx: disable unused groups Don't leave vsconst/fsconst group enabled if we switch to shader with no uniforms. Fixes: `abcdf5627a` freedreno/a6xx: move const emit to state group Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-25 18:38:53 -04:00
Rob Clark	d53074d3f1	freedreno: add useful assert Would have been useful to catch the problem fixed in `8e798e28f7` Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-25 18:38:53 -04:00
Alok Hota	edf38019a0	swr/rast: ignore CreateElementUnorderedAtomicMemCpy This function's API changed between LLVM 5 and 6. Compile errors occur when building with LLVM 6+ if LLVM 5 was used for a dist tarball CC: <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107865 Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-25 11:05:59 -05:00
Alok Hota	8c872ac2e3	swr/rast: fix intrinsic/function for LLVM 7 compatibility Converted from x86 VFMADDPS intrinsic to generic LLVM intrinsic, and removed createInstructionSimplifierPass, which were both removed in LLVM 7.0.0 These changes combine patches we received from the community and our own internal patches Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com> Tested-by: Chuck Atkins <chuck.atkins@kitware.com>	2018-10-25 10:32:27 -05:00
Rhys Perry	26ed0f0234	nvc0: increase NOUVEAU_TRANSFER_PUSHBUF_THRESHOLD to 1024 on Kepler+ Gives a +3.89% to +5.27% FPS improvement with Hitman and +2.73% to +2.82% FPS improvement with Dirt Rally on my GTX 1060. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-10-25 15:25:10 +01:00
Bas Nieuwenhuizen	d41c3cc013	radv: Emit enqueued pipeline barriers on event write. Since the CPU can read them we need to execute any GPU->CPU flushes before the event is written. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108524 Fixes: `f4e499ec79` "radv: add initial non-conformant radv vulkan driver" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-25 16:17:54 +02:00
Bas Nieuwenhuizen	9d40ec2cf6	radv: Add support for VK_KHR_driver_properties. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-25 16:14:43 +02:00
Eric Engestrom	e27902a261	util: use C99 declaration in the for-loop set_foreach() macro Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-25 12:43:18 +01:00
Eric Engestrom	bb84fa146f	util: use C99 declaration in the for-loop hash_table_foreach() macro Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-25 12:43:18 +01:00
Dylan Baker	3d261cf77b	gen: Add AMD_gpu_shader_int64.xml to tarball CC: Ian Romanick <ian.d.romanick@intel.com> CC: Marek Olšák <marek.olsak@amd.com> Fixes: `b3c17330e6` ("mesa: expose AMD_gpu_shader_int64") Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-10-24 11:29:30 -07:00
Dylan Baker	6d5fa65c74	gen: Add EXT_vertex_attrib_64bit.xml to dependency lists Which is also required to put it in the tarball, a requirement for building with meson from the tarball. CC: Ian Romanick <ian.d.romanick@intel.com> CC: Marek Olšák <marek.olsak@amd.com> Fixes: `263c962cfd` ("mesa: expose EXT_vertex_attrib_64bit") Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-10-24 11:29:29 -07:00
Eric Engestrom	edc06dd533	anv: move variable to proper scope and mark as MAYBE_UNUSED Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-24 18:16:20 +01:00
Eric Engestrom	ed5d65a6a1	anv: use snprintf() instead of memset()+strcpy() snprintf() guarantees that it will not write more chars than allowed, and that the string will be null-terminated, without the need to fill the whole thing with zeroes to begin with. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-24 18:15:56 +01:00
Eric Engestrom	33d757096d	anv: drop unused includes Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-24 18:15:05 +01:00
Dylan Baker	c4de8ba036	autotools: include intel_tiled_memcopy.c There are two problems with the fixed patch. First, it fails to create a dependency on the sourced .c file, so changes to intel_tiled_memcpy.c won't trigger a rebuild. It also doesn't get included in the dist tarball. Fixes: `11b1afdc92` ("i965/tiled_memcpy: inline movntdqa loads in tiled_to_linear") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-10-24 09:22:15 -07:00
Dylan Baker	43b0d5fa04	meson: fix formatting and add extra_files to i965 extra_files is just a nice way to to tell certain IDEs (and those reading the file) that this file is also a dependency. Meson will use the .d file generated by the compiler to figure out what the target actually depends on. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com>	2018-10-24 09:22:13 -07:00
Eduardo Lima Mitev	b0c427043b	ir3_compiler/nir: fix imageSize() for buffer-backed images GL_EXT_texture_buffer introduced texture buffers, which can be used in shaders through a new type imageBuffer. Because how image access is implemented in freedreno, calling imageSize on an imageBuffer returns the size in bytes instead of texels, which is incorrect. This patch adds a division of imageSize result by the bytes-per-pixel of the image format, when image is buffer-backed. Fixes all tests under dEQP-GLES31.functional.image_load_store.buffer.image_size.* v2: Pre-compute and submit the log2 of the image format's bpp as shader constant instead of emitting the LOG2 instruction in code. (Rob Clark) v3: Use ffs (find-first-bit) helper for computing log2 (Ilia Mirkin) Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-10-24 18:18:35 +02:00
Jose Fonseca	d9a04196d9	nir: Fix array initializer. Empty initializer is not standard C. This fixes MSVC build. Trivial.	2018-10-24 11:37:09 +01:00
Liviu Prodea	d99fda17c8	scons: Put to rest zombie texture_float build option. I found a remnant of texture_float build option that wasn't removed in commit `66673bef94` This patch removes it. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-10-24 11:10:17 +01:00
Alex Smith	6c56c1fbd4	anv: Allow presenting via a different GPU anv_GetPhysicalDeviceSurfaceSupportKHR will already return success for this, but anv_GetPhysicalDevice{Xcb,Xlib}PresentationSupportKHR do not. Apps which check for presentation support via the latter (all Feral Vulkan games at least) will therefore fail. This allows me to render on an Intel GPU and present to a display connected to an AMD card (tested HD 530 + Vega 64). v2: Rebase on current master. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-24 09:40:02 +01:00
Juan A. Suarez Romero	3112da346b	nir: fix nir_copy_propagation test Use nir_src_comp_as_uint() to read the proper second component, as nir_src_as_uint() returns the first one. v2: Use nir_src_comp_as_uint() [Jason] Fixes: `16870de8a0` ("nir: Use nir_src_is_const and nir_src_as_* in core code") Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108532 Tested-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-24 09:13:24 +02:00
Timothy Arceri	0ff1ccca25	radv: call nir_link_xfb_varyings() Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-24 08:21:29 +11:00
Timothy Arceri	c769ed10de	radv: move nir_lower_io_to_scalar_early() to radv_link_shaders() nir_lower_io_to_scalar_early() is really part of the link time optimisations. Moving it here allows the code to be simplified and also keeps the code easy to follow in the next patch. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-24 08:21:29 +11:00
Samuel Pitoiset	7c694cbfa4	nir: add linking helper nir_link_xfb_varyings() The linking opts shouldn't try removing or compacting XFB varyings in the consumer. To avoid this we copy the always_active_io flag from the producer. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-24 08:21:29 +11:00
Sagar Ghuge	0a7664fe8c	intel/compiler: Change src1 reg type to unsigned doubleword To have uniform behavior while disassembling send(c) instruction use register type of unsigned doubleword for src1 when message descriptor is immediate value. Bspec does not specifiy anything for src1 immediate default type. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2018-10-23 12:44:24 -07:00
Eduardo Lima Mitev	22ddd4988e	mesa/glformats: Remove redundant helper _mesa_base_format_component_count There exists _mesa_components_in_format() which already includes all cases handled in _mesa_base_format_component_count(). Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-10-23 21:29:15 +02:00
Jason Ekstrand	ecb7775e1c	nir/algebraic: Fix a typo in the bit size validation code The conon_bit_class and canon_var_class variables got switched. Fixes: `932c650e0b` "nir/algebraic: Loosen a restriction on variables" Reported-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-23 12:22:29 -05:00
Leo Liu	b75fb8ee36	amd/common: check DRM version 3.27 for JPEG decode JPEG was added after DRM version 3.26 Signed-off-by: Leo Liu <leo.liu@amd.com> Fixes: 4558758c51749(amd/common: add vcn jpeg ip info query) Cc: Boyuan Zhang <boyuan.zhang@amd.com> Cc: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2018-10-23 13:12:05 -04:00
Juan A. Suarez Romero	a8c2a6b0ac	docs: update calendar I'll take care of 18.2 releases series on Andres behalf. CC: Andres Gomez <agomez@igalia.com> CC: Dylan Baker <dylan@pnwbakers.com> CC: Emil Velikov <emil.l.velikov@gmail.com>	2018-10-23 18:40:09 +02:00
Lionel Landwerlin	a8594887bc	intel/decoders: fix end of batch limit Pointer arithmetic... v2: s/4/sizeof(uint32_t)/ (Eric) v3: Give bytes to print_batch() in error_decode (Lionel) Make clear what values we're dealing with in error_decode (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (v2) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-10-23 14:49:33 +01:00
Boyuan Zhang	55e7de7b19	radeonsi: enable vcn jpeg decode for raven Enable vcn jpeg decode for raven. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	97c473bb29	winsys/amdgpu: add vcn jpeg cs support Add vcn jpeg cs support, align cs by no-op. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	4558758c51	amd/common: add vcn jpeg ip info query Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	6d2d910653	radeon/vcn: implement jpeg target buffer cmd Implement jpeg target buffer cmd by programming registers directly, since there is no firmware for VCN Jpeg decode. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Acked-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	0ee5630cfc	radeon/vcn: implement jpeg bitstream buffer cmd Implement jpeg bitstream buffer cmd by programming registers directly, since there is no firmware for VCN Jpeg decode. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Acked-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	9b478b0c7a	radeon/uvd: remove get mjpeg slice header Move the previous get_mjpeg_slice_heaeder function and eoi from "radeon/vcn" to "st/va". Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	4fc2368e3b	st/va: get mjpeg slice header Move the previous get_mjpeg_slice_heaeder function and eoi from "radeon/vcn" to "st/va". Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	c7a5ef26ad	radeon/vcn: add jpeg decode implementation Add a new file to handle VCN Jpeg decode specific functions. Use Jpeg specific cmd sending function in end_frame call. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	40fceb55f3	radeon/vcn: separate send cmd call from end frame Use function pointer for sending cmd in end_frame call. By doing this, we can assign different cmd sending logics for Jpeg decode later. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	4f1f128f8e	radeon/vcn: create cs based on ring type Add RING_VCN_JPEG for VCN Jpeg decode, and keep RING_VCN_DEC for other codecs. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	f7116e4ff8	radeon/winsys: add vcn jpeg ring type Add a new ring type for vcn jpeg. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	e7e68d15b5	radeon/vcn: add vcn jpeg decode interface Add VCN Jpeg decode interfaces and register defines. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	6bc0a3a834	radeon/vcn: move radeon decoder define to header file Move radeon_decoder definition from "radeon_vcn_dec.c" to "radeon_vcn_dec.h", so that it can be included by other files later. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	0f59e3f088	meson: update required amdgpu version to 2.4.95 VCN jpeg requires new hw ip Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Boyuan Zhang	2e768ade61	configure.ac: update libdrm amdgpu version to 2.4.95 VCN jpeg requires new hw ip Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Samuel Pitoiset	69c44de798	radv: fix btoi for R32G32B32 when the dest offset is not 0 Fixes: `593996bc02` ("radv: implement buffer to image operations for R32G32B32") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-23 14:29:26 +02:00
Scott D Phillips	54c823ec79	i965/miptree: Use cpu tiling/detiling when mapping Rename the (un)map_gtt functions to (un)map_map (map by returning a map) and add new functions (un)map_tiled_memcpy that return a shadow buffer populated with the intel_tiled_memcpy functions. Tiling/detiling with the cpu will be the only way to handle Yf/Ys tiling, when support is added for those formats. v2: Compute extents properly in the x\|y-rounded-down case (Chris Wilson) v3: Add units to parameter names of tile_extents (Nanley Chery) Use _mesa_align_malloc for the shadow copy (Nanley) Continue using gtt maps on gen4 (Nanley) v4: Use streaming_load_memcpy when detiling v5: (edited by Ken) Move map_tiled_memcpy above map_movntdqa, so it takes precedence. Add intel_miptree_access_raw, needed after rebasing on commit `b499b85b0f`. v6: refactor to changes done for sse41 separation (Tapani) Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v5) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Tapani Pälli <tapani.palli@intel.com>	2018-10-23 14:08:05 +03:00
Scott D Phillips	11b1afdc92	i965/tiled_memcpy: inline movntdqa loads in tiled_to_linear The reference for MOVNTDQA says: For WC memory type, the nontemporal hint may be implemented by loading a temporary internal buffer with the equivalent of an aligned cache line without filling this data to the cache. [...] Subsequent MOVNTDQA reads to unread portions of the WC cache line will receive data from the temporary internal buffer if data is available. This hidden cache line sized temporary buffer can improve the read performance from wc maps. v2: Add mfence at start of tiled_to_linear for streaming loads (Chris) v3: add Android build support (Tapani) v4: squash 'fix i915: Fix streaming loads for intel_tiled_memcpy' separate sse41 to own static library (Tapani) Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v2) Reviewed-by: Matt Turner <mattst88@gmail.com> (v2) Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Tapani Pälli <tapani.palli@intel.com>	2018-10-23 14:08:05 +03:00
Tapani Pälli	91d3a5d1a8	i965: expose type of memcpy instead of memcpy function itself There is currently no use of returned memcpy functions outside intel_tiled_memcpy. Patch changes intel_get_memcpy to return memcpy type instead of actual function. This makes it easier later to separate streaming load copy in to own static library. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-10-23 14:08:05 +03:00
Eric Engestrom	bc021be78d	util: use unsigned ints for bit operations Fixes errors thrown by GCC's Undefined Behaviour sanitizer (ubsan) every time this macro is used. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-23 11:44:02 +01:00
Eric Engestrom	17b03b5320	radv: s/abs/fabsf/ for floats Fixes: `a4c4efad89` "radv: Rework guard band calculation" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-23 11:43:51 +01:00
Eric Engestrom	8629d807aa	meson: drop option description relic `platforms` is no longer a comma-separated string, and some of our option descriptions are way too long already. Just drop the incorrect bit. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-10-23 11:43:51 +01:00
Jason Ekstrand	8b626a22b2	st/mesa: Record shader access qualifiers for images They're not required to be the same as the access flag on the image unit. For hardware that does shader image lowering based on the qualifier (Intel), it may be required for state setup. v2: (by Kenneth Graunke, incorporating feedback from Marek Olšák) - Reduce both access and shader_access to uint16_t to avoid making the pipe_image_view structure larger. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-23 02:36:24 -07:00
Jason Ekstrand	bf441d22a7	nir/algebraic: Provide descriptive asserts for bit size checks This will hopefully make debugging opt_algebraic bit-size compile failures easier. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-10-22 16:00:18 -05:00
Jason Ekstrand	932c650e0b	nir/algebraic: Loosen a restriction on variables Previously, we would fail if a variable had an assigned but unknown bit size X and we tried to assign it an actual bit size. However, this is ok because, at the time we do the search, the variable does have an actual bit size and it will match X because of the NIR rules. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-10-22 16:00:18 -05:00
Jason Ekstrand	ea9e651423	nir/algebraic: A bit of validation refactoring' We rename some local variables in validate() to be more readable and plumb the var through to get/set_var_bit_class instead of the var index. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-10-22 16:00:18 -05:00
Jason Ekstrand	641f4be8e8	nir/algebraic: Make internal classes str-able Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-10-22 16:00:18 -05:00
Jason Ekstrand	6068be543b	nir/algebraic: Generalize an optimization There's nothing boolean about (a \| ~a) ~> -1 Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-10-22 16:00:18 -05:00
Jason Ekstrand	69618a8678	nir/algebraic: Use bool internally instead of bool32 Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-10-22 16:00:18 -05:00
Kenneth Graunke	00103db04a	intel: Fix decoding for partial STATE_BASE_ADDRESS updates. STATE_BASE_ADDRESS only modifies various bases if the "modify" bit is set. Otherwise, we want to keep the existing base address. Iris uses this for updating Surface State Base Address while leaving the others as-is. v2: Also update aubinator_viewer_decoder (caught by Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-22 13:38:44 -07:00
Jason Ekstrand	16870de8a0	nir: Use nir_src_is_const and nir_src_as_* in core code Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-22 14:24:15 -05:00
Jason Ekstrand	ce36f412c9	nir/search_helpers: Use nir_src_is_const and friends This not only makes them safe for more bit sizes but it also fixes a bug in is_zero_to_one where it would return true for constant NaN. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-22 14:24:15 -05:00
Jason Ekstrand	7bae7828aa	nir/search: Use nir_src_is_const and friends Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-22 14:24:15 -05:00
Jason Ekstrand	bca5c2c688	nir: Add some new helpers for working with const sources Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-22 14:24:15 -05:00
Alyssa Rosenzweig	e0c267c752	mesa/st: Only call nir_lower_io_to_scalar_early on scalar ISAs On scalar ISAs, nir_lower_io_to_scalar_early enables significant optimizations. However, on vector ISAs, it is counterproductive and impedes optimal codegen. This patch only calls nir_lower_io_to_scalar_early for scalar ISAs. It appears that at present there are no upstreamed drivers using Gallium, NIR, and a vector ISA, so for existing code, this should be a no-op. However, this patch is necessary for the upcoming Panfrost (Midgard) and Lima (Utgard) compilers, which are vector. With this patch, Panfrost is able to consume NIR directly, rather than TGSI with the TGSI->NIR conversion. For how this affects Lima, see https://www.mail-archive.com/mesa-dev@lists.freedesktop.org/msg189216.html Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2018-10-22 20:37:07 +02:00
Dylan Baker	4e785fb383	meson: don't require libelf for r600 without LLVM r600 doesn't have a hard requirement on LLVM, and therefore doesn't have a hard requirement on libelf. Currently the logic doesn't allow that however. Distro-bug: https://bugs.gentoo.org/669058 Fixes: `5060c51b6f` ("meson: build r600 driver") Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-10-22 11:29:55 -07:00
Jason Ekstrand	ca4e465f7d	anv,radv: Trivially expose two new VK_GOOGLE extensions This patch exposes support for the following two extensions: * VK_GOOGLE_decorate_string * VK_GOOGLE_hlsl_functionality1 There's nothing for the driver to do; it's all handled in spirv_to_nir. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107971 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-22 10:50:20 -05:00
Jason Ekstrand	891886da2f	spirv: Add no-op support for VK_GOOGLE_hlsl_functionality1 This extension adds two new decorations which carry meaning only for HLSL shaders. They are expected to be handled by higher level layers and can be ignored by implementations. However, it does save the client a bit of work if the implementation safely ignores them instead of the client having to strip them out of the SPIR-V in order for it to be valid. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-22 10:49:53 -05:00
Jason Ekstrand	5f0322d5c3	spirv: Add support for SPV_GOOGLE_decorate_string Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-22 10:49:53 -05:00
Rob Herring	2bb05d70af	android: Build kms_swrast for the Android platform Signed-off-by: Rob Herring <robh@kernel.org> Signed-off-by: Robert Foss <robert.foss@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-22 13:08:17 +01:00
Connor Abbott	27fe3f5b5a	ac: Fix loading a dvec3 from an SSBO The comment was wrong, since the loop above casts to a type with the correct bitsize already. Fixes: `7e7ee82698` ("ac: add support for 16bit buffer loads") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-22 09:44:51 +02:00
Connor Abbott	59535b05cf	ac: Introduce ac_build_expand() And implement ac_bulid_expand_to_vec4() on top of it. Fixes: `7e7ee82698` ("ac: add support for 16bit buffer loads") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-22 09:44:51 +02:00
Eduardo Lima Mitev	fdd926d5b2	ir3/nir: Set up image_dims consts for image_deref_size intrinsic too `nir_intrinsic_image_deref_size` is not being considered during scan for driver constants, so image constants are not emitted if a shader only ever query the size of an image (no load, store, atomic op, etc). This is unlikely, but possible. Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-10-21 21:29:18 +02:00
Karol Herbst	2d235d69c8	nv50/ir: fix ConstantFolding::createMul for 64 bit muls Fixes: `2f52925f5c` "nv50/ir: move a * b -> a << log2(b) code into createMul()" Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Signed-off-by: Karol Herbst <kherbst@redhat.com>	2018-10-20 03:00:04 +02:00
Sonny Jiang	bfb2b90246	radeonsi: Disable clear_state with radeon kernel driver Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com>	2018-10-19 16:16:57 -04:00
Kenneth Graunke	f91f9bab83	meson: Add -Werror=return-type when supported. This warning detects non-void functions with a missing return statement, return statements with a value in void functions, and functions with an bogus return type that ends up defaulting to int. It's already enabled by default with -Wall. Generally, these are fairly serious bugs in the code, which developers would like to notice and fix immediately. This patch promotes it from a warning to an error, to help developers catch such mistakes early. I would not expect this warning to change much based on the compiler version, so hopefully it won't become a problem for packagers/builders. See the GCC documentation or 'man gcc' for more details: https://gcc.gnu.org/onlinedocs/gcc-7.3.0/gcc/Warning-Options.html#index-Wreturn-type Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-19 10:16:57 -07:00
Jason Ekstrand	0d380af809	anv: Define trampolines as the weak functions Instead of having weak references to the anv functions and separate trampoline functions with their own dispatch table, just make the trampoline functions weak. This gets rid of a dispatch table and potentially lets the compiler delete the unused weak function. The end result is a reduction in the .text section of 5.7K and a reduction in the .data section of 1.4K. Before: text data bss dec hex filename 3190329 282232 8960 3481521 351fb1 _install/lib64/libvulkan_intel.so After: text data bss dec hex filename 3184548 280792 8960 3474300 35037c _install/lib64/libvulkan_intel.so Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-19 11:52:00 -05:00
Juan A. Suarez Romero	f8e789d2ac	docs: fix typo in 18.2.3 release notes link Fixes: `86b4bd52dc` ("docs: update calendar, add news item and link release notes for 18.2.3") Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-10-19 18:48:12 +02:00
Juan A. Suarez Romero	86b4bd52dc	docs: update calendar, add news item and link release notes for 18.2.3 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-10-19 18:45:41 +02:00
Juan A. Suarez Romero	01f5d37d3e	docs: add sha256 checksums for 18.2.3 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `27fd12857b`)	2018-10-19 18:43:49 +02:00
Juan A. Suarez Romero	e30970e2cd	docs: add release notes for 18.2.3 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `d219361b42`)	2018-10-19 18:43:48 +02:00
Jose Fonseca	45bacc4b63	scons: Remove gles option. It's broken, and WGL state tracker is always built with GLES support noawadays. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-10-19 16:50:26 +01:00
Bas Nieuwenhuizen	68c7833540	radv: Fix WSI & PCI bus info initialization order. Trying to access the bus info before it is initialized is not going to work. Fixes: `baa38c144f` "vulkan/wsi: Use VK_EXT_pci_bus_info for DRM fd matching" Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108491 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Andre Heider <a.heider@gmail.com>	2018-10-19 13:24:19 +02:00
Marek Olšák	69a87b5d47	radeonsi: fix a typo in a comment in emit_guardband	2018-10-18 18:01:22 -04:00
Marek Olšák	2a26b1c045	radeonsi: fix gnome-shell crash I wasn't expecting to get viewports with the center having negative coordinates. Broken by: `6cc79e4411`	2018-10-18 17:55:44 -04:00
Jason Ekstrand	8c0b9fdfa1	Revert "anv: Stop generating weak references for instance entrypoints" This reverts commit `00bb42105d`. It was not as well thought out as I had intended and broke the build when VK_KHR_display is disabled in the build.	2018-10-18 15:36:26 -05:00
Marek Olšák	77bcbe712e	radeonsi: clamp point size to the limit This fixes dEQP-GLES2.functional.rasterization.limits.points. Broken by: `ea039f789d` Tested-by: Jakob Bornecrantz <jakob@collabora.com>	2018-10-18 16:08:56 -04:00
Marek Olšák	eae8f49fc6	radeonsi: fix a VGT hang with primitive restart on Polaris10 and later Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Tested-by: Jakob Bornecrantz <jakob@collabora.com>	2018-10-18 16:08:56 -04:00
Marek Olšák	165817d47f	radeonsi: fix a deadlock due to partially-initialized context on CI	2018-10-18 16:08:56 -04:00
Jan Vesely	06bf56725d	radeonsi: Bump number of allowed global buffers to 32 Fixes assertion failure/crash when running luxmark/luxball on clover. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108272 CC: mesa-stable@lists.freedesktop.org Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-18 16:02:42 -04:00
Andres Rodriguez	e71a87775e	radv: fix check for perftest options size It was using the debug options array size. CC: mesa-stable@lists.freedesktop.org Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-18 15:42:20 -04:00
Marek Olšák	6cc79e4411	radeonsi: fix incorrect hw screen offset and guardband computation It resulted in assertion failures or incorrect rendering. Broken by: `9e182b8313`	2018-10-18 14:42:42 -04:00
Jason Ekstrand	baa38c144f	vulkan/wsi: Use VK_EXT_pci_bus_info for DRM fd matching This lets us avoid passing the DRM fd around all over the place and gets us closer to layer utopia. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-18 11:29:00 -05:00
Michel Dänzer	c20ba1be18	loader/dri3: Also wait for front buffer fence if we triggered it In that case, we have to wait for the fence to synchronize with the corresponding drawing we triggered in the X server. Fixes incorrect display with the i965 driver and some applications, e.g. solvespace. Bugzilla: https://bugs.freedesktop.org/108097 Fixes: `aefac10fec` "loader/dri3: Only wait for back buffer fences in dri3_get_buffer" Tested-by: Sergii Romantsov <sergii.romantsov@globallogic.com>	2018-10-18 16:52:06 +02:00
Jason Ekstrand	00bb42105d	anv: Stop generating weak references for instance entrypoints We don't need weak references to instance entrypoints because we never have more than one of each so we don't need the NULL fall-back. This also helps us avoid forgetting things because we now get link errors for missing instance entrypoints. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-18 09:17:39 -05:00
Jason Ekstrand	7c65cf9844	vulkan/wsi: Implement GetPhysicalDevicePresentRectanglesKHR This got missed during 1.1 enabling because it was defined as an interaction between device groups and WSI and it wasn't obvious it was in the delta. The idea behind it is that it's supposed to provide a hint to the application in a multi-GPU setup to indicate which regions of the screen are being scanned out by which GPU so a multi-device split-screen rendering application can render each part of the screen on the GPU that will be presenting it and avoid extra bus traffic between GPUs. On a single-GPU setup or one which doesn't support this present mode, we need to do something. We choose to return the window size (or a max-size rect) if the compositor, X server, or crtc is associated with the given physical device and zero rectangles otherwise. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-18 09:17:39 -05:00
Jason Ekstrand	7629c00557	vulkan/wsi: Store the instance allocator in wsi_device We already have wsi_device and we know the instance allocator at wsi_device_init time so there's no need to pass it into the physical device queries. This also fixes a memory allocation domain bug that can occur if CreateSwapchain gets called prior to any queries (not likely) in which case the cached connection gets allocated off the device instead of the instance. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-18 09:17:39 -05:00
Michał Janiszewski	0ef50ecc69	st/xlib: Use more appropriate include guard Signed-off-by: Michał Janiszewski <janisozaur+signed@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com	2018-10-18 11:03:04 +01:00
Michał Janiszewski	bcc613acc1	gallium: Fix mismatched ifdef-guards Signed-off-by: Michał Janiszewski <janisozaur+signed@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-18 11:03:03 +01:00
Gert Wollny	74adc624b6	softpipe: dynamically allocate space for immediate constants The number of immediate constants was fixed and the size check was only done by means of an assertion. Given this a shader that emits more immediate constants would result in a memory corruption when mesa is build in release mode. Instead of using this fixed limit allocate the space dynamically, let it grow as needed, and also remove the unused ImmArray. Fixes: dEQP-GLES31.functional.ssbo.layout.random.arrays_of_arrays.1 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-10-18 10:59:51 +02:00
Timothy Arceri	3a95396f3c	radv: use nir_shrink_vec_array_vars() Totals from affected shaders: SGPRS: 1096 -> 1096 (0.00 %) VGPRS: 1192 -> 1056 (-11.41 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 100940 -> 94384 (-6.49 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 100 -> 112 (12.00 %) Wait states: 0 -> 0 (0.00 %) All affected shaders are from Batman Arkham City. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-18 15:04:09 +11:00
Timothy Arceri	8086fa1bcd	radv: use nir_split_array_vars() We call in the opt loop in case another pass results in an array with indirect access being turned into direct access. Totals from affected shaders: SGPRS: 512 -> 496 (-3.12 %) VGPRS: 456 -> 452 (-0.88 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 40040 -> 39664 (-0.94 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 41 -> 43 (4.88 %) Wait states: 0 -> 0 (0.00 %) All affected shaders are from Batman Arkham City. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-18 15:04:09 +11:00
Timothy Arceri	06675711e7	radv: use nir_opt_find_array_copies() Totals from affected shaders: SGPRS: 1112 -> 1112 (0.00 %) VGPRS: 1492 -> 1196 (-19.84 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 112172 -> 101316 (-9.68 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 93 -> 98 (5.38 %) Wait states: 0 -> 0 (0.00 %) All affected shaders are from "Batman: Arkham City" over DXVK. The pass detects that the temporary array created by DXVK for storing TCS inputs is a copy of the input arrays and allows us to avoid copying all of the input data and then indirecting on it with if-ladders, instead we just do indirect indexing. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-18 15:04:09 +11:00
Timothy Arceri	9d5b106b2e	radv: use nir_opt_copy_prop_vars and nir_opt_dead_write_vars Totals from affected shaders: SGPRS: 2856 -> 2856 (0.00 %) VGPRS: 3236 -> 3248 (0.37 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 236560 -> 233548 (-1.27 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 277 -> 283 (2.17 %) Wait states: 0 -> 0 (0.00 %) Even in the cases were we have increased VGPR use it appears the NIR is improved significantly. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-18 15:04:09 +11:00
Keith Packard	67a2c1493c	vulkan: Add VK_EXT_calibrated_timestamps extension (radv and anv) [v5] Offers three clocks, device, clock monotonic and clock monotonic raw. Could use some kernel support to reduce the deviation between clock values. v2: Ensure deviation is at least as big as the GPU time interval. v3: Set device->lost when returning DEVICE_LOST. Use MAX2 and DIV_ROUND_UP instead of open coding these. Delete spurious TIMESTAMP in radv version. Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Suggested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> v4: Add anv_gem_reg_read to anv_gem_stubs.c Suggested-by: Jason Ekstrand <jason@jlekstrand.net> v5: Adjust maxDeviation computation to max(sampled_clock_period) + sample_interval. Suggested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Keith Packard <keithp@keithp.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-17 20:10:15 -07:00
Topi Pohjolainen	a11cafbd7a	intel/compiler/icl: Use invocation id bits 22:16 instead of 23:17 Identifier bits in the dispatch header have changed. See Bspec: SINGLE_PATCH Payload: 3D Pipeline Stages - 3D Pipeline Geometry - Hull Shader (HS) Stage IVB+ - Payloads IVB+ Fixes: KHR-GL46.tessellation_shader.tessellation_shader_tc_barriers.barrier_guarded_read_write_calls Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-10-17 21:19:57 +03:00
Neil Roberts	a9475d9337	Fix setting indent-tabs-mode in the Emacs .dir-locals.el files Some of the .dir-locals.el had the wrong name for the truthy value so it wasn’t setting indent-tabs-mode. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-10-17 19:03:08 +02:00
Rob Clark	d27b1c83b9	freedreno/a6xx: don't allocate binning rb Now that a single cmdstream is used for both binning and draw passes, we can skip allocation of cmdstream buffer for binning. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:49 -04:00
Rob Clark	24d57a6d8f	freedreno/a6xx: single cmdstream for draw+binning Now that state which is different for draw vs binning pass is split out into different state-groups with appropriate enable_mask (so the appropriate one is chosen for draw vs binning), switch over to using a single cmdstream for both passes. This should significantly lower draw overhead for CPU bound benchmarks. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:49 -04:00
Rob Clark	72f6164fef	freedreno/a6xx: split binning vs draw program stateobj's Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:49 -04:00
Rob Clark	3313d693af	freedreno/a6xx: split VBO state into binning/draw variants Blob seems to manage to use same input registers for BS (binning pass) vs VS (draw pass) shaders, so it can use the same VBO state for both. We can't quite do that yet, so split them. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:49 -04:00
Rob Clark	b23fc4cacb	freedreno/a6xx: move VBO state to stateobj Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:49 -04:00
Rob Clark	e194056832	freedreno/a6xx: move ZSA state to stateobj Step towards single cmdstream, where we need different state-group-id's for binning vs draw ZSA state. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	a50a9a44e8	freedreno/a6xx: remove vismode param We don't need to keep this IGNORE_VISIBILITY in binning pass. Prep work for using single cmdstream for both draw and binning passes. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	d9dbc9c21f	freedreno/ir3: move binning-pass fixup for a6xx+ Move this to after ir3_cp (which can add lowered immediates to the const state) for a6xx+, to ensure the uniform state matches between binning and vertex shaders. This way we can emit just a single VS_CONST state- group when we re-use single cmdstream for both binning and draw passes. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	1a51c4a87e	freedreno/a6xx: a bit more state emit cleanup Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	2ffc79c7d1	freedreno/a6xx: move framebuffer state emit to emit_mrt() No point in checking this per-draw, since framebuffer change means new batch. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	5894f37b85	freedreno/a6xx: small emit_mrt() cleanup On a6xx, this is only used for pfb->cbufs so we can just directly pass the pfb state. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	b4e94af37d	freedreno/a6xx: use program cache Use the in-memory cache to construct shader program state and re-use it on subsequent draws, to lower driver overhead. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	1d7fbe2cd1	freedreno/ir3: shader variant cache Cache that maps gallium hwcso (in this case, 'struct ir3_shader') plus shader variant key to a generation specific state object. This could eventually replace the linked list of shader variants, but for now it lets us re-use the work currently done in fdN_program_emit() Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	2e9c08c0bc	freedreno/ir3: move binning_pass out of shader variant key Prep work for a following patch, that introduces a cache to map from program state (all shader stages) plus variant key to pre-baked hw state (which could be emit'd via CP_SET_DRAW_STATE, for example). To do that, we really want the variant key to be immutable, and to treat the binning pass shader as an extra shader stage, rather than as a VS variant. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	8b1a3b5dde	freedreno/ir3: track # of samplers used by shader This is useful for a6xx to avoid program state from depending on bound tex/samp state. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	1b9d69410c	freedreno/a6xx: texture state obj Unfortunately gallium doesn't match what the hw wants perfectly here, in using a separate CSO for each texture/sampler. So we have to use a hash table to map the collection of texture/samplers to hw state object. We probably could use separate hw state objects for texture and sampler state, but mesa/st tends to update the tex and samp state together. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	e8606b11dd	freedreno: add resource seqno Intended to be something more compact than a 64b pointer, which could be used as a key into hashtables. Prep work for texture state objects. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	abcdf5627a	freedreno/a6xx: move const emit to state group Eventually we want to move nearly everything, but no other state depends on const state, so this is the easiest one to move first. For webgl aquarium, this reduces GPU load by about 10%, since for each fish it does a uniform upload plus draw.. fish frequently are visible in only a single tile, so this skips the uniform uploads for other tiles. The additional step of avoiding WFI's when using CP_SET_DRAW_STATE seems to be work an additional 10% gain for aquarium. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	a398d26fd2	freedreno/a6xx: add infrastructure for CP_DRAW_STATE Add helper to add state-groups to emit, and code to emit CP_DRAW_STATE packet if we have any state-groups. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	ec717fc629	freedreno: reduce resource dependency tracking overhead Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Neil Roberts	ee61790daf	freedreno: Remove the Emacs mode lines These are not necessary because the corresponding settings are set via the .dir-locals.el file anyway. Most of them were missing a ‘:’ after “tab-width” which was making Emacs display an annoying warning whenever you open the file. This patch was made with: sed -ri '/-\- mode:/,/^$/d' \ $(find src/gallium/{drivers,winsys} -name \.\[ch\] \ -exec grep -l -- '-\*- mode:' {} \+) Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Neil Roberts	afe640b360	freedreno: Fix the Emacs indentation configuration file The .dir-locals.el had the wrong name for the truthy value so it wasn’t setting indent-tabs-mode. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Hyunjun Ko	8e798e28f7	freedreno: allocate batches from the cache in launch_grid Needs to allocate batches from the cache so that it could get a valid index and make resource dependancy tracking right. In addition this fixes assertion on debug build since the commit `1a40faa8` landed. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Hyunjun Ko	2385d7b066	freedreno: adds nondraw param to fd_bc_alloc_batch Needs to specify nondraw when creating a batch through fd_bc_alloc_batch since it'd better create a batch through it rather than fd_batch_create. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	9e6019bd46	freedreno/a6xx: remove fd6_emit_render_cntl() It was dead code carried over from a5xx Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	835cb06965	freedreno/ir3: fix broken texcoord inputs TODO not sure if this is best solution, but current logic is broken for texcoord inputs. It is definitely the simplest solution. Fixes: `1a24f51966` freedreno/ir3: ignore unused inputs Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Rob Clark	cbf9fe50b5	freedreno: fix off-by-one error in BEGIN_RING() Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Marek Olšák	669dd22983	util: document a limitation of util_fast_udiv32 trivial	2018-10-17 12:27:58 -04:00
Matt Turner	58a51d0a67	i965/fs: Add 64-bit int immediate support to dump_instructions() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-10-16 17:48:17 -07:00
Marek Olšák	fcc70e4855	radeonsi: track context rolls better for the Vega scissor bug workaround We should get fewer context rolls with the SET_CONTEXT_REG optimization, but it would have been for nothing if the scissor state rolled the context anyway. Don't emit the scissor state if there is no context roll.	2018-10-16 17:23:25 -04:00
Marek Olšák	25ddb15cfe	radeonsi: emit sample locations for 1xAA only when the hw bug is present	2018-10-16 17:23:25 -04:00
Marek Olšák	9b331e462e	radeonsi: use compute shaders for clear_buffer & copy_buffer Fast color clears should be much faster. Also, fast color clears on evicted buffers should be 200x faster on GFX8 and older.	2018-10-16 17:23:25 -04:00
Marek Olšák	5030adcbe0	radeonsi: use copy_buffer in buffer_do_flush_region directly	2018-10-16 17:23:25 -04:00
Marek Olšák	0b40fbc879	radeonsi: use faster integer division for instance divisors We know the divisors when we upload them, so instead we can precompute and upload division factors derived from each divisor. This fast division consists of add, mul_hi, and two shifts, and we have to load 4 dwords intead of 1. This probably won't affect any apps.	2018-10-16 17:23:25 -04:00
Marek Olšák	bfc795670e	ac: add helpers for fast integer division by a constant	2018-10-16 17:23:25 -04:00
Marek Olšák	ea039f789d	radeonsi: use higher subpixel precision (QUANT_MODE) for smaller viewports	2018-10-16 15:28:22 -04:00
Marek Olšák	4fd8d2df9c	radeonsi: move emission of PA_SU_VTX_CNTL into emit_guardband We'll modify the quant mode there, which also affects the guarband computation.	2018-10-16 15:28:22 -04:00
Marek Olšák	41a6c3de1f	radeonsi: don't re-upload the sample position constant buffer repeatedly	2018-10-16 15:28:22 -04:00
Marek Olšák	b94824c787	radeonsi: set PA_SU_PRIM_FILTER_CNTL optimally	2018-10-16 15:28:22 -04:00
Marek Olšák	9e182b8313	radeonsi: center viewport to improve guardband clipping for high resolutions This will be more useful when we change the quant mode to increase subpixel precision and decrease the viewport range (which might not be possible if the viewport is not centered in the viewport range).	2018-10-16 15:28:22 -04:00
Marek Olšák	fedc1fda30	radeonsi: save raster config in screen, add se_tile_repeat	2018-10-16 15:28:22 -04:00
Marek Olšák	ac76aeef20	radeonsi: switch back to standard DX sample positions Apps may rely on them.	2018-10-16 15:28:22 -04:00
Marek Olšák	67f02cf810	radeonsi: add GDS support to CP DMA	2018-10-16 15:28:22 -04:00
Marek Olšák	0d05581578	radeonsi: rename si_gfx_* functions to si_cp_* and write_event_eop -> release_mem	2018-10-16 15:28:22 -04:00
Marek Olšák	6e1cf6532d	radeonsi: make si_gfx_write_event_eop more configurable	2018-10-16 15:28:22 -04:00
Sergii Romantsov	0fa9e6d7b3	anv/skylake: disable ForceThreadDispatchEnable On Skylake enabling of ForceThreadDispatchEnable causes gpu-hang. -v2: enabling of ForceThreadDispatchEnable is only for gen8, for gen9 and higher reverted enabling of PixelShaderHasUAV. -v3 (Jason Ekstrand): Rework the comments a bit. CC: Jason Ekstrand <jason.ekstrand@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107941 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107760 Fixes: `79270d2140` (anv: Stop setting 3DSTATE_PS_EXTRA::PixelShaderHasUAV) Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-16 13:20:51 -05:00
Lionel Landwerlin	322a919a41	anv: Implement VK_EXT_pci_bus_info Even though the Intel GPU are always at the same PCI location, all the info we need is already provided by libdrm. Let's be future proof. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-16 12:47:55 +01:00
Jose Fonseca	8550be7a2f	appveyor: Cache pip's cache files. It should speed up the Python packages installation. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-10-16 09:41:14 +01:00
Jose Fonseca	bfb8afb14d	appveyor: Update to newer Mako/winflexbison versions. As that's what most people are bound to use. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-10-16 09:41:12 +01:00
Jose Fonseca	b94f9cd8f9	appveyor: Update to MSVC 2017. That's what we (and I suppose most people out there) are using now. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-10-16 09:41:07 +01:00
Samuel Pitoiset	647c2b90e9	radv: disable VK_SUBGROUP_FEATURE_VOTE_BIT This feature isn't used for now, so disable it until wwm is fixed in LLVM. Fixes dEQP-VK.subgroups.vote.graphics.subgroupallequal* https://bugs.freedesktop.org/show_bug.cgi?id=108115 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-16 10:24:19 +02:00
Samuel Pitoiset	593996bc02	radv: implement buffer to image operations for R32G32B32 This should fix rendering issues with Batman Arkham City. We will probably need to implement itob and itoi at some point, but currently nothing hits these paths. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107765 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-16 09:22:38 +02:00
Alex Smith	ca83d51cfb	ac/nir: Use context-specific LLVM types LLVMInt*Type() return types from the global context and therefore are not safe for use in other contexts. Use types from our own context instead. Fixes frequent crashes seen when doing multithreaded pipeline creation. Fixes: `4d0b02bb5a` "ac: add support for 16bit load_push_constant" Fixes: `7e7ee82698` "ac: add support for 16bit buffer loads" Cc: "18.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-16 08:18:24 +01:00
Vadym Shovkoplias	ad558408ff	glsl: Check the subroutine associated functions names Adding compile time check for subroutine functions with the same names. Similar check for intrastage linking was already landed in commit `5f0567a4f6`. From Section 6.1.2 (Subroutines) of the GLSL 4.00 specification "A program will fail to compile or link if any shader or stage contains two or more functions with the same name if the name is associated with a subroutine type." Fixes: * no-overloads.vert Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108109 Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-10-16 08:15:21 +03:00
Vadym Shovkoplias	d2ea3d4a76	glsl/linker: Change the format of spec quotation Also there is no "OpenGL ES Shading Language 4.00" spec, so change it to GLSL 4.00 spec. Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-10-16 08:15:21 +03:00
Dave Airlie	ff281e6204	nir: fix clip cull lowering to not assert if GLSL already lowered. If GLSL has already done the lowering, we'd rather not crash in this pass. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-10-15 18:53:48 -07:00
Kenneth Graunke	5bd8369681	i965: Add PCI IDs for new Amberlake parts that are Coffeelake based See commit c0c46ca461f136a0ae1ed69da6c874e850aeeb53 in the Linux kernel, where José Roberto de Souza added this new PCI ID there. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Rodrigo Vivi <rodrigo.vivi@intel.com>	2018-10-15 18:10:27 -07:00
Kenneth Graunke	8f8111646c	intel: disable FS IR validation in release mode. We probably don't need to iterate, fprintf, and abort in release mode. Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-10-15 18:10:27 -07:00
Caio Marcelo de Oliveira Filho	b3c6146925	nir: Copy propagation between blocks Extend the pass to propagate the copies information along the control flow graph. It performs two walks, first it collects the vars that were written inside each node. Then it walks applying the copy propagation using a list of copies previously available. At each node the list is invalidated according to results from the first walk. This approach is simpler than a full data-flow analysis, but covers various cases. If derefs are used for operating on more memory resources (e.g. SSBOs), the difference from a regular pass is expected to be more visible -- as the SSA copy propagation pass won't apply to those. A full data-flow analysis would handle more scenarios: conditional breaks in the control flow and merge equivalent effects from multiple branches (e.g. using a phi node to merge the source for writes to the same deref). However, as previous commentary in the code stated, its complexity 'rapidly get out of hand'. The current patch is a good intermediate step towards more complex analysis. The 'copies' linked list was modified to use util_dynarray to make it more convenient to clone it (to handle ifs/loops). Annotated shader-db results for Skylake: total instructions in shared programs: 15105796 -> 15105451 (<.01%) instructions in affected programs: 152293 -> 151948 (-0.23%) helped: 96 HURT: 17 All the HURTs and many HELPs are one instruction. Looking at pass by pass outputs, the copy prop kicks in removing a bunch of loads correctly, which ends up altering what other other optimizations kick. In those cases the copies would be propagated after lowering to SSA. In few HELPs we are actually helping doing more than was possible previously, e.g. consolidating load_uniforms from different blocks. Most of those are from shaders/dolphin/ubershaders/. total cycles in shared programs: 566048861 -> 565954876 (-0.02%) cycles in affected programs: 151461830 -> 151367845 (-0.06%) helped: 2933 HURT: 2950 A lot of noise on both sides. total loops in shared programs: 4603 -> 4603 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 11085 -> 11073 (-0.11%) spills in affected programs: 23 -> 11 (-52.17%) helped: 1 HURT: 0 The shaders/dolphin/ubershaders/12.shader_test was able to pull a couple of loads from inside if statements and reuse them. total fills in shared programs: 23143 -> 23089 (-0.23%) fills in affected programs: 2718 -> 2664 (-1.99%) helped: 27 HURT: 0 All from shaders/dolphin/ubershaders/. LOST: 0 GAINED: 0 The other generations follow the same overall shape. The spills and fills HURTs are all from the same game. shader-db results for Broadwell. total instructions in shared programs: 15402037 -> 15401841 (<.01%) instructions in affected programs: 144386 -> 144190 (-0.14%) helped: 86 HURT: 9 total cycles in shared programs: 600912755 -> 600902486 (<.01%) cycles in affected programs: 185662820 -> 185652551 (<.01%) helped: 2598 HURT: 3053 total loops in shared programs: 4579 -> 4579 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 80929 -> 80924 (<.01%) spills in affected programs: 720 -> 715 (-0.69%) helped: 1 HURT: 5 total fills in shared programs: 93057 -> 93013 (-0.05%) fills in affected programs: 3398 -> 3354 (-1.29%) helped: 27 HURT: 5 LOST: 0 GAINED: 2 shader-db results for Haswell: total instructions in shared programs: 9231975 -> 9230357 (-0.02%) instructions in affected programs: 44992 -> 43374 (-3.60%) helped: 27 HURT: 69 total cycles in shared programs: 87760587 -> 87727502 (-0.04%) cycles in affected programs: 7720673 -> 7687588 (-0.43%) helped: 1609 HURT: 1416 total loops in shared programs: 1830 -> 1830 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 1988 -> 1692 (-14.89%) spills in affected programs: 296 -> 0 helped: 1 HURT: 0 total fills in shared programs: 2103 -> 1668 (-20.68%) fills in affected programs: 438 -> 3 (-99.32%) helped: 4 HURT: 0 LOST: 0 GAINED: 1 v2: Remove the DISABLE prefix from tests we now pass. v3: Add comments about missing write_mask handling. (Caio) Add unreachable when switching on cf_node type. (Jason) Properly merge the component information in written map instead of replacing. (Jason) Explain how removal from written arrays works. (Jason) Use mode directly from deref instead of getting the var. (Jason) v4: Register the local written mode for calls. (Jason) Prefer cf_node instead of node. (Jason) Clarify that remove inside iteration only works in backward iterations. (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 17:29:46 -07:00
Caio Marcelo de Oliveira Filho	dc349f07b5	nir: Take call instruction into account in copy_prop_vars Calls are not used yet (functions are inlined), but since new code is already taking them into account, do it here too. The convention here and in other places is that no writable memory is assumed to remain unchanged, as well as global variables. Also, explicitly state the modes affected (instead of using the reverse logic) in one of the apply_for_barrier_modes calls. Suggested by Jason. v2: Consider local vars used by a call to be conservative, SPIR-V has such cases. (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 17:29:46 -07:00
Caio Marcelo de Oliveira Filho	797f01c220	nir: Add tests for copy propagation of derefs Also tests for removal of redundant loads, that we currently handle as part of the copy propagation. Note some tests involve multiple blocks and are currently DISABLED because they (expectedly) fail. v2: Add missing DISABLED prefix to "multi block" tests. (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 17:29:46 -07:00
Caio Marcelo de Oliveira Filho	4dfa7adc10	nir: Remove handling of dead writes from copy_prop_vars These are covered by another pass now. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 17:29:46 -07:00
Caio Marcelo de Oliveira Filho	c20dd1f77c	intel/nir, freedreno/ir3: Use the separated dead write vars pass No changes to shader-db for intel. No changes to shader-db expected for freedreno. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 17:29:46 -07:00
Caio Marcelo de Oliveira Filho	cb126cf67a	nir: Separate dead write removal into its own pass Instead of doing this as part of the existing copy_prop_vars pass. Separation makes easier to expand the scope of both passes to be more than per-block. For copy propagation, the information about valid copies comes from previous instructions; while the dead write removal depends on information from later instructions ("have any instruction used this deref before overwrite it?"). Also change the tests to use this pass (instead of copy prop vars). Note that the disabled tests continue to fail, since the standalone pass is still per-block. v2: Remove entries from dynarray instead of marking items as deleted. Use foreach_reverse. (Caio) (all from Jason) Do not cache nir_deref_path. Not worthy for this patch. Clear unused writes when hitting a call instruction. Clean up enumeration of modes for barriers. Move metadata calls to the inner function. v3: For copies, use the vector length to calculate the mask. (all from Jason) Use nir_component_mask_t when applicable. Rename functions for clarity. Consider local vars used by a call to be conservative (SPIR-V has such cases). Comment and assert the assumption that stores and copies are always to a deref that ends with a vector or scalar. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 17:29:46 -07:00
Caio Marcelo de Oliveira Filho	a02fd7000d	nir: Add tests for dead write elimination Note at the moment the pass called is nir_opt_copy_prop_vars, because dead write elimination is implemented there. Also added tests that involve identifying dead writes in multiple blocks (e.g. the overwrite happens in another block). Those currently fail as expected, so are marked to be skipped. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 17:29:46 -07:00
Caio Marcelo de Oliveira Filho	bbda2a17f7	nir: Add test file for vars related passes Add basic helpers for doing tests on the vars related optimization passes. The main goal is to lower the barrier to create tests during development and debugging of the passes. Full coverage is not a requirement. v2: Make find_next_intrinsic() skip blocks before 'after'. (Jason) Move nir_imm_ivec2() to nir_builder.h. (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 17:29:46 -07:00
Caio Marcelo de Oliveira Filho	c869646b7d	nir: Add nir_imm_ivec2 helper Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 17:29:46 -07:00
Caio Marcelo de Oliveira Filho	3966f053a1	util: Add foreach_reverse for dynarray Useful to walk the array removing elements by swapping them with the last element. v2: Change iteration to make sure we never underflow. (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 17:29:46 -07:00
Eric Anholt	8ec83dc51e	v3d: Add support for hardware pack/unpack of half floats. Cuts the formerly 7-minute simulation time of fs-packHalf2x16.shader_test in half.	2018-10-15 17:16:44 -07:00
Eric Anholt	7d77fe1bcc	nir: Expose nir_remove_unused_io_vars(). For gallium drivers where you want to do some linking at variant compile time, you don't have the other producer/consumer shader on hand to modify. By exposing the inner function, the driver can have the used varyings in the compiled shader cache key and still do linking. This is also useful for V3D, where the binning shader wants to only output position and TF varyings. We've been removing those after nir_lower_io, but this will be less driver-specific code and let more of the shader get DCEed early in NIR. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-15 17:16:44 -07:00
Eric Anholt	b788ab6d5c	nir: Be sure to fix deref modes after demoting shader i/o vars to global. Fixes assertion failures when calling nir_remove_unused_varyings() or nir_remove_unused_io_vars(). Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-15 17:16:44 -07:00
Eric Anholt	dda1ae9b3c	gallium/ttn: Convert inputs and outputs to derefs of variables. This means that TTN shaders more closely resemble GTN shaders: they have inputs and outputs as variable derefs, with the variables having their .driver_location already set up for you. This will be useful for v3d to do input variable DCE in NIR, which we can't do when the TTN shaders never have a pre-nir_lower_io stage. Acked-by: Rob Clark <robdclark@gmail.com>	2018-10-15 17:16:43 -07:00
Eric Anholt	da15a0d88e	gallium/ttn: Fix the type of gl_FragDepth. In TGSI we have a vec4 of which only .z is used, but for NIR we should be using a float the same as other NIR IR. We were already moving TGSI's .z to the .x channel. Acked-by: Rob Clark <robdclark@gmail.com>	2018-10-15 17:16:43 -07:00
Kristian H. Kristensen	f93e431272	freedreno/a6xx: Enable blitter Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-10-15 15:22:38 -07:00
Kristian H. Kristensen	47bc9fad3e	freedreno/a6xx: Update headers Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-10-15 15:22:35 -07:00
Kristian H. Kristensen	421863412c	freedreno/a6xx: Remove unnecessary GRAS_2D_BLIT_INFO write Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-10-15 15:20:28 -07:00
Jason Ekstrand	e4c9bcd037	anv: Don't advertise ASTC support on BSW Tested-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2018-10-15 16:55:25 -05:00
Samuel Pitoiset	26a2ce35ab	radv: do not force the flat qualifier for clip/cull distances This fixes some new CTS that reads clip/cull distances from the fragment shader stage: dEQP-VK.clipping.user_defined.clip_* Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-15 21:55:28 +02:00
Samuel Pitoiset	80c84bdba9	radv: bump discreteQueuePriorities to 2 It's the minimum value required by the spec. This fixes dEQP-VK.api.info.device.properties. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-15 21:55:25 +02:00
Jason Ekstrand	ae18c53ba6	anv: Split dispatch tables into device and instance There's no reason why we need generate trampoline functions for instance functions or carry N copies of the instance dispatch table around for every hardware generation. Splitting the tables and being more conservative shaves about 34K off .text and about 4K off .data when built with clang. Before splitting dispatch tables: text data bss dec hex filename 3224305 286216 8960 3519481 35b3f9 _install/lib64/libvulkan_intel.so After splitting dispatch tables: text data bss dec hex filename 3190325 282232 8960 3481517 351fad _install/lib64/libvulkan_intel.so Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-15 13:30:24 -05:00
Kenneth Graunke	18cc65edf8	i965: Drop assert about number of uniforms in ARB handling. My recent prog_to_nir patch started making new sampler uniforms, which apparently increased the number of parameters. We used to poke at the one parameter directly, making it important that there was only one, but we haven't done that in a while. It should be safe to just delete the assertion. Fixes: `1c0f92d8a8` "nir: Create sampler variables in prog_to_nir." Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-15 10:56:12 -07:00
Jason Ekstrand	2241be1d1b	vulkan: Add the fuchsia headers These were missing in the last couple of spec updates. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-15 10:20:31 -05:00
Bas Nieuwenhuizen	6ed0fd24d4	radv: Implement VK_EXT_pci_bus_info. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-15 12:27:49 +02:00
Kenneth Graunke	38a23517fd	gallium/u_transfer_helper: Add support for separate Z24/S8 as well. u_transfer_helper already had code to handle treating packed Z32_S8 as separate Z32_FLOAT and S8_UINT resources, since some drivers can't handle that interleaved format natively. Other hardware needs depth and stencil as separate resources for all formats. For example, V3D3 needs this for 24-bit depth as well. This patch adds a new flag to lower all depth/stencils formats, and implements support for Z24_UNORM_S8_UINT. (S8_UINT_Z24_UNORM is left as an exercise to the reader, preferably someone who has access to a machine that uses that format.) Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-14 23:36:28 -07:00
Kenneth Graunke	c3d219837a	gallium/format: Add a helper to combine separate Z24 and S8 stencil. This new function takes separate Z24 depth and S8 stencil sources, and packs them into a single combined Z24S8 buffer. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-14 23:36:28 -07:00
Kenneth Graunke	5849e0612c	gallium/auxiliary: Add util_format_get_depth_only() helper. This will be used by u_transfer_helper.c shortly, in order to split packed depth-stencil into separate resources. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-14 23:36:28 -07:00
Kenneth Graunke	1c0f92d8a8	nir: Create sampler variables in prog_to_nir. This is needed for nir_gather_info to actually count the textures, since it operates solely on variables. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-14 23:35:47 -07:00
Kenneth Graunke	ed169c9ad2	nir: Create sampler2D variables in nir_lower_{bitmap,drawpixels}. This is needed for nir_gather_info to actually count the new textures, since it operates solely on variables. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-14 23:35:35 -07:00
Jason Ekstrand	b7397b09d5	spirv: Update SPIR-V json and headers to Khronos master This corresponds to commit 801cca8104245c07e8cc532 on GitHub. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-13 09:56:18 -05:00
Samuel Pitoiset	13fd4e601c	vulkan: Update the XML and headers to 1.1.88 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-13 09:56:18 -05:00
Vinson Lee	cc33621e3b	r600/sb: Fix constant-logical-operand warning. sb/sb_bc_parser.cpp:620:27: warning: use of logical '&&' with constant operand [-Wconstant-logical-operand] if (cf->bc.op_ptr->flags && FF_GDS) ^ ~~~~~~ sb/sb_bc_parser.cpp:620:27: note: use '&' for a bitwise operation if (cf->bc.op_ptr->flags && FF_GDS) ^~ & sb/sb_bc_parser.cpp:620:27: note: remove constant to silence this warning if (cf->bc.op_ptr->flags && FF_GDS) ~^~~~~~~~~ Fixes: `da977ad907` ("r600/sb: start adding GDS support") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-12 10:58:58 -07:00
Rafael Antognolli	ca168ec008	i965/miptree: Use enum instead of boolean. ISL_AUX_USAGE_NONE happens to be the same as "false", but let's do the right thing and use the enum. v2: fix intel_miptree_finish_depth too (Caio) Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-12 10:14:20 -07:00
Samuel Pitoiset	2c139e2cdf	radv: do not support blitting surfaces for R32G32B32 formats Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108113 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-12 15:28:21 +02:00
Jose Fonseca	7c5aececda	scons: Allow building with custom MSVC_USE_SCRIPT script. SCons MSVC support relies on vcvarsall.bat to extract the PATH, CPP includes, library paths, etc. And SCons also has an build env var named MSVC_USE_SCRIPT which one can use to point to alternative vcvarsall.bat script. This change exposes this MSVC_USE_SCRIPT build env variable as a SCons command line variable. This will enable using MSVC outside Program Files (e.g, network shares, etc.) This change also links advapi32 library, necessary for the Windows Registry API used by WGL state tracker, avoiding missing symbols. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-10-12 07:45:53 +01:00
Samuel Pitoiset	416013b4f5	radv: emit the GLC bit for SSBO loads/stores when needed This fixes some new memory model tests: dEQP-VK.memory_model.message_passing.core11.u32.coherent.fence_fence.atomicwrite.device.* Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108112 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-12 08:42:08 +02:00
Samuel Pitoiset	4b74f05f6b	spirv/nir: handle memory access qualifiers for SSBO loads/stores v2: - change how the access qualifiers are accumulated v3: - duplicate members in struct_member_decoration_cb() - handle access qualifiers on variables - remove access qualifiers handling in _vtn_variable_load_store() - fix setting access qualifiers on type->array_element Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net	2018-10-12 08:42:08 +02:00
Tapani Pälli	26a10e3844	anv/android: we need git_sha1.h in include paths Fixes: `e4538b9` "anv: Implement VK_KHR_driver_properties" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-12 07:29:03 +03:00
Nanley Chery	0ee0e0b6b9	anv: Clear WM_HZ_OP overrides in init_device_state This is basically a port of commit, `3ade766684` ("i965: Disable 3DSTATE_WM_HZ_OP fields.") The BDW+ docs describe how to use the 3DSTATE_WM_HZ_OP instruction in the section titled, "Optimized Depth Buffer Clear and/or Stencil Buffer Clear." It mentions that the packet overrides GPU state for the clear operation and needs to be reset to 0s to clear the overrides. Depending on the kernel, we may not get a context with the GPU state for this packet zeroed. Do it ourselves just in case. Prevents a number of GPU hangs when running crucible on ICL. I tried to get the exact number of hangs that occurs without this patch, but was unsuccessful. The test machine became unresponsive before completing the full run. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-10-11 16:31:08 -07:00
Jordan Justen	494d2ec277	i965/gen10+: Initialize new fields in STATE_BASE_ADDRESS Ref: `263b584d5e` "i965/skl: Emit extra zeros in STATE_BASE_ADDRESS on Skylake." Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-10-11 15:16:04 -07:00
Jordan Justen	d18a0d955e	anv/gen9+: Initialize new fields in STATE_BASE_ADDRESS Ref: `263b584d5e` "i965/skl: Emit extra zeros in STATE_BASE_ADDRESS on Skylake." Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-10-11 15:16:00 -07:00
Jason Ekstrand	d7e0d47b9d	nir: Add a bunch of b2[if] optimizations The b2f and b2i conversions always produce zero or one which are both representable in every type and size. Since b2i and b2f support all bit sizes, we can just get rid of the conversion opcode. total instructions in shared programs: 15089335 -> 15084368 (-0.03%) instructions in affected programs: 212564 -> 207597 (-2.34%) helped: 896 HURT: 0 total cycles in shared programs: 369831123 -> 369826267 (<.01%) cycles in affected programs: 2008647 -> 2003791 (-0.24%) helped: 693 HURT: 216 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-11 15:21:19 -05:00
Jason Ekstrand	0e0dc596a2	intel/vec4: Fix nir_op_b2[fi] with 64-bit result This is valid NIR but you can't actually hit this case today. GLSL IR doesn't have a bool to double opcode; it does f2d(b2f(x)). In SPIR-V we don't have any to/from bool conversion opcodes at all. However, the next commit will make us start generating it so we should be ready. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-11 15:21:19 -05:00
Jason Ekstrand	497675c21e	intel/fs: Fix nir_op_b2[fi] with 64-bit result on Gen8 LP and Gen9 LP Several of the Atom GPUs have additional restrictions on alignment when moving < 64-bit source to a 64-bit destination. All of the nir_op_264 code generation paths respected this, but nir_op_b2[fi] did not. Previous to commit `a68dd47b91` it was not possible to generate such an instruction from the GLSL path. It may have been possible from SPIR-V, but it's not clear. The aforementioned patch converts a 64-bit nir_op_fsign into a sequence of operations including a nir_op_b2f with a 64-bit result. This "just works" everywhere except these Atom parts. This problem was not detected during normal CI testing because the Atom parts are not included in developer builds. v2 (idr): Make the patch compile, and make some cosmetic changes. Add a commit message. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108319 Fixes: `a68dd47b91` "nir/algebraic: Simplify fsat of fsign" Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-11 15:21:19 -05:00
Vinson Lee	4ece6aa552	egl: Use correct shared libraries suffix on macOS. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-11 11:30:00 -07:00
Illia Iorin	b18f8e63ef	mesa: Fix pack_uint_Z_FLOAT32() Fixed pack_uint_Z_FLOAT32 by casting row data to float instead uint. Remove code duplicate function pack_uint_Z_FLOAT32_X24S8. Edited case in "_mesa_get_pack_uint_z_func". Now it looks like "_mesa_get_pack_float_z_func". Remove _mesa_problem call, which was added for debuging this issue. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=91433 Signed-off-by: Illia Iorin <illia.iorin@globallogic.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2018-10-11 10:15:09 -07:00
Rodrigo Vivi	24db1c7fcc	intel: Introducing Whiskey Lake platform Whiskey Lake uses the same gen graphics as Coffe Lake, including some ids that were previously marked as reserved on Coffe Lake, but that now are moved to WHL page. This follows the ids and approach used on kernel's commit b9be78531d27 ("drm/i915/whl: Introducing Whiskey Lake platform") and commit c1c8f6fa731b ("drm/i915: Redefine some Whiskey Lake SKUs") v2: Lionel noticed that GT{1,2,3} on kernel wasn't following spec when looking to number of EUs, so kernel has been updated. Cc: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: José Roberto de Souza <jose.souza@intel.com> Cc: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-11 10:02:40 -07:00
Boyuan Zhang	d76c277421	st/va: use provided sizes and coords for vlVaGetImage vlVaGetImage should respect the width, height, and coordinates x and y that passed in. Therefore, pipe_box should be created with the passed in values instead of surface width/height. v2: add input size check, return error when size out of bounds v3: fix the size check for vaimage v4: add size adjustment for x and y coordinates Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Cc: "18.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Christian König <christian.koenig@amd.com>	2018-10-11 09:00:18 -04:00
Samuel Pitoiset	229803b66a	radv: implement clear operations for R32G32B32 This fixes crashes for some CTS: dEQP-VK.api.copy_and_blit.core.blit_image.all_formats.color..linear__* dEQP-VK.api.copy_and_blit.core.blit_image.all_formats.color.._linear_* Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108113 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-11 14:49:16 +02:00
Samuel Pitoiset	c3ba3c2611	radv: disallow 3D images and mipmaps/layers for R32G32B32 linear formats R32G32B32 are weird formats and we are only going to support some basic operations for now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-11 14:49:14 +02:00
Samuel Pitoiset	d179312b53	radv: add a workaround for a VGT hang with prim restart and strips Otherwise, Yakuza and The Evil Within hang the GPU with DXVK. This apparently only works on Polaris. Suggested by Marek. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-11 10:16:11 +02:00
Timothy Arceri	3bc012a34e	glsl: remove redundant es_shader checks The es check is already covered by the is_version() check. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-11 14:45:43 +11:00
Dave Airlie	cc2fe57922	st/glsl_to_tgsi: initialise need_uarl in contructor Found by coverity Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-11 10:20:37 +10:00
Dave Airlie	c5c3da6c90	glspirv: drop pointless assert (size_t is unsigned) Found by coverity Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2018-10-11 10:19:48 +10:00
Dave Airlie	600d8ecb57	radv: remove unsigned comparison against 0 The value is always >= 0 here. Found by coverity Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-11 10:19:20 +10:00
Dave Airlie	6e1d294804	radv: remove dead code for master_fd close We have never opened master_Fd at this point, so remove code to close it. Found by coverity. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-11 10:19:16 +10:00
Dave Airlie	7c04b96f03	radv: don't pass shader key by copy Coverity pointed out we were copying 168 bytes here unnecessarily. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-10-11 10:18:43 +10:00
Dave Airlie	29a7631986	anv: add missing unlock in error path. Not going to matter, but be consistent. Found by coverity Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Fixes: `caf41c78c` (anv/allocator: Support softpin in the BO cache)	2018-10-11 09:50:27 +10:00
Jason Ekstrand	4ba445e011	intel: Don't propagate conditional modifiers if a UD source is negated This fixes a bug uncovered by my NIR integer division by constant optimization series. Fixes: `19f9cb72c8` "i965/fs: Add pass to propagate conditional..." Fixes: `627f94b72e` "i965/vec4: adding vec4_cmod_propagation..." Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-10 13:13:12 -05:00
Jason Ekstrand	328d4d080b	util: Add tests for fast integer division by constants While I generally trust rediculousfish to have done his homework, we've made some adjustments to suit the needs of mesa and it'd be good to test those. Also, there's no better place than unit tests to clearly document the different edge cases of the different methods. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-10 13:13:12 -05:00
Marek Olšák	a9be8dddfe	util: Add power-of-two divisor support to compute_fast_udiv_info Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-10 13:13:12 -05:00
Jason Ekstrand	7cde4dbcd7	util: Generalize fast integer division to be variable bit-width There's nothing inherently fixed-width in the code. All that's required to generalize it is to make everything internally 64-bit and pass UINT_BITS in as a parameter to util_compute_fast_[us]div_info. With that, it can now handle 8, 16, 32, and 64-bit integer division by a constant. We also add support for division by 1 and by other powers of 2. This is useful if you want to divide by a uniform value in a shader where you have the opportunity to adjust the uniform on the CPU before passing it in. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-10 13:13:12 -05:00
Marek Olšák	64eb0738d4	util: Add fast division helpers Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-10 13:13:12 -05:00
Marek Olšák	2940c257a6	util: import public domain code for integer division by a constant Compilers can use this to generate optimal code for integer division by a constant. Additionally, an unsigned division by a uniform that is constant but not known at compile time can still be optimized by passing 2-4 division factors to the shader as uniforms and executing one of the fast_udiv* variants. The signed division algorithm doesn't have this capability. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-10 13:13:12 -05:00
Jason Ekstrand	0dca6730b4	util: Add a simple big math library Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-10 13:13:12 -05:00
Dylan Baker	b8521704ed	meson: Don't allow building EGL on Windows or MacOS Currently mesa only supports EGL on Unix like systems, cygwin, and haiku. Meson should actually enforce this. This fixes the default build on MacOS. v2: - invert the condition, mark darwin and windows as not supported instead of trying to mark what is supported. v3: - add missing ) v3: - Update comment to reflect condition change in v2 CC: 18.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-10 11:02:36 -07:00
Timothy Arceri	0346ad3774	glsl: ignore trailing whitespace when define redefined The Nvidia/AMD binary drivers allow this, as does GCC. This fixes shader compilation issues in the latest update of No Mans Sky. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-10 15:08:32 +11:00
Ian Romanick	b44c9292b7	intel/compiler: Don't handle fsign.sat No shader-db or CI changes on any Intel platform. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-10-09 13:56:42 -07:00
Ian Romanick	a68dd47b91	nir/algebraic: Simplify fsat of fsign These allows us to not support fsign.sat in the Intel compiler backend, and that will simplify some later changes. No shader-db changes on any Intel platform. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-10-09 13:56:42 -07:00
Ian Romanick	1546204cdd	nir/algebraic: sign(x)xx is abs(x)*x shader-db results: All Gen7+ platforms had similar results. (Skylake shown) total instructions in shared programs: 15106023 -> 15105981 (<.01%) instructions in affected programs: 300 -> 258 (-14.00%) helped: 6 HURT: 0 helped stats (abs) min: 7 max: 7 x̄: 7.00 x̃: 7 helped stats (rel) min: 14.00% max: 14.00% x̄: 14.00% x̃: 14.00% 95% mean confidence interval for instructions value: -7.00 -7.00 95% mean confidence interval for instructions %-change: -14.00% -14.00% Instructions are helped. total cycles in shared programs: 566050327 -> 566050075 (<.01%) cycles in affected programs: 2826 -> 2574 (-8.92%) helped: 6 HURT: 0 helped stats (abs) min: 40 max: 44 x̄: 42.00 x̃: 42 helped stats (rel) min: 8.89% max: 8.94% x̄: 8.92% x̃: 8.92% 95% mean confidence interval for cycles value: -44.30 -39.70 95% mean confidence interval for cycles %-change: -8.95% -8.88% Cycles are helped. No changes on Gen6 or earlier. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-10-09 13:56:42 -07:00
Ian Romanick	10f4a8871e	nir: Add helper functions to get the instruction that generated a nir_src Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-10-09 13:56:42 -07:00
Brian Paul	797e34f658	svga: change svga_destroy_shader_variant() to return void svga_destroy_shader_variant() itself flushes and retries the command if there's a failure. So no need for the callers to do it. Other callers of the function were already ignoring the return value. This also fixes a corner-case double-free reported by Coverity (and reported by Dave Airlie). Tested with various OpenGL apps. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-10-09 11:17:14 -06:00
Dylan Baker	b781688636	meson: Don't build glsl compiler tests unless OpenGL is enabled Since there are no other users of the glsl compiler. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-09 08:56:00 -07:00
Dylan Baker	d84f003b95	meson: Only build gallium state tracker tests with shared_glapi This has always been a requirement, it's just somehow been missed in the meson build. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-09 08:55:56 -07:00
Dylan Baker	0fa6a8271a	meson: only build clapi tests when OpenGL is being built Otherwise building just vulkan (among other things) will build these tests, pull in a bunch of stuff they shouldn't, and potentially fail to compile. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-09 08:55:48 -07:00
Ilia Mirkin	92f56fbd89	nvc0: fix blitting red to srgb8_alpha For some reason the 2d engine can't handle this. Red formats get special treatment there, so perhaps related. Fixes dEQP-GLES3 tests of the form: dEQP-GLES3.functional.fbo.blit.conversion.r{8,16f,32f}_to_srgb8_alpha8 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com> Cc: mesa-stable@lists.freedesktop.org	2018-10-09 10:33:11 -04:00
Ilia Mirkin	9bf0614116	nv50,nvc0: guard against zero-size blits The current state tracker can generate these sometimes. Fixing this is more involved, and due to some integer math we can generate divisions-by-zero. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com> Cc: mesa-stable@lists.freedesktop.org	2018-10-09 10:33:11 -04:00
Ilia Mirkin	78d3640e49	nv50,nvc0: mark RGBX_UINT formats as renderable This helps st/mesa avoid some (apparently) buggy fallbacks. Specifically the CopyTexSubImage fallback tries to read texture A as RGBA_FLOAT and write back that data into the target format, which fails for integer formats which have no appropriate logic to do the conversion. Since integer formats don't blend, there's no harm in the fact that the "A" component gets written anyways. Fixes, among others: https://www.khronos.org/registry/webgl/sdk/tests/conformance2/textures/canvas/tex-2d-rgb8ui-rgb_integer-unsigned_byte.html Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2018-10-09 10:33:11 -04:00
Eric Engestrom	976188737d	radv: add missing meson c++ visibility arguments Fixes: `6f3aee40f9` "radv: using tls to store llvm related info and speed up compiles (v10)" Cc: Dave Airlie <airlied@redhat.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-10-09 14:22:24 +01:00
Michel Dänzer	9d3fefdc41	gbm: Add GBM_FORMAT_ARGB1555 support Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-09 10:32:51 +02:00
Michel Dänzer	e7e033ed8a	st/dri: Handle BGRA5551 format Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-09 10:32:50 +02:00
Rob Clark	fa52ff856d	freedreno/a5xx+a6xx: fix LRZ pitch alignment Both RB_2D_DST_SIZE.PITCH (a6xx) and RB_MRT[n].PITCH (a5xx) need alignment to 64. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-08 19:05:14 -04:00
Rob Clark	82c3b6fe49	freedreno/a6xx: add LRZ support As with a5xx, hidden behind FD_MESA_DEBUG=lrz due to being paranoid about z-fighting issues with some games (in particular, this was observed with 0ad on a5xx.. but I think the proper solution to enable this by default is to figure out how to do driver specific driconf options). Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-08 19:05:14 -04:00
Rob Clark	a877451a41	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-08 18:03:35 -04:00
Rob Clark	bf79a7cc25	freedreno/a6xx: add helper for various CP_EVENT_WRITE Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-08 17:50:26 -04:00
Rob Clark	60af89815e	freedreno/a6xx: remove unused fxns Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-08 17:50:26 -04:00
Rob Clark	d5bd3ce89c	freedreno/a6xx: remove fd6_shader_stateobj Earlier gen's already got this cleanup, but a6xx was still off on a branch then. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-08 17:50:26 -04:00
Ilia Mirkin	1bb1c03d61	glsl: fix array assignments of a swizzled vector This happens in situations where we might do vec.wzyx[i] = ... The swizzle would get effectively ignored because of the interaction between how ir_assignment->set_lhs works and overwriting the write_mask. There are two cases, one where i is a constant, and another where i is variable. We have to be extra-careful in both cases. Fixes the following WebGL test: https://www.khronos.org/registry/webgl/sdk/tests/conformance2/glsl3/vector-dynamic-indexing-swizzled-lvalue.html And the new piglit tests: swizzled-writemask-indexing-nonconst.shader_test swizzled-writemask-indexing.shader_test Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Cc: mesa-stable@lists.freedesktop.org	2018-10-08 14:29:14 -04:00
Samuel Pitoiset	d3682766f6	radv: tidy up radv_pipeline_init_multisample_state() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-08 14:17:43 +02:00
Samuel Pitoiset	b38228ccb0	radv: always set PA_SC_MODE_CNTL_1.OUT_OF_ORDER_WATER_MARK It has probably no effect without out of order rasterization anyway. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-08 14:17:40 +02:00
Samuel Pitoiset	937986ca1d	radv: set DB_EQAA.INCOHERENT_EQAA_READS My attempt was to set this field instead of duplicating one. Fixes: `6cfa321c39` ("radv: add potential missing fields for DB_EQAA") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-08 14:17:33 +02:00
Chystiakov, Dmytro	47e3338b04	i965: fallback RGBX to RGBA in glEGLImageTargetRenderbufferStorageOES In the same fashion as is done for glEGLImageTextureTarget2D. v2: share the fallback which sets baseformat and internalformat correctly which makes both of the tests pass (Tapani) Fixes android.hardware.nativehardware.cts.AHardwareBufferNativeTests: #SingleLayer_ColorTest_GpuColorOutputCpuRead_R8G8B8X8_UNORM #SingleLayer_ColorTest_GpuColorOutputIsRenderable_R8G8B8X8_UNORM Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-10-08 08:03:45 +03:00
Tapani Pälli	d1fa69ed61	glsl: do not attempt assignment if operand type not parsed correctly v2: check types of both operands (Ian) Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108012	2018-10-08 08:02:50 +03:00
Marek Olšák	d877451b48	util/u_queue: add UTIL_QUEUE_INIT_SET_FULL_THREAD_AFFINITY Initial version discussed with Rob Clark under a different patch name. This approach leaves his driver unaffected.	2018-10-06 22:05:58 -04:00
Marek Olšák	066aa44fc5	radeonsi: fix a typo at CS_PARTIAL_FLUSH harmless	2018-10-06 21:50:52 -04:00
Marek Olšák	77903c8cfb	ac: add ac_build_round	2018-10-06 21:50:09 -04:00
Marek Olšák	fa023f293e	ac: correct PKT3_COPY_DATA definitions	2018-10-06 21:50:09 -04:00
Marek Olšák	82f5f89bf6	ac: simplify LLVM alloca helpers	2018-10-06 21:50:09 -04:00
Marek Olšák	a668c8d6ba	ac: define all address spaces properly	2018-10-06 21:50:09 -04:00
Gert Wollny	8f77156c26	gallivm: Make it possible to disable some optimization shortcuts in release builds For testing it is of interest that all tests of dEQP pass, e.g. to test virglrenderer on a host only providing software rendering like in a CI. Hence make it possible to disable certain optimizations that make tests fail. While we are there also add some documentation to the flags to make it clear that this is opt-out. Setting the environment variable "GALLIVM_PERF=no_filter_hacks" can be used to make the following tests pass in release mode: dEQP-GLES2.functional.texture.mipmap.2d.affine._linear_ dEQP-GLES2.functional.texture.mipmap.cube.generate.* dEQP-GLES2.functional.texture.vertex.2d.filtering._mipmap_linear_ dEQP-GLES2.functional.texture.vertex.2d.wrap.* Related: https://bugs.freedesktop.org/show_bug.cgi?id=94957 v2: rename optimization disabling flag to 'safemath' and also move the nopt flag to the perf flags. v3: rename flag "safemath" to "no_filter_hacks" since safemath is usually associated with floating point operations (Roland) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-10-06 13:12:48 +02:00
Tomeu Vizoso	9d81cd8e7c	virgl: Pass resource size and transfer offsets Pass the size of a resource when creating it so a backing can be kept in the other side. Also pass the required offset to transfer commands. This moves vtest closer to how virtio-gpu works, making it more useful for testing. v2: - Use new messages for creation and transfers, as changing the behavior of the existing messages would be messy given that we don't want to break compatibility with older servers. v3: - Use correct strides: The resource corresponding to the output display might have a differnt line stride then the IOVs, so when reading back to this resource take the resource stride and the the IOV stride into account. v4: Fix transfer size calculation (Andrey Simiklit) v5: Add comment about transfer size value in the PUT commend (Gurchetan). Add a comment about the size correction for transfers for reading and writing the resource. Fixing this by correctly evaluating the size upfront will need some work also on the virglrenderer side. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> (v2) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-10-06 13:12:44 +02:00
Gert Wollny	5d7858f151	virgl, vtest: Correct the transfer size calculation The transfer size used in virglrenderer refers to uint32_t, so one must add 3 and then divide by 4 instead of adding 3/4 which is a no-op with integers. Fixes: `b3b82fe8ea` virgl/vtest: add vtest driver Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-10-06 13:12:44 +02:00
Alan Coopersmith	066850edad	util: Make xmlconfig.c build on Solaris without d_type in dirent (v2) v2: check for lstat() failing Fixes: `04bdbbcab3` "xmlconfig: read more config files from drirc.d/" Signed-off-by: Alan Coopersmith <alan.coopersmith@oracle.com> Reviewed-by: Roland Mainz <roland.mainz@nrubsig.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-10-05 17:30:45 -07:00
Sonny Jiang	084cf3b966	radeonsi:optimizing SET_CONTEXT_REG for shaders vgt_vertex_reuse Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-10-05 19:04:13 -04:00
Sonny Jiang	ce1d72609d	radeonsi:optimizing SET_CONTEXT_REG for shaders Tessellation Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-10-05 19:04:13 -04:00
Sonny Jiang	4de328da07	radeonsi:optimizing SET_CONTEXT_REG for shaders PS Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-10-05 19:04:13 -04:00
Sonny Jiang	f243980f2c	radeonsi:optimizing SET_CONTEXT_REG for shaders VS Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-10-05 19:04:13 -04:00
Sonny Jiang	4052624398	radeonsi:optimizing SET_CONTEXT_REG for shaders GS Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-10-05 19:04:13 -04:00
Marek Olšák	86f004bdfc	radeonsi: optimize and allow reg > 31 in radeon_opt_set_context_reg functions reg_saved will have 64 bits, and (1 << reg) where reg > 31 has undefined behavior. (1ull << reg) would be correct for 64 bits. This commit shifts the other way in order to merge the conditions.	2018-10-05 19:04:13 -04:00
Sonny Jiang	eeb9170599	radeonsi: optimizing SET_CONTEXT_REG for shaders ES Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-10-05 17:53:52 -04:00
Samuel Pitoiset	a1bc152340	spirv: mark variables decorated with XfbBuffer as always active Otherwise, they are removed during NIR linking or in some lowering passes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-10-05 18:13:25 +02:00
Juan A. Suarez Romero	5bd03d02c1	docs: update calendar, add news and link release notes to 18.2.2 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-10-05 12:51:34 +02:00
Juan A. Suarez Romero	c565eeee0b	docs: add sha256 checksums for 18.2.2 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `cb63a4e114`)	2018-10-05 12:46:33 +02:00
Juan A. Suarez Romero	3537465059	docs: add release notes for 18.2.2 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `abaeb79eb2`)	2018-10-05 12:46:31 +02:00
Jason Ekstrand	dd553bc67f	nir/alu_to_scalar: Use ssa_for_alu_src in hand-rolled expansions The ssa_for_alu_src helper will correctly handle swizzles and other source modifiers for you. The expansions for unpack_half_2x16, pack_uvec2_to_uint, and pack_uvec4_to_uint were all broken with regards to swizzles. The brokenness of unpack_half_2x16 was causing rendering errors in Rise of the Tomb Raider on Intel ever since `c11833ab24` which added an extra copy propagation to the optimization pipeline and caused us to start seeing swizzles where we hadn't seen any before. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107926 Fixes: `9ce901058f` "nir: Add lowering of nir_op_unpack_half_2x16." Fixes: `9b8786eba9` "nir: Add lowering support for packing opcodes." Tested-by: Alex Smith <asmith@feralinteractive.com> Tested-by: Józef Kucia <joseph.kucia@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-10-04 12:43:59 -05:00
Vadym Shovkoplias	5f0567a4f6	glsl/linker: Check the subroutine associated functions names >From Section 6.1.2 (Subroutines) of the GLSL 4.00 specification "A program will fail to compile or link if any shader or stage contains two or more functions with the same name if the name is associated with a subroutine type." v2: - error out earlier (Tapani) - style fixes (Iago) Fixes: * no-overloads.vert Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108109 Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-10-04 17:41:19 +02:00
Tomeu Vizoso	ed53a79cf8	virgl: Negotiate version with vtest server Check if server supports version negotation by sending a PING_PROTOCOL_VERSION message right before a dummy RESOURCE_BUSY_WAIT. If we don't get a reply for the first, we know the server doesn't support it. If it does support it, we can query the max protocol version supported by the server and fall back if needed. v2: - Send a new message to negotiate the protocol version, checking if the server supports this message by immediately sending a busy wait message. (Dave Airlie) v3: - Send a zero-arg command PING_PROTOCOL_VERSION so we actually keep compatibility with older servers. (Code by Dave Airlie) Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-10-04 16:18:36 +02:00
Sagar Ghuge	0c70e11206	intel: aubinator: Fix memory leaks Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-04 10:01:56 +01:00
Sagar Ghuge	29a2eaf3db	intel/decoder: construct correct xml filename construct correct gen xml filename when we try to load hardware xml description from a given path v2: remove temporary variable (Francesco Ansanelli) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-04 10:01:56 +01:00
Sagar Ghuge	f9c8468c82	intel/decoder: Avoid freeing invalid pointer v2: Free ctx.spec if error while reading genxml (Lionel Landwerlin) v3: Handle case where genxml is empty (Lionel Landwerlin) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-04 10:01:56 +01:00
Sagar Ghuge	ba3304e764	intel/decoder: add gen_spec_init method Initialize gen_spec instance properly when loading hardware xml description from specifc directory to avoid segmentation fault. v2: correct function definition (Lionel Landwerlin) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-04 10:01:56 +01:00
Samuel Pitoiset	2b34985d93	radv: fix resetting the pool for timestamp queries Since the driver no longer uses the availability bit for timestamp queries it shouldn't reset it. Instead, it should reset the query values to UINT32_MAX. This fixes VM faults. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108164 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Józef Kucia <joseph.kucia@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-10-04 10:56:25 +02:00
Guido Günther	b2a876a42b	etnaviv: Use write combine instead of unached mappings for shader bo The later are sensitive to unaligned accesses on arm64[1] and we don't need an uncached mapping here. [1]: https://lists.freedesktop.org/archives/etnaviv/2018-September/001956.html Signed-off-by: Guido Günther <guido.gunther@puri.sm> Signed-off-by: Lucas Stach <l.stach@pengutronix.de>	2018-10-04 10:33:25 +02:00
Marek Olšák	8e0b4cb8a1	drirc: add a workaround for ARMA 3 Cc: 18.2 <mesa-stable@lists.freedesktop.org>	2018-10-04 01:01:54 -04:00
Jason Ekstrand	f5bab06428	anv/batch_chain: Don't start a new BO just for BATCH_BUFFER_START Previously, we just went ahead and emitted MI_BATCH_BUFFER_START as normal. If we are near enough to the end, this can cause us to start a new BO just for the MI_BATCH_BUFFER_START which messes up chaining. We always reserve enough space at the end for an MI_BATCH_BUFFER_START so we can just increment cmd_buffer->batch.end prior to emitting the command. Fixes: `a0b133286a` "anv/batch_chain: Simplify secondary batch return..." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107926 Tested-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-03 09:03:12 -05:00
Jason Ekstrand	7a89a0d9ed	anv: Use separate MOCS settings for external BOs On Broadwell and above, we have to use different MOCS settings to allow the kernel to take over and disable caching when needed for external buffers. On Broadwell, this is especially important because the kernel can't disable eLLC so we have to do it in userspace. We very badly don't want to do that on everything so we need separate MOCS for external and internal BOs. In order to do this, we add an anv-specific BO flag for "external" and use that to distinguish between buffers which may be shared with other processes and/or display and those which are entirely internal. That, together with an anv_mocs_for_bo helper lets us choose the right MOCS settings for each BO use. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=99507 Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-03 09:03:03 -05:00
Emil Velikov	08bff097e1	meson: remove invalid "opencl" llvm component Seeming copy/paste mistake from configure.ac which uses $2 for the component and $3 for the fancy name printing. Cc: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	fe8be81b4a	Revert "mesa: remove unnecessary 'sort by year' for the GL extensions" This reverts commit `3d81e11b49`. As reported by Federico, some games require the 'sort by year' since they truncate the extensions which do not fit the fixed size string array. Seemingly I did not consider that, as the documentation (both Mesa and Nvidia) mentions about program crashes ... which are worked around by setting the env. variable. This commit reinstates the workaround and enhances the documentation. Cc: Marek Olšák <maraeo@gmail.com> Cc: Ian Romanick <idr@freedesktop.org> Reported-by: Federico Dossena <info@fdossena.com> Fixes: `3d81e11b49` ("mesa: remove unnecessary 'sort by year' for the GL extensions") Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Tested-by: Federico Dossena <info@fdossena.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	91ff8b1dd9	mesa: reorder and document the tokens in glheader.h Split into different sections, document each one as well as strange cases like GL_ATI_texture_compression_3dc. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	5f70964b1d	mesa: remove duplicate declarations from glheader.h Remove all the desktop GL and GLX entries from the list. Former are pulled by the gl.h and glext.h includes at the top while the latter are no longer needed. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	01b92916af	i965: reference __DRI_ATTRIB_SWAP_COPY token over the GLX one Earlier commit updated the code to use the DRI tokens, yet forgot to update the comment. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	e04b2c0376	i915: reference __DRI_ATTRIB_SWAP_COPY token over the GLX one Earlier commit updated the code to use the DRI tokens, yet forgot to update the comment. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	d26b122ee8	dri/common: move the required GLX_* token definitions locally Will allow us to remove even bigger hack elsewhere. But more importantly, we should not be using _any_ GLX tokens in DRI. Document the gory details about the current side-effects. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	4ef53669af	dri/common: use __DRI_ATTRIB_SWAP* instances when describing db_modes Somewhat recently Thomas Hellstrom added the respective DRI tokens and updated the drivers. Update the documentation to match reality. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	d6a6760139	egl/x11: remove eglSwap* surface check Already handled further up in eglapi.c. To make things a tiny bit strange, X11+DRI3 was doing the wrong thing by returning EGL_FALSE (+ no error), while X11+DRI2 was returning EGL_TRUE. Cc: samiuddi <sami.uddin.mohammad@intel.com> Cc: Eric Engestrom <eric.engestrom@intel.com> Cc: Erik Faye-Lund <kusmabite@gmail.com> Cc: Tomasz Figa <tfiga@chromium.org> Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	8030741996	egl/surfaceless: remove eglSwap* stubs The API validation in eglapi.c already returns if the surface type is !window. Cc: samiuddi <sami.uddin.mohammad@intel.com> Cc: Erik Faye-Lund <kusmabite@gmail.com> Cc: Tomasz Figa <tfiga@chromium.org> Cc: Gurchetan Singh <gurchetansingh@chromium.org> Cc: Chad Versace <chadversary@chromium.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	a370e278d3	egl/drm: remove eglSwap* surface check Already handled further up in eglapi.c Cc: samiuddi <sami.uddin.mohammad@intel.com> Cc: Erik Faye-Lund <kusmabite@gmail.com> Cc: Tomasz Figa <tfiga@chromium.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	91ccb59ff4	egl/android: remove eglSwap* surface check Already handled further up in eglapi.c Cc: samiuddi <sami.uddin.mohammad@intel.com> Cc: Erik Faye-Lund <kusmabite@gmail.com> Cc: Tomasz Figa <tfiga@chromium.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-03 13:38:06 +01:00
Emil Velikov	8f66743ca2	egl: make eglSwapBuffers* a no-op for !window surfaces Analogous to the previous commit - the spec says the function is a no-op when a pbuffer or pixmap surface is used. Cc: samiuddi <sami.uddin.mohammad@intel.com> Cc: Erik Faye-Lund <kusmabite@gmail.com> Cc: Tomasz Figa <tfiga@chromium.org> Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-03 13:38:05 +01:00
Emil Velikov	64b4ccde0c	egl: make eglSwapInterval a no-op for !window surfaces As the spec says, the function is a no-op when the surface is not a window one. That spec implies that EGL_TRUE should be returned in that case, yet the ARM driver seems to return EGL_FALSE + EGL_BAD_SURFACE. The Nvidia driver returns EGL_TRUE. We follow that behaviour until a decision is made. https://gitlab.khronos.org/egl/API/merge_requests/17 Cc: samiuddi <sami.uddin.mohammad@intel.com> Cc: Erik Faye-Lund <kusmabite@gmail.com> Cc: Tomasz Figa <tfiga@chromium.org> Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-03 13:38:05 +01:00
Emil Velikov	c231b49c53	freedreno: add the a6xx sources to the Android build Add the files otherwise things just won't build. Haven't actually tested it, but it's a small step in the right direction. Fixes: `de3b34df97` ("freedreno: Add a6xx backend") Cc: Kristian H. Kristensen <hoegsberg@chromium.org> Cc: Rob Clark <robdclark@gmail.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-10-03 13:38:05 +01:00
Emil Velikov	7419b22413	pipe-loader: add a dup() in pipe_loader_sw_probe_kms The pipe_loader_release API closes the fd given, even if the pipe-loader should _not_ take ownership of it. With earlier commit we fixed pipe_loader_drm_probe_fd, and now with cover the final piece. Note that unlike the DRM case, here the caller _did_ forget to dup before using it ... most likely leading to all sorts of fun. Don't forget the close in the error path. Seems like the things are a bit leaky/asymmetrical with the semi-recent config work. But we can shave that yak another day ;-) Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-03 13:38:05 +01:00
Emil Velikov	6ccc435e7a	pipe-loader: move dup(fd) within pipe_loader_drm_probe_fd Currently pipe_loader_drm_probe_fd takes ownership of the fd given. To match that, pipe_loader_release closes it. Yet we have many instances which do not want the change of ownership, and thus duplicate the fd before passing it to the pipe-loader. Move the dup() within pipe-loader, explicitly document that and document all the cases through the codebase. A trivial git grep -2 pipe_loader_release makes things as obvious as it gets ;-) Cc: Leo Liu <leo.liu@amd.com> Cc: Thomas Hellstrom <thellstrom@vmware.com> Cc: Axel Davy <davyaxel0@gmail.com> Cc: Patrick Rudolph <siro@das-labor.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Axel Davy <davyaxel0@gmail.com> (for nine)	2018-10-03 13:38:05 +01:00
Emil Velikov	7b8d1b313c	st/nine: do not double-close the fd on teardown As the newly introduced comment says: The pipe loader takes ownership of the fd Thus, there's no need to close it again. Cc: Patrick Rudolph <siro@das-labor.org> Cc: Axel Davy <davyaxel0@gmail.com> Cc: mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Axel Davy <davyaxel0@gmail.com>	2018-10-03 13:38:05 +01:00
Emil Velikov	fa9df82f67	mesa: fold _glapi_check_multithread() back into _mesa_make_current With commit `c6c0f94714`, back in 2006 Brian removed the _glapi_check_multithread() call from core mesa - _mesa_make_current. It was done to remove fairly awkward #ifdef guard which caused subtle differences in core mesa. Since that guard is long gone, we can drop the duplication and reintroduce the call in core. Note that the function is was missing when using EGL + classic dri HW drivers. Yet on TLS builds it's a no-op, so we're safe. Any non TLS users - more or less anything !Linux (or even musl on Linux up-to semi-recently) may have experienced problems. v2: don't remove the call from swrast - move it to core (Eric) Cc: Eric Anholt <eric@anholt.net> Cc: Brian Paul <brianp@vmware.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-10-03 13:38:05 +01:00
Emil Velikov	d081ad2aa2	vl/dri3: do full teardown on screen_destroy Earlier commit added support for 'front_buffers', erroneously adding a return in vl_dri3_screen_destroy. Effectively leaking a lot of state. Fixes: `8d7ac0a4e4` ("vl/dri3: implement DRI3 BufferFromPixmap") Cc: Leo Liu <leo.liu@amd.com> Cc: Christian König <christian.koenig@amd.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-03 13:38:05 +01:00
Emil Velikov	1301674c39	st/dri: make swrast_no_present member of dri_screen Just like the dri2 options, this is better suited in the dri_screen struct. Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-03 13:38:05 +01:00
Emil Velikov	80b62e2d6d	st/dri: inline dri2_buffer.h within dri2.c The header was used only by dri2.c, containing a two-member struct and cast wrapper. Just inline it where it's used/needed. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-03 13:38:05 +01:00
Emil Velikov	89c2c386c0	st/xa: remove unused xa_screen::d[s]_depth_bits_last Unused since the initial import. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-03 13:38:05 +01:00
Emil Velikov	5ade4b10e2	mesa: use C99 initializer in get_gl_override() The overrides array contains entries indexed on the gl_api enum. Use a C99 initializer to make it a bit more obvious. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-10-03 13:38:05 +01:00
Gabriel Majeri	f0b987646a	anv: Ensure discreteQueuePriorities is at least 2 This is the minimum value according to the spec. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-10-03 07:57:37 +02:00
Timothy Arceri	2b5f42068d	r600: use build-id when available for disk cache Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-03 09:49:21 +10:00
Timothy Arceri	397f2603eb	nouveau: use build-id when available for disk cache Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-03 09:49:21 +10:00
Timothy Arceri	2169acbf34	radeonsi: use build-id when available for disk cache Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-03 09:49:21 +10:00
Timothy Arceri	83ea8dd99b	util: add disk_cache_get_function_identifier() This can be used as a drop in replacement for disk_cache_get_function_timestamp(). Here we use build-id to generate a driver-id rather than build timestamp if available. This should resolve issues such as distros using reproducable builds and flatpak not having real build timestamps. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-03 09:49:21 +10:00
Timothy Arceri	6a884014e4	util: rename timestamp param in disk_cache_create() Only some drivers use a timestamp here. Others use things such as build-id, or even a combination of build-ids from Mesa and LLVM. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-03 09:49:21 +10:00
Józef Kucia	e24a4e05c7	radeonsi: avoid sending GS_EMIT in shaders without outputs Fixes GPU hangs. Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107857 Signed-off-by: Józef Kucia <joseph.kucia@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-10-02 17:13:52 -04:00
Fritz Koenig	08f97407fb	i965: Replace checks for rb->Name with FlipY (v2) In the GL_MESA_framebuffer_flip_y implementation _mesa_is_winsys_fbo checks were replaced with FlipY checks. rb->Name is also used to determine if a buffer is winsys. v2: Fixes annotation [for emil] Fixes: `ab05dd183c` ("i965: implement GL_MESA_framebuffer_flip_y [v3]") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org>	2018-10-02 11:28:46 -07:00
Marek Olšák	2fd58d8eb2	radeonsi: initialize ac_gpu_info::name when using SI_FORCE_FAMILY so that it's not NULL when loading radeonsi and a GCN GPU is not present in the system.	2018-10-02 12:21:49 -04:00
Marek Olšák	0b062f0419	radeonsi: don't set the VS prolog key for the blit VS	2018-10-02 12:21:49 -04:00
Jason Ekstrand	58360ca09d	spirv: Move function call handling to vtn_cfg It makes way more sense for it to live there with the rest of function handling. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-10-02 10:24:56 -05:00
Jason Ekstrand	00f385e6d4	nir/from_ssa: Don't rewrite derefs destinations to registers We already call nir_rematerialize_derefs_in_use_blocks_impl prior to calling nir_lower_ssa_defs_to_regs_block so the assertion that all deref uses in the block should hold. This fixes the following CTS test when SPIR-V optimization recipe 1: dEQP-VK.glsl.struct.local.loop_nested_struct_array_vertex Fixes: `606eb56ab9` "intel/nir: Only lower load/store derefs" Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-10-02 10:24:56 -05:00
Jason Ekstrand	bfc89c668e	nir/cf: Remove phi sources if needed in nir_handle_add_jump If the block in which the jump is inserted is the predecessor of a phi then we need to remove phi sources otherwise the phi may end up with things improperly connected. This fixes the following CTS test when dEQP is run with SPIR-V optimization recipe 1: dEQP-VK.glsl.functions.control_flow.return_in_nested_loop_vertex Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-10-02 10:24:56 -05:00
Eric Engestrom	7b0752fb10	anv: suppress warning about unhandled image layout Let's just be explicit that VK_NV_shading_rate_image is not supported. Suggested-by: Jason Ekstrand <jason.ekstrand@intel.com> Fixes: `6ee1709170` "vulkan: Update the XML and headers to 1.1.86" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2018-10-02 15:09:29 +01:00
Rob Clark	ae78489d3e	freedreno/a6xx: hwbinning Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-02 10:08:18 -04:00
Rob Clark	8ff349e564	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-02 10:08:18 -04:00
Jason Ekstrand	7e7959fcb7	intel/fs: Fix a typo in need_matching_subreg_offset This fixes a bunch of Vulkan subgroup tests on little core platforms. Fixes: `4150920b95` "intel/fs: Add a helper for emitting scan operations" Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Tested-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-10-02 07:44:25 -05:00
Timothy Arceri	ea66bfda88	util: disable cache if we have no build-id and timestamp is zero Timestamp can be zero for example when Flatpak is used. In this case just disable the cache rather then segfaulting when incompatible cache items are loaded. V2: actually return false when mtime is 0. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-02 22:07:55 +10:00
Eric Engestrom	0bdf7b1d0f	include: sync eglext.h from Khronos Signed-off-by: Eric Engestrom <eric@engestrom.ch> Acked-by: Tapani Pälli <tapani.palli@intel.com>	2018-10-02 12:10:46 +01:00
Timothy Arceri	0e6cdfd561	radeonsi: add a workaround for bitfield_extract when count is 0 This ports the fix from `3d41757788`. Both LLVM 7 & 8 continue to have this problem. It fixes rendering issues in some menu and loading screens of Civ VI which can be seen in the trace from bug 104602. Note: This does not fix the black triangles on Vega for bug 104602. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104602 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107276	2018-10-02 08:39:51 +10:00
Jason Ekstrand	e4538b93f5	anv: Implement VK_KHR_driver_properties Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-01 13:21:12 -05:00
Jason Ekstrand	6ee1709170	vulkan: Update the XML and headers to 1.1.86 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-01 11:43:20 -05:00
Samuel Pitoiset	c2867e4c2a	radv: do not try to set DCC_CONTROL when image doesn't use DCC Unnecessary. While we are at it, remove the check for pre-VI because it's already checked earlier. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-01 12:13:12 +02:00
Samuel Pitoiset	f622ab889a	radv: add a sanity check for mutable formats and TC-compat HTILE If apps use the MUTABLE bit and the same formats as the image one in the list, we can still enable TC-compat HTILE. I don't think this happens often but given the fact that TC-compat HTILE allows a nice boost in some situations, it's worth checking. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-01 12:13:09 +02:00
Samuel Pitoiset	dc91c4d40a	radv: disable HTILE for very small depth surfaces Like we disable DCC/CMASK for small color surfaces as well. Serious Sam 2017 creates a 1x1 depth surface and I think it should be faster to do slow clears on the graphics queue instead of fast clears on compute, and eventually a depth expand if the surface isn't TC-compatible HTILE. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-01 10:16:33 +02:00
Samuel Pitoiset	6cfa321c39	radv: add potential missing fields for DB_EQAA Other drivers set these two as well, just apply the same rule. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-01 10:16:30 +02:00
Samuel Pitoiset	bd6df2f923	radv: disable complicated point clipping against user clip planes I don't think this is required by Vulkan too. Ported from RadeonSI (AMDVLK doesn't set it either). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-10-01 10:16:25 +02:00
Michel Dänzer	cb863de626	gallium/util: Clarify comment in util_init_thread_pinning As discussed in the review of the patch which added the comment: Nothing happens when a thread is created, because pthread_atfork doesn't affect creating threads. However, spawning a child process will likely crash. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-28 17:52:11 +02:00
Samuel Pitoiset	3fb4adae83	radv: do not sync CP DMA when copying buffers We already track if the DMA engine is busy/idle with a flag, and we emit a packet that waits for all CP DMA operations to be complete. This is done at end of command buffer because the kernel doesn't wait for them, and also when emitting barriers, so it should be safe. This improves small copies for both aligned and unaligned sizes. Aligned sizes: BEFORE: 1 KB: 59.840000 ms 2 KB: 71.200000 ms AFTER: 1 KB: 31.200000 ms 2 KB: 31.040000 ms Unaligned sizes: BEFORE: 2 KB: 68.3200 ms 3 KB: 79.3600 ms 5 KB: 76.6400 ms 9 KB: 90.8800 ms 17 KB: 116.0000 ms AFTER: 2 KB: 31.0400 ms 3 KB: 32.0000 ms 5 KB: 30.8800 ms 9 KB: 30.5600 ms 17 KB: 29.6000 ms Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-28 09:08:52 +02:00
Samuel Pitoiset	621e70dd40	radv: adjust the CmdUpdateBuffer threshold for optimal performance According to my benchmark results, it appears that we should reduce the threshold to 1024. BEFORE: 1 KB: 68.656000 ms 2 KB: 118.368000 ms AFTER: 1 KB: 31.760000 ms 2 KB: 29.840000 ms Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-28 09:08:44 +02:00
Samuel Pitoiset	5d6a560a29	radv: do not use the availability bit for timestamp queries It's unnecessary because we can just check if the timestamp is to different to the default value when a pool is created or resetted. Instead of waiting for the availability bit to be 1, we have to emit a not equal WAIT_REG_MEM for checking if the timestamp is ready. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-09-28 09:08:03 +02:00
Kristian H. Kristensen	3e90505224	freedreno/a6xx: Build up draw dword0 outside visibilty if statement Pulling this logic out means we can share the logic and avoid a couple of temporary variables that helped make things clearer before. Note that in either vismode case, we always program vismode 0. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-09-27 16:08:52 -04:00
Kristian H. Kristensen	74a87cdaa6	freedreno/a6xx: Simplify draw_emit() branches a bit Now that we've copied the emit logic into each branch of the if (info->index_size) statement, we can simplify the logic a bit according to which case we're in. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-09-27 16:08:52 -04:00
Kristian H. Kristensen	2516073cb6	freedreno/a6xx: Copy OUT_RING() part into each branch of the index if Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-09-27 16:08:52 -04:00
Kristian H. Kristensen	c3d58d9ffc	freedreno/a6xx: Split fd6_draw_emit into direct and indirect paths This splits the two code paths into separate functions and moves the "if (info->indirect)" test into draw_impl(). Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-09-27 16:08:52 -04:00
Kristian H. Kristensen	adcd83fb22	freedreno/a6xx: Inline fd6_draw() Simplify the code a bit by inlining this helper. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-09-27 16:08:52 -04:00
Kristian H. Kristensen	fb1c6b89a2	freedreno/a6xx: Move emit_marker and wfi to draw_impl() This way the markers clearly bracket the draw call and isn't duplicated for both direct and indirect draw code. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-09-27 16:08:52 -04:00
Kristian H. Kristensen	0559050557	freedreno/a6xx: Move inline functions out of fd6_draw.h Only used in fd6_draw.c so put them there. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-09-27 16:08:52 -04:00
Hyunjun Ko	1a40faa864	freedreno: fix a typo in launch_grid	2018-09-27 16:06:19 -04:00
Hyunjun Ko	aef410f31e	freedreno/ir3: fix the param order of cmpxchg According to the following definition, int AtomicCompSwap(inout int mem, uint compare, uint data); the preceding one in atomic_comp_swap of NIR is compare and data is followed, while src0 for cmpxchg needs vec2(data, compare) So for ssbo/image deref comp_swap, that should be reversed. Fixes: dEQP-GLES31.functional.image_load_store..atomic.comp_swap	2018-09-27 16:05:49 -04:00
Rob Clark	49d22c2dfc	freedreno/a6xx: fix shaders w/ >= 24 regs Possibly these bits mean something else now. Blob always seems to use FOUR_QUADS, and changing to TWO_QUADS seems to cause different threads to overlap registers. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:49:14 -04:00
Rob Clark	6530fcc4a7	freedreno/a6xx: fix gl_FragCoord.w Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:45:44 -04:00
Rob Clark	919741b8d5	freedreno: handle invalidated buffers harder Do a better job of skipping mem2gmem/gmem2mem.. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:41:46 -04:00
Rob Clark	19e9d28646	freedreno/a6xx: fix constlen Fix a few bits of confusion, as with previous gen's constlen is aligned to 4, and value in bitfield is left-shifted by 2 (ie. divided by 4). But this is done by the CONSTLEN() accessor/builder fxn, so don't do it twice. Also HLSQ_FS_CNTL.CONSTLEN is not special. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:33:10 -04:00
Rob Clark	12de415ad1	freedreno: fix inorder rendering case Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:32:39 -04:00
Rob Clark	b65b6f7606	freedreno/a6xx: backface stencil state Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:31:56 -04:00
Rob Clark	93db15d300	freedreno/a6xx: fix gpu crash with separate-stencil Fixes a crash in (of all things) dEQP-GLES2.info.vendor with --deqp-surface-type=fbo.. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:31:34 -04:00
Rob Clark	a52ef80d24	freedreno/a6xx: fix MRT config Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:30:36 -04:00
Rob Clark	8930e83642	freedreno: fix potential hang when destroying batch batch_flush_reset_dependencies() expects to be called unlocked, and can call fd_batch_reference() which can try to aquire the screen lock again. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:29:45 -04:00
Rob Clark	ef6d15f8a8	freedreno: fix corrupted fb state In `c3d9f29b` we allowed ctx->batch to be null, and started tracking the current framebuffer state in fd_context. But the existing logic in fd_blitter_pipe_begin() would, if !ctx->batch, set null fb state to be restored after blit. Which broke the world of deqp (and probably other things) Fixes: `c3d9f29b78` freedreno: allocate ctx's batch on demand Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:27:38 -04:00
Rob Clark	5bb96bf73a	freedreno: simplify pctx->clear() This is defined to always clear the entire surface(s) specified, regardless of scissor state.. mesa/st will turn scissored clears into a draw. So rip about a bunch of unnecessary machinery. Also remove a comment that was obsolete since using u_blitter to turn clear into draw (for the cases where there isn't a hw blitter fast-path). Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:26:32 -04:00
Rob Clark	a7fa44cd33	freedreno: fix FD_MESA_DEBUG=flush The logic to force a flush every draw was short-circuited with newer kernels. Also it should apply to clears as well. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:25:49 -04:00
Rob Clark	83c5c026ee	freedreno: fix scissor state emit The effective scissor changes based on rasterizer->scissor flag, so we need to re-emit scissor state when rasterizer state changes. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:25:24 -04:00
Rob Clark	106f18258a	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:25:01 -04:00
Erik Faye-Lund	c3486cd8c9	st/mesa: do not call update_framebuffer_size with NULL pointer In st_renderbuffer_alloc_storage, we avoid allocating storage for zero-sized buffers, leading to this pointer being NULL. We already take care to avoid dereferencing these pointers for color-buffers, but not for depth/stencil-buffers. So let's thread a bit more carefully here. This avoids a crash while running Piglit's glx/glx-visuals-stencil test, both on virgl and r600g. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Guillaume Charifi <guillaume.charifi@sfr.fr> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-27 10:33:44 +02:00
Maxime	dd333c66bd	vulkan: Disable randr lease for libxcb < 1.13 Since the Randr lease code was added, compiling against libxcb 1.12 no longer works. CC: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108024 Fixes: `7ab1fffcd2` Tested-By: Maxime <berillions@gmail.com> Fixes: `7ab1fffcd2` "vulkan: Add EXT_acquire_xlib_display [v5]"	2018-09-27 16:31:42 +10:00
Bas Nieuwenhuizen	40585ddb48	radv: Remove garbage comment. Trivial.	2018-09-27 02:04:06 +02:00
Bas Nieuwenhuizen	0207ebcbf1	radv: Do not use multiple draws for multisample copies. Use sample rate shading instead, should give better locality. Makes Nier with 8x msaa on a Raven go 5 fps -> 7 fps in the menu. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-09-27 02:04:00 +02:00
Jordan Justen	ca1d3fc538	anv: If softpin is supported, use it with the hiz clear value bo Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2018-09-26 10:21:23 -07:00
Jordan Justen	2a97390552	anv: s/batch/value_bo/ on anv_device_init_hiz_clear_batch Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2018-09-26 10:21:23 -07:00
Dylan Baker	e9bd071f49	docs: update calendar, add news and link release notes for 18.1.9	2018-09-26 09:44:40 -07:00
Dylan Baker	d4bdcf5d22	docs: Add sha256 sums to 18.1.9	2018-09-26 09:41:53 -07:00
Dylan Baker	4769f49455	docs: Add 18.1.9 release notes	2018-09-26 09:40:56 -07:00
Jason Ekstrand	b3f477ef7a	intel/isl: Add a unit suffixes to some struct fields and variables I was about to make the claim to someone that every field in isl_surf is either an enum or has explicit units. Then I looked at isl_surf and discovered this claim was wrong. We should fix that. This commit does a few refactors: * Add _B suffixes to some struct fields * Add _B to some variables and parameters * Rename row_pitch_tiles -> row_pitch_tl Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2018-09-26 08:52:26 -05:00
Axel Davy	0d495bec25	radeonsi: NaN should pass kill_if Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=105333 Fixes: https://github.com/iXit/Mesa-3D/issues/314 For this application, NaN is passed to KILL_IF and is expected to pass. v2: Explain in the code why UGE is used. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> CC: <mesa-stable@lists.freedesktop.org>	2018-09-25 22:05:24 +02:00
Axel Davy	46814e771a	st/nine: Do not mark both ff vs and ps updated Previously if only ff vs or only ff ps was used, the constants for both were marked as updated, while only the constants of the used ff shader were updated. Now that NINE_STATE_FF_VS and NINE_STATE_FF_PS do not intersect anymore, we can correctly mark the correct set of constant as updated. Fixes: https://github.com/iXit/Mesa-3D/issues/319 Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	8e0526555d	st/nine: Split NINE_STATE_FF_OTHER NINE_STATE_FF_OTHER was mostly ff vs states. Rename it to NINE_STATE_FF_VS_OTHER and move common states with ps to NINE_STATE_FF_PS_CONSTS (renamed from NINE_STATE_FF_PSSTAGES). Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	5f7a41c33b	st/nine: Add dummy ff shader state Some states only affect the ff shader, not its constants. Currently we don't check anything and always recompute the ff shader key. However we do check for NINE_STATE_FF_OTHER and if set we reupload some constants. Thus for those states which had NINE_STATE_FF_OTHER set but didn't need it, replace by a dummy ff shader state (which is easier to understand for an external reader than just setting 0 and more future proof). Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	f6bf1d2db0	st/nine: Mark pointsize states as ff states The pointsize states were missing the ff NINE_STATE_FF_OTHER flag, and thus might miss state updates when using ff. Fixes some wine tests. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	89beea100f	st/nine: Minor refactor of a few NINE_STATE_* flags Rename NINE_STATE_FOG_SHADER, NINE_STATE_POINTSIZE_SHADER and NINE_STATE_PS1X_SHADER into NINE_STATE_VS_PARAMS_MISC and NINE_STATE_PS_PARAMS_MISC. The behaviour is unchanged, except one minor change: D3DRS_FOGTABLEMODE doesn't need to affect VS. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	7ae2509ce0	st/nine: Increase maximum number of temp registers With some test app I hit the limit. As we allocate on demand (up to the maximum), it is free to increase the limit. Signed-off-by: Axel Davy <davyaxel0@gmail.com> CC: <mesa-stable@lists.freedesktop.org>	2018-09-25 22:05:24 +02:00
Axel Davy	dc4b53e129	st/nine: Lock the entire buffer in some cases. Previously we had already found that for MANAGED buffers the buffer started dirty (which meant all writes out of bound before the first draw call using the buffer have to be taken into account). Possibly it is the same for the other types of buffers. For now always lock the entire buffer (starting from the offset) for these (except for DYNAMIC buffers, which might hurt performance too much). Fixes: https://github.com/iXit/Mesa-3D/issues/301 Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	0eeb583650	st/nine: Don't call SetCursor until a cursor is set The previous code was ignoring the input until a cursor is set inside d3d (with SetCursorProperties), as expected by wine tests. However it did still make a call to ID3DPresent_SetCursor, which would result into a SetCursor(NULL) call, thus hidding any cursor set outside d3d, which we shouldn't do. Add comment about not avoiding redundant ID3DPresent_SetCursor calls once a cursor has been set in d3d, as it has been tested to cause regressions. Fixes: https://github.com/iXit/Mesa-3D/issues/197 Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	dcfde02bb0	st/nine: Avoid redundant SetCursorPos calls For some applications SetCursorPosition is called when a cursor event is received. Our SetCursorPosition was always calling wine SetCursorPos which would trigger a cursor event. The infinite loop is avoided by not calling SetCursorPos when the position hasn't changed. Found thanks to wine tests. Fixes irresponsive GUI for some applications. Fixes: https://github.com/iXit/Mesa-3D/issues/173 Signed-off-by: Axel Davy <davyaxel0@gmail.com> CC: <mesa-stable@lists.freedesktop.org>	2018-09-25 22:05:24 +02:00
Axel Davy	112c770597	st/nine: Init cursor position at device creation This is only useful for software cursor, but at least now we won't start it at (0, 0). Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	62ea55ec8b	st/nine: Initialize manually cursor structure Initialize manually the cursor structure fields for more clarity on its content. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	110950318c	st/nine: Check if format is DS before retrieving flags d3d9_get_pipe_depth_format_bindings assumes the input format is a depth stencil format. Previously the user could hit this function with an invalid format. Protect the last non protected call with a depth_stencil_format check. Another solution is to have d3d9_get_pipe_depth_format_bindings support non depth stencil format, but we don't want the user to create depth buffers with d3d formats that can't be one, it's better to check if the format can be depth buffer with d3d. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	af60fbc0a4	st/nine: Remove clamping when mul_zero_wins Tests show the clamping can be removed when mul_zero_wins is supported. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	a0afa80889	st/nine: Implement predicated instructions Most of the work was already there, just not implemented. Fixes: https://github.com/iXit/Mesa-3D/issues/318 Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	e7e82bcdc9	st/nine: Fix aliased read in ff Fix aliasing of colorarg_b4 with colorarg_b5. Fixes: https://github.com/iXit/Mesa-3D/issues/302 Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	9fc6aa1bbe	st/nine: Fix ff assignment with aliasing "tex_stage[s][D3DTSS_COLORARG0] >> 4" could be a two bit number, thus colorarg_b4 was incorrectly set. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	8c35fb0280	st/nine: Clarify some ff assignments colorarg0, etc are 3 bits wide. Make the code more readable by adding an & 0x7 to further indicate we only remember the first 3 bits only. The 4th bit is always 0, and colorarg_b4, colorarg_b5, etc are used to store the 5th and 6th bits. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	59aaeeb730	st/nine: Print transform matrices in debug This is useful to see the matrices content in the log to debug. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	d9da0a1f6d	st/nine: Add ff key hash to help debug This is very useful to find in the log the ff shader shource of a given call. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	fcbb00a502	st/nine: Avoid RefToBind calls in ff When using csmt, ff shader creation happens on the csmt thread. Creating the shaders, then calling RefToBind causes the device ref to be increased then decreased. However the device dtor assumes than no work pending on the csmt thread could increase the device ref, leading to hang. The issue is avoided by creating the shaders with a bind count directly. Fixes: https://github.com/iXit/Mesa-3D/issues/295 Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	e83b15cba0	st/nine: Add new helper for object creation with bind Add a new helper to create objects starting with a bind count instead of a ref count. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	fd86ce7c14	st/nine: Add parameter to start with bind Add a parameter to start new object with a bind instead of a refcount. Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	a9bf82ecf4	st/nine: Use perspective correction for ps depth fog Emulate perspective interpolation of depth for programmable ps fog ff ps fog uses position z, or 1/w depending on the ff projection matrix set. This is according to public documents found describing the algorithm and tests we made. In the case of programmable ps, we used position's z, which was sufficient to pass wine tests (which test shaders don't set w). Issue https://github.com/iXit/Mesa-3D/issues/315 showed that this calculation was wrong. Using perspective interpolation on z, that is using z * 1/w seems to satisfy both this application and wine tests. Fixes: https://github.com/iXit/Mesa-3D/issues/315 Signed-off-by: Axel Davy <davyaxel0@gmail.com>	2018-09-25 22:05:24 +02:00
Axel Davy	7ee5e5e239	st/nine: Clamp RCP when 0inf!=0 Tests done on several devices of all 3 vendors and of different generations showed that there are several ways of handling infs and NaN for d3d9. Tests showed Intel on windows does always clamp RCP, RSQ and LOG (thus preventing inf/nan generation), for all shader versions (some vendor behaviours vary with shader versions). Doing this in nine avoids 0inf issues for drivers that can't generate 0*inf=0 (which is controled by TGSI's MUL_ZERO_WINS). For now clamp for all drivers. An ulterior optimization would be to avoid clamping for drivers with MUL_ZERO_WINS for the specific shader versions where NV or AMD don't clamp. LOG and RSQ being already clamped, this patch only clamps RCP. Fixes: https://github.com/iXit/Mesa-3D/issues/316 Signed-off-by: Axel Davy <davyaxel0@gmail.com> CC: <mesa-stable@lists.freedesktop.org>	2018-09-25 22:05:23 +02:00
Jan Vesely	1f3fe4aaeb	.travis: Drop note about Clover builds being slow SWR takes 17+ minutes to build. Clover builds take ~6-7 minutes. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-09-25 14:08:06 -04:00
Jan Vesely	cb1b109733	.travis: Add LLVM-7 Clover build Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-09-25 14:08:06 -04:00
Caio Marcelo de Oliveira Filho	3cf07361ac	intel/compiler: Export TCS passthrough creation Move create_passthrough_tcs() from i965 so can be used in other contexts. Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-25 09:16:31 -07:00
Gert Wollny	47a6f98e15	mesa/st: In the precense of integer buffers enable per buffer blending Since blending will be disabled later for integer formats we have to consider that in the case of a mixed set of integer/non-integer format buffers blending must be handled on a per buffer basis. Fixes on r600: dEQP-GLES31.functional.draw_buffers_indexed.random. max_required_draw_buffers.13 Fixes: `8fb966688b` st/mesa: Disable blending for integer formats. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-25 15:54:38 +02:00
Eric Engestrom	97ae5a858d	meson+autotools: get rid of spammy GCC warning -Wformat-truncation That warning fires every time a string function takes an argument that could possibly be longer than its max output, which triggers all over the place, especially when working with file paths ("what if every file path is MAX_PATH long?" is what GCC is saying, which is really annoying when we know that "/dev/dri/cardN" is not gonna be 4096 char long and it's safe to store it in a 32-char array). Anyway, we either add a ton of dead code all over the place to make GCC happy, or we get rid of its spam. I chose the latter. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2018-09-25 11:40:08 +01:00
Eric Engestrom	1a37a80bf6	meson: make it trivial to add other -Wno-foo CFLAGS Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-09-25 11:39:56 +01:00
Eric Engestrom	f5b41f9121	gallivm: ensure string is null-terminated instead of assert()ing Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-09-25 11:39:30 +01:00
Topi Pohjolainen	1cc17fb731	intel/compiler/icl: Use barrier id bits 24:30 instead of 24:27,31 Fixes gpu hangs with Carchase and Manhattan. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-09-25 09:59:59 +03:00
Andres Rodriguez	ec1fcf92ae	radv: only emit ZPASS_DONE for timestamp queries on gfx queues A ZPASS_DONE packet doesn't make sense for the compute queue. It will result in a gpu hang. This change resolves a gpu hang for SteamVR+Vega. Cc: mesa-stable@lists.freedesktop.org Fixes: `1f616a840e` "radv: emit a dummy ..." Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-09-25 02:30:34 -04:00
Timothy Arceri	72e4287e8f	radv: make use of nir_lower_load_const_to_scalar() This allows NIR to CSE more operations. LLVM does this also so the impact is limited, however doing this in NIR allows other opts to make progress. For example in radeonsi more loops are unrolled in Civilization Beyond Earth. The actual pipeline-db stats are not overwhelming but even in the negatively affected shaders the NIR is clearly better. It just happens that the code shuffling and in some cases calls to max rather than a flt result in the final output from LLVM not giving as good numbers. However this is an incremental opt that further passes build off so the change should be made IMO. Totals from affected shaders: SGPRS: 20192 -> 20184 (-0.04 %) VGPRS: 19516 -> 19524 (0.04 %) Spilled SGPRs: 437 -> 444 (1.60 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 1527444 -> 1522276 (-0.34 %) bytes LDS: 6 -> 6 (0.00 %) blocks Max Waves: 1018 -> 1016 (-0.20 %) Wait states: 0 -> 0 (0.00 %) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-25 09:31:22 +10:00
Dylan Baker	f03a160592	meson: de-duplicate LLVM check By adding `_llvm == 'true'` to the required argument we can check the 'auto' and 'true' case in one path. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-24 13:02:07 -07:00
Eric Engestrom	f2519e3493	vulkan/wsi/display: wsi_display_select_crtc() doesn' need to modify the connector Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-24 17:38:11 +01:00
Eric Engestrom	bde3102c0d	vulkan/wsi/display: check if wsi_swapchain_init() succeeded Fixes: `da997ebec9` "vulkan: Add KHR_display extension using DRM [v10]" Cc: Keith Packard <keithp@keithp.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-24 17:37:43 +01:00
Leo Liu	3e7b5e5db2	radeon/uvd: use bitstream coded number for symbols of Huffman tables Signed-off-by: Leo Liu <leo.liu@amd.com> Fixes: 130d1f456(radeon/uvd: reconstruct MJPEG bitstream) Cc: "18.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com>	2018-09-24 09:12:49 -04:00
Rhys Perry	6ca1402c11	nv50/ir: fix link-time build failure Seems this fixes linking problems that occur in some situations. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-09-23 18:20:08 +01:00
Rhys Perry	b473fcc9a3	nvc0: fix bindless multisampled images on Maxwell+ NVC0_CB_AUX_BINDLESS_INFO isn't written to on Maxwell+ and it's too small anyway. With these changes, TXQ is used to determine the number of samples and the coordinate adjustment information looked up in a small array in the driver constant buffer. v2: rework to use TXQ and a small array instead of a larger array with an entry for each texture v3: get rid of the small array and calculate the adjustments in the shader Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `c2ae9b4052` ('nvc0: implement multisampled images on Maxwell+') Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-09-22 20:13:17 +01:00
Eric Engestrom	ed797f6597	docs: fix couple typos/outdated info `git-branch` doesn't exist, and mesa3d-dev hasn't been used in a great many years :) Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-09-22 17:23:18 +01:00
Eric Engestrom	ae2694efe0	docs: update repo URLs after GitLab move I also updated the developer instructions; presumably someone who's been given commit rights already knows how to clone a repository :) A more useful thing is to show how to update the pushurl, and how to use access tokens to push over HTTPS (especially for us at Intel, where non-http traffic is a pain). Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-09-22 17:23:18 +01:00
Stuart Young	c95dd966c4	docs: Update FAQ with respect to s3tc support It's just over 10 months since 17.3.0 was released with s3tc support enabled. Probably a good idea to update the FAQ page. v2: Incorporate feedback from Adam Jackson <ajax@redhat.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Fixes: `04396a134f` ("mesa: Import libtxc_dxtn sources") Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-09-22 17:23:18 +01:00
Rhys Perry	f580a895b1	nvc0: warn about changing NVC0_CB_AUX_MP_INFO and NVC0_CB_AUX_DRAW_INFO Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-09-22 16:50:39 +01:00
Rhys Perry	01fa76b707	nvc0: Update counter reading shaders to new NVC0_CB_AUX_MP_INFO Fixes: `66ca7e400b` ('nvc0: add support for programmable sample locations') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-09-22 16:50:22 +01:00
Eric Anholt	cd667edecc	vc4: Remove dead i == 0 code from the cos() implementation. The loop starts at 1.	2018-09-21 17:16:43 -07:00
Eric Anholt	10d5d2d527	vc4: Fix sin(0.0) and cos(0.0) accuracy to fix SDL rendering rotation. SDL has some shaders that compute sin(angle) and cos(angle) for a rotation matrix in the VS, and angle is usually 0.0. Our previous implementation had quite a bit of error around 0.0, causing single-pixel rotations at typical window sizes. SDL2 has changed as of August 28th (commit 12156:e5a666405750) to not need sin/cos in the VS, but we should still fix this for existing implementations or similar patterns that other programs may have. glsl-cos goes from 32 instructions to 36, but 9 uniforms to 7. glsl-sin goes from 32 instructions to 34, but 8 uniforms to 7. This seems like a fine impact to have for the bugfix. Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Fixes: https://github.com/anholt/mesa/issues/110	2018-09-21 17:16:43 -07:00
Anuj Phogat	a0baedb638	intel/icl: Fix URB size for different SKUs Different ICL SKUs have different URB sizes. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-09-21 14:40:04 -07:00
Anuj Phogat	fa1ff71a0f	i965/icl: Set Enabled Texel Offset Precision Fix bit h/w specification requires this bit to be always set. V2: Fix bit mask (Chris Wilson) Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-09-21 14:40:04 -07:00
Anuj Phogat	5eb173304b	anv/icl: Set Enabled Texel Offset Precision Fix bit h/w specification requires this bit to be always set. Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-09-21 14:40:04 -07:00
Alex Deucher	afb7c6b301	pci_ids: add new polaris pci id Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> Cc: mesa-stable@lists.freedesktop.org	2018-09-21 14:33:13 -05:00
Marek Olšák	f0cd7dbcd7	glsl_to_tgsi: invert gl_SamplePosition.y for the default framebuffer Fixes dEQP-GLES31.functional.shaders.sample_variables.sample_pos.correctness.default_framebuffer with --deqp-gl-config-name=rgba8888d24s8ms4 Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org>	2018-09-21 13:39:00 -04:00
Caio Marcelo de Oliveira Filho	b29ec31854	util: Add macro to get number of elements in dynarray Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2018-09-21 10:12:51 -07:00
Dylan Baker	be56f8a788	docs/meson: Add note about llvm-config$version and llvm-config-$version v2: - fix typo These are how FreeBSD and Debian handle multiple versions of LLVM installed at the same time, respectively. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-21 10:03:15 -07:00
Dylan Baker	e0829f9c1a	docs/meson: Update notes on using CFLAGS and -Dc_args v2: - Use ${} to denote variables instead of just $ - fix spelling error bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107313 Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-21 10:03:15 -07:00
Dylan Baker	1da60667b5	docs: update meson docs to reflect the current status v2: - minor grammar changes - fix typo Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-21 10:03:15 -07:00
Dylan Baker	509ea4649a	meson: Don't force libva to required from auto We already correctly handle va being auto, but we force it to being true, which is bad. Fixes `94cf397092` ("meson: Fix auto option for va") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-21 10:03:15 -07:00
Dylan Baker	5dcb77e491	meson: Don't compile pipe loader with dri support when not using dri Corrects building glx as gallium-xlib without any dri targets. v2: - fix ugly formatting Fixes: `66c94b9313` ("meson: build gallium winsys for dri, null, and wrapper") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-21 10:03:15 -07:00
Samuel Pitoiset	fe3f13cc5a	radv: use the resolve compute path if dest uses multiple layers The hardware path doesn't support resolving layers, for both source and destination images. This fixes a reflection issue when MSAA is enabled which affects GTA V and probably DIRT3. CC: <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107786 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Gregor Münch <gr.muench_at_gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-21 16:35:59 +02:00
Jason Ekstrand	ab80889e92	anv,radv: Implement vkAcquireNextImage2 This was added as part of 1.1 but it's very hard to track exactly what extension added it. In any case, we should implement it. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Dave Airlie <Airlied@redhat.com>	2018-09-21 07:02:35 -05:00
Juan A. Suarez Romero	24bacaddef	docs: update calendar, add news and link release notes to 18.2.1 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-09-21 13:09:21 +02:00
Juan A. Suarez Romero	eefc77e691	docs: add sha256 checksums for 18.2.1 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `686eab6642`)	2018-09-21 13:06:14 +02:00
Juan A. Suarez Romero	17fbb1ef74	docs: add release notes for 18.2.1 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `3c8c851fe4`)	2018-09-21 13:06:12 +02:00
Samuel Pitoiset	674fcfaecc	radv: only enable shaderInt16 on GFX9+ and LLVM7+ The throughput is similar to 32-bit integers on GFX8 and AMDVLK does not expose 16-bit integers on pre Vega as well. On GFX9+, only LLVM 7+ has support. This fixes a bunch of CTS crashes on GFX9/LLVM 6. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-21 10:56:17 +02:00
Marek Olšák	945e9cdb2b	docs/features: add EXT_direct_state_access features Acked-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-09-21 03:01:58 -04:00
Bas Nieuwenhuizen	0a77e70d10	radv: Fix driver UUID SHA1 init. Was missing the init, found by Emil. Fixes: `d17443a459` "radv: Use build ID if available for cache UUID." CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-09-20 23:38:38 +02:00
Charmaine Lee	64731e7c5e	svga: fix uninitialized fields in DefineDepthStencilView/DefineStreamOutput This patch fixes uninitialized fields in DefineDepthStencilView and DefineStreamOutput commands that are not relevant in SM4 device. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-20 13:20:10 -06:00
Brian Paul	7f4e6f4c97	r300g: add PIPE_SHADER_CAP_SCALAR_ISA switch case to silence warning Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-09-20 13:20:10 -06:00
Brian Paul	198c50f487	st/mesa: silenced unhanded enum warning in st_glsl_to_tgsi.cpp Add ir_intrinsic_begin_fragment_shader_ordering switch case to silence warning Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-09-20 13:20:10 -06:00
Brian Paul	35ea66a68e	mesa: use GLsizeiptrARB, GLintptrARB in bufferobj.c The function pointer declarations in dd.h for the BufferData() and BufferSubData() use the ARB-suffixed datatypes. This patch changes the buffer_data_fallback() and buffer_sub_data_fallback() functions to use those datatypes too. This fixes a build warning when building 32-bit libraries. Evidently, GLsizeiptrARB and GLsizeiptr are defined differently in that situation. All all implementations of these driver hooks use the ARB-suffixed types. Reviewed-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-09-20 13:20:10 -06:00
Neha Bhende	708d34d41a	svga: Enable Opengl 3.3 compatibility profile With this patch, svga driver will start advertising OpenGL 3.3 compatibility profile. Tested with some mesa demos, piglit and glretrace. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-20 13:20:10 -06:00
Neha Bhende	ede805dd19	svga: Apply texcoord scale factors only if there is sampler view We need to convert unnormalized texcoords to normalized texcoords when we are sampling from texture. We don't need this conversion if there is no sampler view. Tested with piglit, glretrace Fixes vmware bug 2101970 Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-20 13:20:10 -06:00
Charmaine Lee	1dcf377a76	svga: fix texture array layer index in transfer map In gallium, the layer index of a texture array to be mapped is specified in the z component, whereas in svga device, the index is specified in a separate argument. Currently in svga_texture_transfer_map(), we explicitly modify the z value in the base transfer map to 0 so the layer offset will not be applied twice, but this causes problem when state tracker later refers to the base transfer map and expects the slice index to be specified in z (commit `463b0ea1f6`). To fix the problem, this patch makes a local copy of the box in svga_transfer and modifies the z value in this copy instead. Fixes spec@khr_texture_compression-astc piglit test crashes. Fixes regression in the dma path with commit 1fdd3dd94a. Tested with mtt glretrace, piglit on Windows VM and Linux VM. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-20 13:20:10 -06:00
Dylan Baker	18a6e426f3	Revert "utils/u_math: break dependency on gallium/utils" This reverts commit `0abce6d770`. Which broke the windows build.	2018-09-20 10:36:33 -07:00
Caio Marcelo de Oliveira Filho	2567ad28bb	i965: remove outdated comment about TCS passthrough Since commit `75881bed9e` "i965: Rework the TCS passthrough shader to use NIR." the created nir_shader is not dummy, and it is compiled by the backend like the others. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-20 09:58:55 -07:00
Christoph Haag	b01834b56c	meson: add option to statically link llvm Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-09-20 06:08:50 -07:00
Dylan Baker	0abce6d770	utils/u_math: break dependency on gallium/utils Currently u_math needs gallium utils for cpu detection. Most of what u_math uses out of u_cpu_detection is duplicated in src/mesa/x86 (surprise!), so I've just reworked it as much as possible to use the x86/common_x86_features.h macros instead of the gallium ones. The mesa implementation is a header only approach, with no external dependencies. There is one small function that was copied over, as promoting u_cpu_detection is itself a fairly hefty undertaking, as it depends on u_debug, and this fixes the bug for now. bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107870 Tested-by: Vinson Lee <vlee@freedesktop.org>	2018-09-20 05:52:23 -07:00
Emil Velikov	b8b3517a49	egl/android: rework device probing Unlike the other platforms, here we aim do guess if the device that we somewhat arbitrarily picked, is supported or not. In particular: when a vendor is _not_ requested we loop through all devices, picking the first one which can create a DRI screen. When a vendor is requested - we use that and do _not_ fall-back to any other device. The former seems a bit fiddly, but considering EGL_EXT_explicit_device and EGL_MESA_query_renderer are MIA, this is the best we can do for the moment. With those (proposed) extensions userspace will be able to create a separate EGL display for each device, query device details and make the conscious decision which one to use. v2: - update droid_open_device_drm_gralloc() - set the dri2_dpy->fd before using it - return a EGLBoolean for droid_{probe,open}_device* - do not warn on droid_load_driver failure (Tomasz) - plug mem leak on dri2_create_screen failure (Tomasz) - fixup function name typo (Tomasz, Rob) v3: - add forward declaration for droid_load_driver() Fixes the HAVE_DRM_GRALLOC build (Mauro) - split dup() assignment and check in separate lines (Tomasz, Eric) - make droid_load_driver() static (Tomasz) - drop unused prop_set variable (Tomasz) v4: - rebase - fwd declarationi should be for droid_probe_device() Cc: Robert Foss <robert.foss@collabora.com> Cc: Tomasz Figa <tfiga@chromium.org> Cc: Mauro Rossi <issor.oruam@gmail.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Tomasz Figa <tfiga@chromium.org> Tested-by: Tomasz Figa <tfiga@chromium.org> Tested-by: Tapani Pälli <tapani.palli@intel.com>	2018-09-20 10:15:38 +01:00
Danylo Piliaiev	18be7403a1	glsl: Add an assert when cloning ir_dereference_record with invalid field Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-09-20 08:30:11 +10:00
Danylo Piliaiev	6f3c7374b1	glsl: Avoid propagating incompatible type of initializer do_assignment validated assigment but when rhs type was not compatible it proceeded without issues and returned error_emitted = false. On the other hand process_initializer expected do_assignment to always return compatible type and never fail. As a result when variable was initialized with incompatible type the type of variable changed to the incompatible one. This manifested in unnecessary error messages and in one case in crash. Example GLSL: vec4 tmp = vec2(0.0); tmp.z -= 1.0; Past error messages: initializer of type vec2 cannot be assigned to variable of type vec4 invalid swizzle / mask `z' type mismatch operands to arithmetic operators must be numeric After this patch: initializer of type vec2 cannot be assigned to variable of type vec4 In the other case when we initialize variable with incompatible struct, accessing variable's field leaded to a crash. Example: uniform struct {float field;} data; ... vec4 tmp = data; tmp.x -= 1.0; After the patch there is only error line without a crash: initializer of type #anon_struct cannot be assigned to variable of type vec4 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107547	2018-09-20 08:30:11 +10:00
Michal Srb	194bf0a2e0	st/dri: don't set queryDmaBufFormats/queryDmaBufModifiers if the driver does not implement it This is equivalent to commit `a65db0ad1c`, but for dri_kms_init_screen. Without this gbm_dri_is_format_supported always returns false. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104926 Fixes: `e14fe41e0b` ("st/dri: implement createImageFromRenderbuffer(2)") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Tested-by: Adam Williamson <adamwill@fedoraproject.org>	2018-09-19 15:20:04 -04:00
Jason Ekstrand	c811af767e	anv/so_memcpy: Don't consider src/dst_offset when computing block size The only thing that matters is the size since we never specify any offsets in terms of blocks. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-19 09:38:04 -05:00
Jakob Bornecrantz	09171705d5	Revert "mesa: only update framebuffer-state for clears" This reverts commit `fb86365148`.	2018-09-19 15:21:26 +01:00
Samuel Pitoiset	121f226471	radv: use a 64-bit unsigned integer when allocating a descriptor pool pool->size is a 64-bit unsigned integer too. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-19 13:36:12 +02:00
Samuel Pitoiset	35656823b9	radv: enable VK_SUBGROUP_FEATURE_ARITHMETIC_BIT All CTS pass on Polaris/Vega with LLVM 6, 7 and master, so I think it's safe to enable the feature. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-19 13:36:10 +02:00
Samuel Pitoiset	febdc13a6c	radv: do not support blitting surfaces with depth and stencil Fixes: dEQP-VK.api.copy_and_blit.core.blit_image.all_formats.depth_stencil.d32_sfloat_s8_uint_d32_sfloat_s8_uint.optimal_optimal_nearest And all friends that try to blit a surface with different depth and stencil formats. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-19 13:36:07 +02:00
Erik Faye-Lund	fb86365148	mesa: only update framebuffer-state for clears If we update the program-state etc, we risk compiling needless shaders, which can cost quite a bit of performance. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-19 11:52:53 +02:00
Juan A. Suarez Romero	0c82e3603e	nir: add initializer data to fix MSVC compile error CC: Jason Ekstrand <jason@jlekstrand.net> Fixes: 82799a5d1b8 ("nir: Add a small pass to rematerialize derefs per-block") Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-09-19 11:46:44 +02:00
Jason Ekstrand	976046a8d8	nir: Add some asserts that we don't put derefs in phis The lcssa and phis_to_regs passes are used by various NIR optimizations that modify the CFG. Putting a couple of asserts will help ensure that we don't accidentally put derefs in phis as part of an optimization pass. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-09-19 02:00:49 -05:00
Jason Ekstrand	864c780566	nir/opt_if: Re-materialize derefs in use blocks before peeling loops Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107879 Cc: "18.2" <mesa-stable@lists.freedesktop.org>	2018-09-19 02:00:49 -05:00
Jason Ekstrand	0796c3934e	nir/loop_unroll: Re-materialize derefs in use blocks before unrolling When we're about to re-arrange a bunch of blocks, it's a good idea to make sure that we don't have deref uses crossing block boundaries. Otherwise we may end up with a deref going through a phi and that would be bad. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Cc: "18.2" <mesa-stable@lists.freedesktop.org>	2018-09-19 01:59:40 -05:00
Jason Ekstrand	7d1d1208c2	nir: Add a small pass to rematerialize derefs per-block This pass re-materializes deref instructions on a per-block basis to ensure that every use of a deref occurs in the same block as the instruction which uses it. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Cc: "18.2" <mesa-stable@lists.freedesktop.org>	2018-09-19 01:59:40 -05:00
Kenneth Feng	4490fce166	amd: Add Picasso device id No changes here compared to Raven. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Huang Rui <ray.huang@amd.com> Signed-off-by: Kenneth Feng <kenneth.feng@amd.com> Signed-off-by: Huang Rui <ray.huang@amd.com> Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org>	2018-09-18 18:05:17 -04:00
Bas Nieuwenhuizen	95bb7d82ca	Revert "radv: fix descriptor pool allocation size" This reverts commit `90819abb56`. This logic was wrong, the original code is correct. The direct impact is that we allocate up to approximately a squared amount of memory compared to what we should allocate. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-09-18 22:51:42 +02:00
Samuel Pitoiset	c9dbe52f84	radv: implement VK_EXT_conservative_rasterization Only supported by GFX9+. The conservativeraster Sascha demo seems to work as expected. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-18 13:28:01 +02:00
Samuel Pitoiset	450a325858	radv: do not re-create the sampler for every blits in CmdBlitImage() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-18 13:27:59 +02:00
Samuel Pitoiset	3871dd7a92	radv: allow to force anisotropy via RADV_TEX_ANISO Ported from RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-18 13:27:58 +02:00
Timothy Arceri	b54a2311a9	mesa: enable EXT_framebuffer_object in core profile Since user defined names are not allowed in core profile we remove the allow_user_names bool and just check if we have a core profile like all other buffer/texture object handling code does. This extension is required by "Wolfenstein: The Old Blood" and is exposed in core in the Nvidia binary driver. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:58:24 +10:00
Timothy Arceri	02843ed768	mesa: move legacy dri config option texture_depth Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:43:05 +10:00
Timothy Arceri	f958ea6eff	mesa: move legacy dri config option fthrottle_mode Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:43:05 +10:00
Timothy Arceri	4b1a81ef9d	mesa: move legacy dri config option def_max_anisotropy Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:43:05 +10:00
Timothy Arceri	6164d59bcc	mesa: move legacy dri config option no_neg_lod_bias Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:43:05 +10:00
Timothy Arceri	6d1890fa07	mesa: move legacy dri config option round_mode Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:43:05 +10:00
Timothy Arceri	3a1d09fd55	mesa: remove unused dri option float_depth This seems to have only been used by DRI1 drivers which were removed with `e4344161bd`. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:43:05 +10:00
Timothy Arceri	91e76ce493	mesa: move legacy dri config option dither_mode Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:43:05 +10:00
Timothy Arceri	2d7dc9591d	mesa: move legacy dri config option color_reduction Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:43:05 +10:00
Timothy Arceri	408d41a413	mesa: move legacy TCL dri config options Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:43:05 +10:00
Timothy Arceri	024abd3534	util: use force_compat_profile for Wolfenstein The Old Blood This game is looking for some odd extension after creating a core context such as ARB_vertex_program and EXT_framebuffer_object. Rather then enabling these in core this forces the game to use compat. This allows the game to run and seems to work without issues. All other id tech games/engines use a compat profile. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:34:54 +10:00
Timothy Arceri	64ec50d52f	mesa/st: add force_compat_profile option to driconfig Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-18 19:34:54 +10:00
Timothy Arceri	7a992fcfa0	Revert "radeonsi: avoid syncing the driver thread in si_fence_finish" This reverts commit `bc65dcab3b`. This was manually reverted. Reverting stops the menu hanging in some id tech games such as RAGE and Wolfenstein The New Order. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107891	2018-09-18 19:21:32 +10:00
Eric Anholt	4e1af6808c	v3d: Switch from FLUSH_ALL_STATE to FLUSH for ending our bin CLs. The HW for FLUSH_ALL_STATE isn't validated, since the closed driver only uses FLUSH. Now that we don't have any new state at the end of our bin CLs, follow their lead.	2018-09-17 16:35:45 -07:00
Eric Anholt	0b8007b523	v3d: Stop clearing the OQ state at the end of the job. Ever since we added OQ support, we've been clearing OQ state at the start of the job anyway. We're intentionally breaking old-and-new-driver-mix systems, because we need to stop using the unvalidated FLUSH_ALL_STATE.	2018-09-17 16:35:45 -07:00
Eric Anholt	350cb79045	v3d: Always emit a TF disable at the start of drawing on V3D 4.x. The HW's FLUSH_ALL_STATE is not validated, so we probably shouldn't use it, meaning that we need to reset state at the start. By doing this, we also make ourselves more resilient to another client leaving the TF state enabled at the end of their batch (as we now do, ourselves). However, we still need to emit a single TF disable at the end of the frame, for SWVC5-718.	2018-09-17 16:35:45 -07:00
Dylan Baker	7f08bcb73f	build: Don't overlink gallium xlib target Currently gallium's xlib target will fail to link due to multiple definitions of all the symbols in libmesautil, this only shows up in autotools, and not in meson due to differences in the way that meson and autotools handle linking static archives into static archives. Autotools uses -Wl,--whole-archive implicitly, meson requires this behavior to be opted-into. The solution is just to remove libmesautils from the libgl-xlib target, since it will get all of those symbols form libmesagallium. I've dropped the link from meson as well, it doesn't seem to hurt anything and should make linking just a little faster. Fixes: `8396043f30` ("Replace uses of _mesa_bitcount with util_bitcount") bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107923 Tested-by: Brian Paul <brianp@vmware.com> Tested-by: Vinson Lee <vlee@freedesktop.org> Cc: Sergii Romantsov<sergii.romantsov@globallogic.com>	2018-09-17 13:21:01 -07:00
Dylan Baker	3acc18fcf7	move pthread_setaffinity_np check to the build system Rather than trying to encode all of the rules in a header, lets just put them in the build system where they belong. This fixes the build on FreeBSD, which does have pthraed_setaffinity_np, but it's in a pthread_np.h, not behind _GNU_SOURCE. FreeBSD also implements cpu_set slightly differently, so additional changes would be required to get it working right there anyway. v2: - fix #define in autotools Fixes: `9f1bbbdbbd` ("util: try to fix the Android and MacOS build") Cc: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-17 13:16:46 -07:00
Fritz Koenig	60d0c0d062	mesa: FramebufferParameteri parameter checking Missing break; causes parameter checking to never pass GL_FRAMEBUFFER_FLIP_Y_MESA parameters. Fixes: `318c265160` ("mesa: GL_MESA_framebuffer_flip_y extension [v4]") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-17 11:48:00 -07:00
Fritz Koenig	ba6cc32cf9	mesa: Additional FlipY applications Instances where direction was determined based on winsys or user fbo and should be determined based on FlipY. Key STATE_FB_WPOS_Y_TRANSFORM for of FlipY instead of _mesa_is_user_fbo. This corrects gl_FragCoord usage when applying GL_MESA_framebuffer_flip_y. Fixes: `ab05dd183c` ("i965: implement GL_MESA_framebuffer_flip_y [v3]") Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-17 11:48:00 -07:00
Bas Nieuwenhuizen	d17443a459	radv: Use build ID if available for cache UUID. To get an useful UUID for systems that have a non-useful mtime for the binaries. I started using SHA1 to ensure we get reasonable mixing in the various possibilities and the various build id lengths. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-09-17 20:19:52 +02:00
Samuel Pitoiset	08103c5f65	radv: enable shaderInt16 capability Not sure if this is all wired up. CTS does pass and the Tangrams demo works fine on Vega. There are corruption issues on Polaris but not sure if that related to 16-bit support. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-17 15:18:39 +02:00
Samuel Pitoiset	cd76ce0078	ac: add 16-bit support to ac_build_bitfield_reverse() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-17 15:18:37 +02:00
Samuel Pitoiset	fc398f4d67	ac: add 16-bit support to ac_build_bit_count() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-17 15:18:34 +02:00
Samuel Pitoiset	94dd08eb7c	ac: add 16-bit support to ac_find_lsb() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-17 15:18:32 +02:00
Samuel Pitoiset	5a6c8ca3e8	ac: add 16-bit support to ac_build_umsb() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-17 15:18:30 +02:00
Samuel Pitoiset	3e7f3e2cd1	ac: add 16-bit support to ac_build_isign() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-17 15:18:28 +02:00
Samuel Pitoiset	cfd6314cfe	ac: add 16-bit constant values for zero and one Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-17 15:18:26 +02:00
Samuel Pitoiset	074e29183c	ac: add ac_build_bifield_reverse() helper Are we missing 64-bit support? Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-17 15:18:23 +02:00
Samuel Pitoiset	371c35e5bb	ac: add ac_build_bit_count() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-17 15:18:20 +02:00
Samuel Pitoiset	aec9151464	radv: fix use of unreachable() in the meta blit path Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-09-17 11:29:25 +02:00
Samuel Pitoiset	6521d4a659	Revert "radv: Optimize rebinding the same descriptor set." This introduces random GPU hangs on Vega, at least. This reverts commit `02a43edf18`.	2018-09-17 11:20:57 +02:00
Samuel Pitoiset	90819abb56	radv: fix descriptor pool allocation size The size has to be multiplied by the number of sets. This gets rid of the OUT_OF_POOL_KHR error and fixes a crash with the Tangrams demo. CC: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-17 10:18:01 +02:00
Jason Ekstrand	67094e11e9	anv/query: Add an emit_srm helper Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-17 02:57:21 -05:00
Jason Ekstrand	40149441b8	anv: Add a mi_memset and use it for zeroing queries Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-17 02:57:21 -05:00
Jason Ekstrand	b11e9b5ffe	anv/query: Use anv_address everywhere Instead of passing around BOs and offsets, use addresses which are anv's GPU equivalent of pointers. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-17 02:57:21 -05:00
Jason Ekstrand	07e214f1ce	anv/query: Write both dwords in emit_zero_queries Each query slot is a uint64_t and we were only zeroing half of it. Fixes: `7ec6e4e689` "anv/query: implement multiview interactions" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-17 02:57:21 -05:00
Jason Ekstrand	c0420a62c9	anv/query: Increment an index while writing results Instead of computing an index at the end which we hope maps to the number of things written, just count the number of things as we go. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-17 02:57:21 -05:00
Ian Romanick	df9dbc03d3	i965/fs: Don't propagate conditional modifiers from integer compares to adds No shader-db changes on any Intel platform... which probably explains why no bugs have been bisected to this problem since it landed in Mesa 18.1. :( The commit mentioned below is in 18.2, so 18.1 would need a slightly different fix (due to code refactoring). Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Fixes: `77f269bb56` "i965/fs: Refactor propagation of conditional modifiers from compares to adds" Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> (reviewed the original patch) Cc: Matt Turner <mattst88@gmail.com> (reviewed the original patch)	2018-09-17 00:38:22 -07:00
Bas Nieuwenhuizen	0dd8189f15	radv: Only allow 16 user SGPRs for compute on GFX9+. Apparently for compute there are only 16 instead of the 32 for the graphics path. Fixes dEQP-VK.binding_model.descriptorset_random.sets16.noarray.ubolimitlow.sbolimitlow.imglimitlow.noiub.comp.0 CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-09-16 12:50:58 +02:00
Bas Nieuwenhuizen	d97c892584	radv: Set the user SGPR MSB for Vega. Otherwise using 32 user SGPRs would be broken. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-09-16 12:50:58 +02:00
Bas Nieuwenhuizen	02a43edf18	radv: Optimize rebinding the same descriptor set. This makes it cheaper to just change the dynamic offsets with the same descriptor sets. Suggested-by: Philip Rebohle <philip.rebohle@tu-dortmund.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-09-16 12:50:19 +02:00
Gert Wollny	14976817f4	r600/sb: use safe math optimizations when TGSI contains precise operations Fixes: dEQP-GLES3.functional.shaders.invariance.highp.common_subexpression_3 dEQP-GLES3.functional.shaders.invariance.mediump.common_subexpression_3 dEQP-GLES3.functional.shaders.invariance.lowp.common_subexpression_3 Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-09-15 20:44:53 +02:00
Mauro Rossi	cc3b99bb48	android: broadcom/cle: export the broadcom top level path headers Fixes the following building error in vc4 build: In file included from external/mesa/src/gallium/drivers/vc4/kernel/vc4_render_cl.c:34: In file included from external/mesa/src/gallium/drivers/vc4/kernel/vc4_drv.h:27: In file included from external/mesa/src/gallium/drivers/vc4/vc4_simulator_validate.h:34: In file included from external/mesa/src/gallium/drivers/vc4/vc4_context.h:39: In file included from external/mesa/src/gallium/drivers/vc4/vc4_cl.h:56: gen/STATIC_LIBRARIES/libmesa_broadcom_genxml_intermediates/broadcom/cle/v3d_packet_v21_pack.h:12:10: fatal error: 'cle/v3d_packet_helpers.h' file not found ^~~~~~~~~~~~~~~~~~~~~~~~~~ 1 error generated. Fixes: `5b102160ae` ("broadcom/genxml: Introduce a V3D packet/struct decoder.") Cc: "18.2" <mesa-stable@lists.freedesktop.org> Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Mauro Rossi <issor.oruam@gmail.com>	2018-09-15 09:14:46 +02:00
Mauro Rossi	9158e0bd82	android: broadcom/cle: add gallium include path Fixes the following building error: In file included from external/mesa/src/broadcom/cle/v3d_decoder.c:38: In file included from external/mesa/src/broadcom/cle/v3d_packet_helpers.h:29: external/mesa/src/gallium/auxiliary/util/u_math.h:42:10: fatal error: 'pipe/p_compiler.h' file not found ^~~~~~~~~~~~~~~~~~~ 1 error generated. Fixes: `5b102160ae` ("broadcom/genxml: Introduce a V3D packet/struct decoder.") Cc: "18.2" <mesa-stable@lists.freedesktop.org> Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Mauro Rossi <issor.oruam@gmail.com>	2018-09-15 09:14:42 +02:00
Mauro Rossi	3341429d74	android: broadcom/genxml: fix collision with intel/genxml header-gen macro Fixes the following building error, happening when building both intel and broadcom: Gen Header: libmesa_broadcom_genxml_32 <= v3d_packet_v21_pack.h FAILED: gen/STATIC_LIBRARIES/libmesa_broadcom_genxml_intermediates/broadcom/cle/v3d_packet_v21_pack.h /bin/bash -c "python external/mesa/src/broadcom/cle/gen_pack_header.py \ external/mesa/src/broadcom/cle/v3d_packet_v21.xml \ > gen/STATIC_LIBRARIES/libmesa_broadcom_genxml_intermediates/broadcom/cle/v3d_packet_v21_pack.h" Traceback (most recent call last): File "external/mesa/src/broadcom/cle/gen_pack_header.py", line 626, in <module> p = Parser(sys.argv[2]) IndexError: list index out of range header-gen macro is already defined by Intel genxml building rules and the existing header-gen does not have the $(PRIVATE_VER) argument, infact the bash command line logged in the building error is missing exactly $(PRIVATE_VER) argument Renaming the macro as pack-header-gen in src/broadcom/Android.genxml.mk solves the building error, another possible way is to keep the gen rules commands expanded and not use the macros. Fixes: `7f80a9ff13` ("vc4: Introduce XML-based packet header generation like Intel's.") Cc: "18.2" <mesa-stable@lists.freedesktop.org> Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Mauro Rossi <issor.oruam@gmail.com>	2018-09-15 09:14:33 +02:00
Caio Marcelo de Oliveira Filho	f9d25f630c	anv/memcpy: fix build after starting to use addresses The offsets now come from the anv_address, these references were not updated and using the old variable. Fixes: `e1ab834557` "anv/memcpy: Use addresses instead of bo+offset" Tested-by: Clayton Craft <clayton.a.craft@intel.com>	2018-09-14 21:45:50 -07:00
Jason Ekstrand	d6a73824bd	anv/cmd_buffer: Take an address in emit_lrm Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-14 22:12:11 -05:00
Jason Ekstrand	e1ab834557	anv/memcpy: Use addresses instead of bo+offset Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-14 22:12:11 -05:00
Jason Ekstrand	90b46f6c17	anv/so_memcpy: Use the correct SO_BUFFER size on gen8+ This shouldn't matter as we'll never write OOB anyway but we may as well get it right. It's supposed to be in dwords - 1. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2018-09-14 22:12:11 -05:00
Timothy Arceri	e29f0ede75	ac: fix get_image_coords() for radeonsi Because this was setting image to true we would end up calling si_load_image_desc() when we sould be calling si_load_sampler_desc(). This fixes an assert() in Deus Ex: MD Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-15 12:23:32 +10:00
Marek Olšák	914bd3014f	gallium/util: don't let child processes inherit our thread affinity v2: corrected the comment	2018-09-14 21:15:39 -04:00
Marek Olšák	7d41a7593a	gallium/util: start with a random L3 cache index for AMD Zen	2018-09-14 21:05:37 -04:00
Josh Pieper	936e0dcd61	st/mesa: Validate the result of pipe_transfer_map in make_texture (v2) When using Freecad, I was getting intermittent segfaults inside of mesa. I traced it down to this path in st_cb_drawpixels.c where the result of pipe_transfer_map wasn't being checked. In my case, it was returning NULL because nouveau_bo_new returned ENOENT. I'm by no means a mesa developer, but this patch solves the problem for me and seems reasonable enough. v2: Marek - also unmap the PBO and release the texture, and call the make_texture function sooner for less cleanup Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org>	2018-09-14 21:05:37 -04:00
Samuel Pitoiset	c79aad30ae	radv: emit the initial config only once in the preambles It shouldn't be needed to emit the initial graphics or compute state when beginning a new command buffer. Emitting them in the preamble should be enough and this will reduce IB sizes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	9de062ef20	radv: fix setting global locations for indirect descriptors Indirect descriptors only need one entry, we don't have to emit a location for every descriptors. Fixes GPU hangs with new CTS: dEQP-VK.binding_model.descriptorset_random.* CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	748f4cce18	radv: fix flushing indirect descriptors Let say, we first bind a graphics pipeline that needs indirect descriptors sets. The userdata pointers will be emitted at draw time. Then if we bind a compute pipeline that doesn't need any indirect descriptors, the driver will re-emit them for all grpahics stages. To avoid this to happen, just check the bind point type. CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	063264db5b	radv: fix GPU hangs with 32-bit indirect descriptors LLVM 6 isn't affected. Fixes GPU hangs with new CTS: dEQP-VK.binding_model.descriptorset_random.* CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	aa30205929	radv: handle loc->indirect correctly for the first descriptor This was wrong for descriptor #0 when all of them are indirect. This is because indirect_offset was 0 and we emitted a "normal" descriptor pointer for nothing. While we are at it remove radv_userdata_info::indirect_offset which is useless. CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	b9f6521157	radv: bump the maximum number of arguments to 64 Bumping to 64 should be safe enough. Fixes some crashes with new CTS: dEQP-VK.binding_model.descriptorset_random.* CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	c28ea92947	radv: tidy up ac_setup_rings() for the GSVS rings Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	40fb8c7fca	radv: fix setting the number of entries for GSVS on VI+ According to RadeonSI, it's unnecessary to multiply by the stride. That field seems to always be 64. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	a006c24237	radv: always compute the number of components from the output mask That removes two special cases for clip/cull distances. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	9447e91329	radv: emit data contiguously in the GS->VS ring buffer Instead of having holes. The other ring parameters like offset and stride can be updated later. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	fbc064a5b4	radv: make use of the output usage mask in GS copy shader This is just for consistency because LLVM can detect and remove unused loads. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	f398595dca	radv: improve a comment in si_emit_set_predication_state() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	abdf396cbe	radv: fix VK_EXT_conditional_rendering visibility It's actually just the opposite. This fixes the new Sascha conditionalrender demo. CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Samuel Pitoiset	18464d298b	radv: make use of ac_unpack_param() instead of ac_build_bfe() Same code is generated because LLVM ends up by using bfe, but that seems cleaner to me. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-14 10:59:52 +02:00
Timothy Arceri	21e34bab09	nir: add loop unroll support for complex wrapper loops In GLSL IR we cheat with switch statements and simply convert them into loops with a single iteration. This allowed us to make use of the existing jump instruction handling provided by the loop handing code, it also allows dead code to be cleaned up once we have wrapped the code in a loop. However using loops in this way created previously unrollable loops which limits further optimisations. Here we provide a way to unroll loops that end in a break and have multiple other exits. All shader-db changes are from the dolphin uber shaders. There is a small amount of HURT shaders but in general the improvements far exceed the HURT. shader-db results IVB: total instructions in shared programs: 10018187 -> 10016468 (-0.02%) instructions in affected programs: 104080 -> 102361 (-1.65%) helped: 36 HURT: 15 total cycles in shared programs: 220065064 -> 154529655 (-29.78%) cycles in affected programs: 126063017 -> 60527608 (-51.99%) helped: 51 HURT: 0 total loops in shared programs: 2515 -> 2308 (-8.23%) loops in affected programs: 903 -> 696 (-22.92%) helped: 51 HURT: 0 total spills in shared programs: 4370 -> 4124 (-5.63%) spills in affected programs: 1397 -> 1151 (-17.61%) helped: 9 HURT: 12 total fills in shared programs: 4581 -> 4419 (-3.54%) fills in affected programs: 2201 -> 2039 (-7.36%) helped: 9 HURT: 15 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-14 16:07:36 +10:00
Timothy Arceri	2975422ceb	nir: propagates if condition evaluation down some alu chains v2: - only allow nir_op_inot or nir_op_b2i when alu input is 1. - use some helpers as suggested by Jason. v3: - evaluate alu op for single input alu ops - add helper function to decide if to propagate through alu - make use of nir_before_src in another spot shader-db IVB results: total instructions in shared programs: 9993483 -> 9993472 (-0.00%) instructions in affected programs: 1300 -> 1289 (-0.85%) helped: 11 HURT: 0 total cycles in shared programs: 219476091 -> 219476059 (-0.00%) cycles in affected programs: 7675 -> 7643 (-0.42%) helped: 10 HURT: 1 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-14 16:07:36 +10:00
Timothy Arceri	ef4ad7baf1	nir: evaluate if condition uses inside the if branches Since we know what side of the branch we ended up on we can just replace the use with a constant. All the spill changes in shader-db are from Dolphin uber shaders, despite some small regressions the change is clearly positive. V2: insert new constant after any phis in the use->parent_instr->type == nir_instr_type_phi path. v3: - use nir_after_block_before_jump() for inserting const - check dominance of phi uses correctly v4: - create some helpers as suggested by Jason. v5 (Jason Ekstrand): - Use LIST_ENTRY to get the phi src shader-db results IVB: total instructions in shared programs: 9999201 -> 9993483 (-0.06%) instructions in affected programs: 163235 -> 157517 (-3.50%) helped: 132 HURT: 2 total cycles in shared programs: 231670754 -> 219476091 (-5.26%) cycles in affected programs: 143424120 -> 131229457 (-8.50%) helped: 115 HURT: 24 total spills in shared programs: 4383 -> 4370 (-0.30%) spills in affected programs: 1656 -> 1643 (-0.79%) helped: 9 HURT: 18 total fills in shared programs: 4610 -> 4581 (-0.63%) fills in affected programs: 374 -> 345 (-7.75%) helped: 6 HURT: 0 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-14 16:07:36 +10:00
Erik Faye-Lund	fa5e9f1f73	virgl: adjust strides when mapping temp-resources When we're mapping temp-resources, we clip the resource to the transfer-box, which means the stride might not be correct any more. So let's update the stride from the temp-resource, and recompute the layer-stride. This fixes crashes when running dEQP with --deqp-gl-config-name=rgba8888d24s8ms4 Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Fixes: `a8987b88ff` "virgl: add driver for virtio-gpu 3D (v2)" Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-09-14 10:59:02 +10:00
Pierre Moreau	21b92b3464	nvir: Always split 64-bit IMAD/IMUL operations Those operations do not map to actual hardware instructions, therefore those should always be lowered to 32-bit instructions. Fixes: `009c54aa7a` "nv50/ir: Split 64-bit integer MAD/MUL operations" Signed-off-by: Pierre Moreau <pierre.morrow@free.fr> Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Karol Herbst <kherbst@redhat.com>	2018-09-13 20:49:38 +02:00
Leo Liu	cb63e5d1eb	st/vdpau: Use output buffer as back buffer with 24-bit color only Using output buffer with 8 bits video RGB as back buffer certainly is not working for 30 bits color depth visual. Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-09-13 14:28:32 -04:00
Leo Liu	4d8ec12f03	vl/dri: add color depth to vl winsys For VDPAU use later Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-09-13 14:28:32 -04:00
Leo Liu	cd77d49ecf	vl/dri3: add support for 10 bits format Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-09-13 14:28:32 -04:00
Leo Liu	902358de4b	vl/dri: add 10 bits format supports v2: Tell B10G10R10X2 and R10G10B10X2 formats for different HW. Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-09-13 14:28:32 -04:00
Kristian H. Kristensen	aaafae4f55	egl/android: Declare droid_load_driver() static Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org>	2018-09-13 11:12:35 -07:00
Samuel Pitoiset	d4bf954fe6	radv: fix function names for VK_EXT_conditional_rendering Otherwise they are not exported. CC: 18.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Dave Airlie <airlied@redhat.com Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-09-13 16:03:18 +02:00
Jason Ekstrand	1a263b377c	anv: Silence a couple compiler warnings [63/93] Compiling C object 'src/intel/vulkan/...intel@vulkan@@anv_common@sta/anv_device.c.o'. ../src/intel/vulkan/anv_device.c:685:30: warning: passing 'const char ' to parameter of type 'void ' discards qualifiers [-Wincompatible-pointer-types-discards-qualifiers] vk_free(&instance->alloc, instance->app_info.app_name); ^~~~~~~~~~~~~~~~~~~~~~~~~~~ ../src/vulkan/util/vk_alloc.h:62:51: note: passing argument to parameter 'data' here vk_free(const VkAllocationCallbacks alloc, void data) ^ ../src/intel/vulkan/anv_device.c:686:30: warning: passing 'const char ' to parameter of type 'void ' discards qualifiers [-Wincompatible-pointer-types-discards-qualifiers] vk_free(&instance->alloc, instance->app_info.engine_name); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../src/vulkan/util/vk_alloc.h:62:51: note: passing argument to parameter 'data' here vk_free(const VkAllocationCallbacks alloc, void data) ^ [65/93] Compiling C object 'src/intel/vulkan/...ommon@sta/anv_nir_apply_pipeline_layout.c.o'. ../src/intel/vulkan/anv_nir_apply_pipeline_layout.c:519:13: warning: unused variable 'image_uniform' [-Wunused-variable] unsigned image_uniform; Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-09-12 21:20:27 -05:00
Michel Dänzer	e34dd4f508	loader/dri3: Don't wait for fence of old buffer when re-allocating it We only need to wait for the fence before drawing to a buffer, not before reading from it. This might avoid hangs when re-allocating the fake front buffer, similar to the previous change. But I haven't seen any evidence that this was actually happening in practice. Tested-by: Olivier Fourdan <ofourdan@redhat.com>	2018-09-12 16:55:09 +02:00
Michel Dänzer	aefac10fec	loader/dri3: Only wait for back buffer fences in dri3_get_buffer We don't need to wait before drawing to the fake front buffer, as front buffer rendering by definition is allowed to produce artifacts. Fixes hangs in some cases when re-using the fake front buffer, due to it still being busy (i.e. in use for presentation). Cc: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/106404 Bugzilla: https://bugs.freedesktop.org/107757 Tested-by: Olivier Fourdan <ofourdan@redhat.com> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>	2018-09-12 16:53:58 +02:00
Vadym Shovkoplias	9b5c0c520f	glsl/linker: Check the invariance of built-in special variables From Section 4.6.4 (Invariance and Linkage) of the GLSL ES 1.0 specification "The invariance of varyings that are declared in both the vertex and fragment shaders must match. For the built-in special variables, gl_FragCoord can only be declared invariant if and only if gl_Position is declared invariant. Similarly gl_PointCoord can only be declared invariant if and only if gl_PointSize is declared invariant. It is an error to declare gl_FrontFacing as invariant. The invariance of gl_FrontFacing is the same as the invariance of gl_Position." Fixes: * glsl-pcoord-invariant.shader_test * glsl-fcoord-invariant.shader_test * glsl-fface-invariant.shader_test Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107734 Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-09-12 11:43:21 +03:00
Tapani Pälli	30580640f2	intel/tools: fix initial position of window in aubinator viewer Currently position is set before widgets are sized by gtk and calculation can get wrong results where window is positioned offscreen. Patch fixes this by setting aubfile window position as 0,0 only when size_allocate has been called to the widget. Now window is always positioned to 0,0 if imgui.ini is missing. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-12 11:43:21 +03:00
Erik Faye-Lund	eaa718588e	winsys/virgl: avoid unintended behavior If we end up never taking the loop that writes ret, we can end up with an uninitialized value, and if we're really unlucky, that value can be -1, causing us to go down an error-path instead of a success path. This was obviously not intended, so let's just initialize this to zero. Noticed by Valgrind: Conditional jump or move depends on uninitialised value(s) at 0xBA640A0: virgl_drm_winsys_resource_cache_create (virgl_drm_winsys.c:348) by 0xBA62FCF: virgl_buffer_create (virgl_buffer.c:170) by 0xBA605AC: virgl_resource_create (virgl_resource.c:60) by 0xBCF816F: bufferobj_data (st_cb_bufferobjects.c:344) by 0xBCF816F: st_bufferobj_data (st_cb_bufferobjects.c:390) by 0xBB7E836: vbo_use_buffer_objects (vbo_exec_api.c:1136) by 0xBCFCC6E: st_create_context_priv (st_context.c:414) by 0xBCFD3CD: st_create_context (st_context.c:590) by 0xBBB30CA: st_api_create_context (st_manager.c:896) by 0xB981E76: dri_create_context (dri_context.c:155) by 0xB97BDCE: driCreateContextAttribs (dri_util.c:473) by 0x5288331: dri3_create_context_attribs (dri3_glx.c:309) by 0x5264D64: glXCreateContextAttribsARB (create_context.c:78) Fixes: `a8987b88ff` ("virgl: add driver for virtio-gpu 3D (v2)") Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2018-09-12 10:14:43 +02:00
Juan A. Suarez Romero	d631916f29	travis: use python3.5 for meson Newer Meson versions require python >=3.5. But in Trusty default python3 version is 3.4.x. Install python3.5 and makes it the default version for Meson using update-alternatives method. CC: Jan Vesely <jano.vesely@gmail.com> CC: Andres Gomez <agomez@igalia.com> CC: Emil Velikov <emil.l.velikov@gmail.com> CC: Jon Turney <jon.turney@dronecode.org.uk> CC: Eric Engestrom <eric.engestrom@intel.com> CC: Dylan Baker <dylan@pnwbakers.com> Fixes: `3824c8e7cd` "meson: disable asserts by default on release builds" Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Andres Gomez <agomez@igalia.com>	2018-09-11 14:27:58 +01:00
Samuel Pitoiset	3d08631fe5	radv: adjust ESGS ring buffer size computation on VI+ Noticed while working in this area. Ported from RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-09-11 11:30:19 +02:00
Gert Wollny	47e01e77d8	mesa/texture: Also check for LA texture when querying intensity component size Gallium may pick L16A16_FLOAT to represent GL_INTENSITY16F if no intensity format is provided by the driver. However, when calling glGetTexLevelParameteriv(..., GL_TEXTURE_INTENSITY_SIZE, ...) mesa will return a zero size because the actually used format has no intensity channel and as a fallback only the sizes of the red/green channels are checked. Also checking for LA sizes in the allocated texture resolves this problem. v2: Only check alpha channel size and return it (Marek) L and A size are always the same in this case. Fixes (on virgl): ext_framebuffer_multisample-fast-clear GL_ARB_texture_float * Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107832 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-11 09:07:05 +02:00
Ilia Mirkin	133e12fb69	nv50,nvc0: warn on not-explicitly-handled caps Not handling caps explicitly means that we're likely getting incorrect values -- these need to be reviewed and set appropriately. While we're at it, add in some missing caps, and set all the subpixel stuff to 8 as that seems to be what the blob reports. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-09-11 01:25:19 -04:00
Timothy Arceri	e66c2158f8	mesa: remove duplicate dispatch sanity tests This removes duplicate tests from gl_core_functions_possible that are already covered by common_desktop_functions_possible. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-09-11 10:13:31 +10:00
Timothy Arceri	355a5ef761	mesa: tidy up init_matrix_stack() Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2018-09-11 09:26:04 +10:00
Christopher Egert	51995f6920	radeon: fix ColorMask Since commit `af3685d149` various OpenGL applications regressed on the classic mesa radeon driver. Signed-off-by: Christopher Egert <cme3000@gmail.com> CC: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-09-10 16:57:20 -04:00
Elie Tournier	9179c745f6	gallium: Correctly handle no config context creation This patch fixes the following Piglit test: spec@egl_mesa_configless_context@basic It also fixes few test in a virgl guest. v2: Evaluate the value of no_config (Ilia) Suggested-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Elie Tournier <elie.tournier@collabora.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-09-10 15:30:17 -04:00
Bas Nieuwenhuizen	f6e09db2e6	radv: Support v3 of VK_EXT_vertex_attribute_divisor. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> CC: 18.2 <mesa-stable@lists.freedesktop.org>	2018-09-10 21:26:17 +02:00
Marek Olšák	867f7aaed2	radeonsi/nir: port some bindless and sampler code from TGSI These might be all missing changes for bindless textures. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:23:21 -04:00
Marek Olšák	b00deed66f	radeonsi: adjust and simplify max_alloc_size determination Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	203ef19f48	radeonsi: split si_copy_buffer compute and SDMA will be added into it. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	986d6f12fb	radeonsi: don't call VBO prefetch with size=0 for the next commit. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	1119fe5c25	radeonsi: merge SI and CI dma_clear_buffer and remove the callback also use assertions for the requirements that offset and size are a multiple of 4. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	be0bd95abf	radeonsi: fix GPU hangs with bindless textures and LLVM 7.0 Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	fa595e3d0c	ac: remove deprecated use of LLVMInt1Type() Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	cc36ebbdc3	ac: use iN_0/1 constants Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	bc09c3d59e	ac: add radeon_info::num_good_cu_per_sh Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	a5f35aa742	ac: revert new LLVM 7.0 behavior for fdiv Cc: 18.2 <mesa-stable@lists.freedesktop.org> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	662db03577	radeonsi: fix printing a BO list into ddebug reports important for debugging Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	da72b6296c	r600: fix HTILE for NPOT textures with mipmapping Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	d4e52281aa	winsys/radeon: fix CMASK fast clear for NPOT textures with mipmapping on SI/CI Cc: 18.2 <mesa-stable@lists.freedesktop.org> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Marek Olšák	a1b9a00f82	radeonsi: fix HTILE for NPOT textures with mipmapping on SI/CI VI uses addrlib so it's unaffected. Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 15:19:56 -04:00
Brian Paul	5162735957	docs: document new features/extensions in driver for WS 15 / Fusion 11 Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	7baf45dfc7	svga: assorted fixes/changes in svga_pipe_blit.c To align the code with VMware's in-house copy. Signed-off-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	25fceccf72	svga: set buffer bind_flags in svga_buffer_add_host_surface() To match the in-house VMware code. Signed-off-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	337a74aa40	svga: add format conversion for legacy formats This patch extends the format_conversion table to support different view formats on texture buffer. For legacy image formats such as INTENSITY, LUMINANCE, LUMINANCE_ALPHA, special swizzle masks will be used on the red or RG channels. This fixes piglit test arb_texture_buffer_object-formats fs\|vs arb Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	389450a271	svga: remove obsolete code to reemit gs binding The svga_reemit_gs_bindings function is no longer needed. Remove it. Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	c174ee9f9d	svga: move variant->fs_shadow_compare_units assignment Fixes a crash since the variant object isn't allocated until later in the function. Not sure how this got through. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	cb70474b20	svga: fix resource checking in is_blending_enabled() This patch makes sure a valid color buffer is bound before checking its resource. This fixes Unigine Valley running in SM41 device. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Neha Bhende	c6103328ab	svga: Use texture_copy_region instead of texture_copy_handle for multisampling This fixes some of tests cases in arb_copy_image-formats and also fixes SurfaceCopy related errors in vmware.log when multi sampled surfaces are used. Tested with piglit, glretrace on windows and linux VM. v2: As per Brian's comment Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	fdf5885183	svga: add missing devcap check for texture array support The patch checks DXFMT_ARRAY devcap for texture array support. Tested with MTT-piglit. No regressions. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	3069581260	svga: no need to check MULTISAMPLE devcap for view format According to the current SVGA contract, any view format can be used on the underlying resource that is multisample. So there is no need to check the MULTISAMPLE devcap for the view format. Fixes black rendering issue with Tropics running with 4xMSAA. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	6f254ad9b4	svga: sync devcap name changes in svga3d_devcaps.h Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	49428c8d61	svga: explicit set DXFMT_SHADER_SAMPLE for DS format for pre-SM41 device Explicit set the DXFMT_SHADER_SAMPLE bit for depth stencil formats for pre-SM41 device only. This bit is now set by the SM41 device. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	379a2f265f	svga: remove unused variable Trivial.	2018-09-10 13:07:30 -06:00
Brian Paul	cbcc416a58	svga: draw round points when msaa is enabled See comments for details. This allows the piglit ext_framebuffer_multisample-point-smooth test to pass. Also, test the pipe_rasterizer_state::point_quad_rasterization field to see if sprite point rasterization is needed because it's possible for no sprite_coord_enable bits to be set when drawing sprites. Finally, remove old, stale comments. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	6b039c7d7c	svga: check number of samples before emitting MSAA decls/opcodes If real MSAA is not available, we only support 1 sample/pixel. In that case, we must not declare MSAA resources or emit MSAA opcodes. Do that by checking the sample count. Fixes several piglit MSAA tests, such as arb_texture_multisample-sample-depth (when the hard-coded sample count of 4 is fixed in that test). Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	cf2fb6813c	svga: remove obsolete comment on format_cap_table[] We removed the special cases referred to in this comment in the commit "svga: add a separate function to get dx format capabilities from vgpu10 device". Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	0fc6c17bf2	svga: allow TGSI_TEXTURE_CUBE_ARRAY in emit_tg4() Technically, SM4.1 doesn't support cube map arrays, but our backend renderers actually do. This allows the Piglit textureGather cube map array tests to pass. Tested with GLrenderer, DX11renderer and SWrenderer. Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	3467a274e0	svga: no dma on multisample surface Force direct map on multisample surface. Fixes SVGA Driver Errors running multisample piglit tests on Linux VM v2: use texture for the check. Reviewed-by: Neha Bhende <bhenden@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	5f14444184	svga: src surface for IntraSurfaceCopy cannot be multisample Fixes SVGA Driver Errors with piglit test arb_copy_image-targets Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Neha Bhende <bhenden@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	026e1ad7bb	svga: fix missing format multisample devcap check In commit e4048f6cd1, svga_is_dx_format_supported() is supposed to also check the SVGA3D_DXFMT_MULTISAMPLE bit for multisample support of a format. Somehow that code is not included in that commit. This patch fixes it. Fixes piglit test spec@ext_framebuffer_multisample@formats all_samples. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	285d8b47b1	svga: fix incorrect multisample support in VGPU9 device Commit e4048f6cd1 unintentionally allows multisample support for VGPU9 device. This patch fixes this regression. Reviewed-by: Neha Bhende <bhenden@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	59a56ca1c8	svga: fix the missing devcap for SVGA3D_BC3_UNORM_SRGB Set the devcap to SVGA3D_DEVCAP_DXFMT_BC3_UNORM_SRGB Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Neha Bhende <bhenden@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	16666eb470	svga: add a separate function to get dx format capabilities from vgpu10 device Currently we have one function to get format capabailities and we convert DX10 devcaps back to DX9. This can be confusing. Going forward we will have a separate function for dealing with dx formats. This patch also fixes the depth stencil devcap. Instead of hardcoding the capabilities for the depth stencil formats, we will inquire the device for the capabilities. Note: we will still need to explicity set the SVGA3D_DXFMT_SHADER_SAMPLE bit for SVGA3D_R32_FLOAT_X8X24 and SVGA3D_R24_UNORM_X8 since this bit is not advertised but supported by the device. v2: reapply the patch after svga_is_format_supported is moved to svga_format.c Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Neha Bhende <bhenden@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	b1aee7ff05	svga: assign a separate function for is_format_supported() for vgpu10 device This patch adds a new function svga_is_dx_format_supported() to check for format support in a VGPU10 device. v2: reapply the patch after svga_is_format_supported is moved to svga_format.c Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Neha Bhende <bhenden@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	1ea9c80d6d	svga: add some devcap debugging code Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	96ef81e39e	svga: fix depth and coverage mask output declaration Set the component mask to zero for both registers. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	7187a2f7ff	svga: add sample positions for 2 samples Fixes piglit tests spec@arb_sample_shading@builtin-gl-sample-position 2 spec@arb_texture_multisample@fb-completeness@2 Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	73c850fb9a	svga: check sample count devcaps Check sample count devcaps from the svga device to determine the supported sample counts. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	afacde3553	svga: fix 1-element cube map array issue As with 1D and 2D array textures, if there's only one array element (one cubemap in this case) we have to issue different shader code. This fixes a number of Piglit cubemap array tests. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	767c1eb436	svga: simplify array test in svga_init_shader_key_common() And squash commit a patch to silence a compiler warning (add default case to the switch statement). Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	45517f492b	winsys/drm: check for CAPS2/SM41 support if VGPU10 is enabled No need to check for HW_CAPS2 or SM4_1 support if VGPU10 is not enabled or is explicitly disabled via the environment variable SVGA_VGPU10. Reviewed-by: Deepak Rawat <drawat@vmware.com>	2018-09-10 13:07:30 -06:00
Deepak Rawat	159e706c4c	winsys/drm: Add support for quality level in surface ioctl A new argument "quality level" is added in surface define v3 which represets precision settings for surface. This commit add support for quality level in DRM_VMW_GB_SURFACE_CREATE_EXT and DRM_VMW_GB_SURFACE_REF_EXT. Signed-off-by: Deepak Rawat <drawat@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	b343c6915c	svga: sync svga3d_types.h with upstream changes Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	b5827db2ea	winsys/drm: enable intra_surface_copy if HW_CAP2 is supported With drm version 2_15, we can inquire for support of HW_CAP2. If it is supported, we can enable intra_surface_copy support. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Deepak Rawat <drawat@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	7448bb0089	svga: add git version logging at init time Before we can log the git version in the host log, we'll add the git version in the init debug message. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	4669ffd29b	svga: fix a typo in svga_texture_copy_region() Trivial.	2018-09-10 13:07:30 -06:00
Charmaine Lee	3233d05390	svga: use helper function to do copy region Use the common helper function svga_texture_copy_region for copy region command. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	74791b80b9	svga: fix cubemap array rendering with backed surface view This patch fixes the layer index when rendering to a backed surface view of a cubemap array. Fixes piglit test fbo-generatemipmap-cubemap array. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	2d39e6d0c8	svga: add a helper function to send ResolveCopy command Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	9a24b08a49	svga: sync svga3d header files This is a squash of what was orginally three commits. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	f3eda3e5e1	svga: add SM4_1 enable debug print Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	ccd895db76	svga: fix swizzling for texture gather Texture swizzling for texture gather needs to be done to the selected texels rather than to the returned vector. This patch has specical cases for the different swizzles in emit_tg4(). Fixes a lot of piglit texture gather tests. Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	be1993d6ed	svga: fix starting index for system values Currently, the starting index for system values is assigned to the next index after the highest index of the tgsi declared input registers. But the tgsi index might be different from the actual assigned index, hence this might cause overlap of indices. With this patch, the shader linker keeps track of the highest index of the translated input registers, and the next index will be used for the starting index for system values. Fixes SHIM errors running arb_copy_image-formats on SM4_1 device. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Deepak Rawat	569f838987	winsys/svga: Add support for new surface ioctl, multisample pattern Kernel driver version 2.15 added new surface ioctl named: DRM_VMW_GB_SURFACE_CREATE_EXT DRM_VMW_GB_SURFACE_REF_EXT The new ioctl has support for 64-bit svga3d_flags if DRM_VMW_PARAM_SM4_1 is available. Multisampling surface mob size calculation is added. Also synced the relevant header update. svga device modified the surface define command V3 with new parameter multisampling pattern. Adding support for that in winsys. Signed-off-by: Deepak Rawat <drawat@vmware.com> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	3f55425ee6	svga: enable MSAA for SM4_1 device The SVGA device is deprecating the DX9 MSAA support. This patch enables MSAA for SM4_1 device by explicitly setting the SVGA3D_SURFACE_MULTISAMPLE bit. For SM4_1 device, only 4 samples is supported. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	8088cb6f53	svga: add sample count to the surface_can_create interface With this patch, sample count is also taken into account when determining if a resource can be created. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	4a1976bfcf	svga: implement support for GL_ARB_texture_query_lod Just translate the TGSI LODQ intruction to VGPU10 LOD instruction. All (4) Piglit GL_ARB_texture_query_lod tests pass. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: Neha Bhende <bhenden@vmware.com>	2018-09-10 13:07:30 -06:00
Neha Bhende	252e97ecdf	svga: Add support for arb_texture_gather With sm4_1, we can support single channel 2D or CubeMap textures. This patch exercises this feature. Tested with piglit v2: As per Brian's comment Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	36c84bcd77	svga: add support for interpolation at sample position Vs. sampling at the centroid or the fragment center. Note that this does not fix failures with the Piglit arb_sample_shading-interpolate-at-sample-position or arb_sample_shading-ignore-centroid-qualifier.exe tests at this time. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	bcf7aaa9f7	svga: clarify sys value -> input register mapping We translate TGSI system value registers to VGPU10 input registers. Add a comment and set file = TGSI_FILE_INPUT. That's not stricly necessary since we map both TGSI_FILE_INPUT and TGSI_FILE_SYSTEM_VALUE to VGPU10_OPERAND_TYPE_INPUT, but this makes the code a bit more understandable. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	9de5bdb341	svga: add support for FS sample mask output This, with the previous work for sample position/id query, allows us to enable per-sample shading for VGPU 10.1. Note that quite a few Piglit arb_sample_shading tests still do not pass, but many do. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	0a219dd918	svga: add support for sample id, sample position Sample ID is just a system value. Sample position must be implemented with the VGPU10_OPCODE_SAMPLE_POS instruction. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	ac4a0c0e82	svga: implement no-op svga_set_min_samples() This is part of the per-sample shading feature (PIPE_CAP_SAMPLE_SHADING). Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	3c3fc7154e	svga: add support for independent blend function per render target This patch adds support for GL_ARB_draw_buffers_blend extension for SM4_1 device. Fixes piglit test fbo-draw-buffers-blend. This patch is squashed with a subsequent patch which fixed a regression. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	5512f943b8	svga: emit shader version as 4.0 or 4.1 depending on device support Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	1d806b6f13	svga: restructure nested if's in emit_src_register() To make it cleaner for subsequent changes. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	16439085f5	svga: sync VGPU10ShaderTokens.h with upstream changes This includes new DX 10.1 opcodes and tokens. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	22e8099711	svga: add support for shadow cubemap array Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	f929247d24	svga: add support for rendering to cubemap array Fixes piglit test arb_texture_cube_map_array-fbo-cubemap-array Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	1df17fc697	svga: add support for TXL2 opcode This patch adds support for cubemap array texture lookup with explicit LOD. Fixes piglit test arb_texture_cube_map_array-cubemap-lod Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Charmaine Lee	62402be407	svga: add support for cubemap array This patch adds support for cubemap array for SM4_1. Fixes piglit test arb_texture_cube_map_array-cubemap Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Brian Paul	018ff0112f	svga: add have_sm4_1 flag, helper function Signed-off-by: Brian Paul <brianp@vmware.com>	2018-09-10 13:07:30 -06:00
Marek Olšák	d211679017	gallium/u_inlines: remove the destroy variable in pipe_reference_described Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 14:53:01 -04:00
Marek Olšák	ed880fe192	gallium/u_inlines: improve pipe_reference_described perf for debug builds Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-09-10 14:53:01 -04:00
Marek Olšák	c042a34b14	gallium/auxiliary: don't dereference counters twice needlessly Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 14:52:32 -04:00
Marek Olšák	61767c059e	gallium/u_inlines: normalize naming, use dst & src, style fixes (v2) v2: update comments Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-09-10 14:52:32 -04:00
Marek Olšák	9f1bbbdbbd	util: try to fix the Android and MacOS build Bionic does not have pthread_setaffinity_np. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107869 Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-09-10 14:49:07 -04:00
Jason Ekstrand	6f00785765	anv: Support v3 of VK_EXT_vertex_attribute_divisor Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-10 13:45:32 -05:00
Jason Ekstrand	34a17a48d4	vulkan: Update the XML and headers to 1.1.84 Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-10 13:30:21 -05:00
Sergii Romantsov	bbe551f3ea	mesa/meson: 32bit xmlconfig linkage Building of 32bit mesa with meson causes linkage issue: "undefined reference to `util_get_process_name'" Fixed by adding link-with mesa_util for xmlconfig primary. v2: Removed '[]', commit message corrected. v3: Reverted changes in gbm and glx libraries. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107843 Fixes: `2e1e6511f7` "util: extract get_process_name from xmlconfig.c" Cc: Marek Olšák <marek.olsak@amd.com> Cc: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-09-10 08:57:42 -07:00
Jose Fonseca	52ca32121b	Require Visual Studio 2015. We no longer need or use Visual Studio 2013. https://ci.appveyor.com/project/jrfonseca/mesa/build/52 Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-09-10 10:10:16 +01:00
Jose Fonseca	d5f934522d	util: Make util_context_thread_changed a no-op on Windows. Spite using thrd_t types, these functions are wed to pthreads, and break Windows builds, because thrd_current() is not implemented there, as it's impossible to have an efficient thrd_current() implementation on Windows. Trivial.	2018-09-10 10:10:16 +01:00
Erik Faye-Lund	c4017106bb	virgl: do not map zero-sized resource When creating textures, we avoid creating backing-store for all multisampled textures, not just depth buffers. So we can't try to map them later. That's just going to fail. So let's take the blit-based code-path that seems to avoid this problem. This make this piglit test-case no longer crash (although it still fails): bin/copyteximage 2D -samples=2 -auto Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-09-10 10:35:42 +02:00
Erik Faye-Lund	8083464013	virgl: remove dead code We don't use the size we calculate in this function, so let's just drop the calculation Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-09-10 10:35:32 +02:00
Erik Faye-Lund	b9c40e492d	virgl: drop needless return-code We always return TRUE, and we never check the return-value. Let's just drop the return value instead. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-09-10 10:35:20 +02:00
Erik Faye-Lund	9635869d73	virgl: free trans on map-error When we fail to map memory, we should also free trans to avoid leaking memory. Noticed while reading code. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-09-10 10:35:02 +02:00
Chris Wilson	44e3e6a9b4	i965: Bump aperture tracking to u64 As a prelude to handling large address spaces, first allow ourselves the luxury of handling the full 4G. Reported-by: Andrey Simiklit <asimiklit.work@gmail.com> Cc: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-09-10 09:14:46 +01:00
Mathias Fröhlich	2fece204c0	etnaviv: Reduce max offset to available hardware bits. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-09-10 07:59:31 +02:00
Mathias Fröhlich	4569bc6ad0	gallium: New cap PIPE_CAP_MAX_VERTEX_ELEMENT_SRC_OFFSET. Introduce a new capability for the maximum value of pipe_vertex_element::src_offset. Initially just every driver backend returns the value previously set from _mesa_init_constants. So this shall end up in no functional change. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-09-10 07:59:31 +02:00
Dave Airlie	240af61494	virgl: don't send a shader create with no data. (v2) This fixes the situation where we'd send a shader with just the header and no data. piglit/glsl-max-varyings test was causing this to happen, and the renderer fix was breaking it. v2: drop fprintf Fixes: `a8987b88ff` "virgl: add driver for virtio-gpu 3D (v2)" Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2018-09-10 12:23:30 +10:00
Timothy Arceri	14fe9fa11b	mesa: enable ARB_vertex_buffer_object in core profile This extension is required by "Wolfenstein: The Old Blood" and is exposed in core in the Nvidia binary driver. All the functions are just alias of the core functions so there should be nothing more to do. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-09-08 14:35:09 +10:00
Marek Olšák	21ca322e63	st/mesa: throttle texture uploads if their memory usage goes beyond a limit This prevents radeonsi from running out of memory. It also increases texture upload performance by being nice to the kernel memory manager.	2018-09-07 17:59:02 -04:00
Marek Olšák	9ce2cef68f	gallium: add PIPE_CAP_MAX_TEXTURE_UPLOAD_MEMORY_BUDGET	2018-09-07 17:59:02 -04:00
Andres Gomez	ecfe41e690	docs: update calendar, add news item and link release notes for 18.2.0 Signed-off-by: Andres Gomez <agomez@igalia.com>	2018-09-08 00:40:43 +03:00
Andres Gomez	5382a90cb2	docs: add sha256 checksums for 18.2.0 Signed-off-by: Andres Gomez <agomez@igalia.com> (cherry picked from commit `cb1ddf48e2`)	2018-09-08 00:28:23 +03:00
Andres Gomez	65f3327db6	docs: update 18.2.0 release notes Signed-off-by: Andres Gomez <agomez@igalia.com> (cherry picked from commit `7378180e7a`)	2018-09-08 00:28:21 +03:00
Marek Olšák	7ac52c2e38	Revert "gallium/os_thread: simplify helper pipe_current_thread_get_time_nano" This reverts commit `6d477bc546`. It fixes the Windows build hopefully.	2018-09-07 16:52:36 -04:00
Jason Ekstrand	465e5a868c	anv: Clamp scissors to the framebuffer boundary The Vulkan 1.1.81 spec says: "It is legal for offset.x + extent.width or offset.y + extent.height to exceed the dimensions of the framebuffer - the scissor test still applies as defined above. Rasterization does not produce fragments outside of the framebuffer, so such fragments never have the scissor test performed on them." Elsewhere, the Vulkan 1.1.81 spec says: "The application must ensure (using scissor if necessary) that all rendering is contained within the render area, otherwise the pixels outside of the render area become undefined and shader side effects may occur for fragments outside the render area. The render area must be contained within the framebuffer dimensions." Unfortunately, there's some room for interpretation here as to what the consequences are of having the render area set to exactly the framebuffer dimensions and having a scissor that is larger than the framebuffer. Given that GL and other APIs provide automatic clipping to the framebuffer, it makes sense that applications would assume that Vulkan does this as well. It costs us very little to play it safe and just clamp client-provided scissors to the framebuffer dimensions. Fortunately, the user is required to provide us with at least one scissor so we don't need to handle the case where they don't. Fixes: `fb2a5ceb32` "anv: Emit DRAWING_RECTANGLE once at driver..." Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-09-07 15:19:02 -05:00
Jason Ekstrand	b08b4b2b25	anv: Disable the vertex cache when tessellating on SKL GT4 I have no idea if I'm correct about what's going wrong or if this is the correct fix. However, in my multiple weeks of banging my head on this hang, a VUE reference counting bug seems to match all the symptoms and it definitely fixes the hang. Cc: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107280 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-09-07 15:19:02 -05:00
Jason Ekstrand	5dee89438a	anv: Implement a VF cache invalidate workaround Known to fix nothing whatsoever but it's in the docs. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-09-07 15:19:02 -05:00
Jason Ekstrand	c643c5e18d	anv: Re-emit vertex buffers when the pipeline changes Some of the bits of VERTEX_BUFFER_STATE such as access type, instance data step rate, and pitch come from the pipeline. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-09-07 15:19:02 -05:00
Marek Olšák	25ffb84016	radeonsi: pin the winsys thread to the requested L3 cache (v2) v2: rebase Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-07 16:03:36 -04:00
Marek Olšák	8016639f63	gallium/u_threaded: implement set_context_param for thread pinning (v2) v2: - use set_context_param - set set_context_param even if the driver doesn't implement it Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-07 16:03:36 -04:00
Marek Olšák	8d473f555a	st/mesa: pin driver threads to a specific L3 cache on AMD Zen (v2) v2: use set_context_param Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-07 16:03:30 -04:00
Marek Olšák	e5e3b5cdcc	gallium: add pipe_context::set_context_param for tuning perf on AMD Zen (v2) State trackers will not use the new param directly, but will instead use a helper in MakeCurrent that does the right thing. v2: rework the interface Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-07 15:48:31 -04:00
Marek Olšák	6d477bc546	gallium/os_thread: simplify helper pipe_current_thread_get_time_nano Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-07 15:48:31 -04:00
Marek Olšák	15fa2c5e35	gallium/u_cpu_detect: get the number of cores per L3 cache for AMD Zen Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-07 15:48:31 -04:00
Marek Olšák	ce432e259d	gallium/u_cpu_detect: fix parsing the CPU family According to: https://support.amd.com/TechDocs/54945_PPR_Family_17h_Models_00h-0Fh.pdf Also Intel: https://www.microbe.cz/docs/CPUID.pdf Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-07 15:48:31 -04:00
Marek Olšák	a84fd58f48	gallium/u_cpu_detect: fix a race condition on initialization Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-07 15:48:31 -04:00
Dylan Baker	8396043f30	Replace uses of _mesa_bitcount with util_bitcount and _mesa_bitcount_64 with util_bitcount_64. This fixes a build problem in nir for platforms that don't have popcount or popcountll, such as 32bit msvc. v2: - Fix additional uses of _mesa_bitcount added after this was originally written Acked-by: Eric Engestrom <eric.engestrom@intel.com> (v1) Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-09-07 10:21:26 -07:00
Dylan Baker	80825abb5d	move u_math to src/util Currently we have two sets of functions for bit counts, one in gallium and one in core mesa. The ones in core mesa are header only in many cases, since they reduce to "#define _mesa_bitcount popcount", but they provide a fallback implementation. This is important because 32bit msvc doesn't have popcountll, just popcount; so when nir (for example) includes the core mesa header it doesn't (and shouldn't) link with core mesa. To fix this we'll promote the version out of gallium util, then replace the core mesa uses with the util version, since nir (and other non-core mesa users) can and do link with mesautils. Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-09-07 10:21:26 -07:00
Dylan Baker	aa4386ebfe	docs: update calendar, add news item and link release notes for X.Y.Z Signed-off-by: Dylan Baker <dylan@pnwbakers.com>	2018-09-07 10:19:33 -07:00
Dylan Baker	d514f55611	docs/relnotes: Add sha256 sums for mesa 18.1.8	2018-09-07 10:17:38 -07:00
Dylan Baker	f6a9f44529	docs: Add release notes for 18.1.8	2018-09-07 10:17:36 -07:00
Jason Ekstrand	f9e630e23d	i965: Workaround the gen9 hw astc5x5 sampler bug gen9 hardware has a bug in the sampler cache that can cause GPU hangs whenever an texture with aux compression enabled is in the sampler cache together with an ASTC5x5 texture. Because we can't control what the client binds at any given time, we have two options: resolve the CCS or decompresss the ASTC. Doing a CCS or HiZ resolve is far less drastic and will likely have a smaller performance impact. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-09-07 10:42:40 -05:00
Eric Anholt	a91b158bd9	v3d: Fix setup of the VCM cache size. There were two bugs working together to make things mostly work: I wasn't dividing the VPM output size available by the size of a batch (vertex), but I also had the size of the VPM reduced by a factor of 8. Fixes dEQP-GLES3.functional.vertex_array_objects.all_attributes and it seems also my intermittent varying failures. Fixes: `1561e4984e` ("v3d: Emit the VCM_CACHE_SIZE packet.")	2018-09-07 08:11:38 -07:00
Eric Anholt	f73f748323	v3d: Fix SRC_ALPHA_SATURATE blending for RTs without alpha. Fixes dEQP-GLES3.functional.fragment_ops.blend.default_framebuffer.rgb_func_alpha_func.dst.src_alpha_saturate_src_alpha_saturate and friends with --deqp-egl-config-name=rgb565d0s0 Cc: "18.2" <mesa-stable@lists.freedesktop.org>	2018-09-07 08:11:05 -07:00
Lionel Landwerlin	69874e9a6a	intel/genxml: turn SLM Enable bit into boolean Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-07 14:46:20 +01:00
Sergii Romantsov	97fcccb25e	i965/tools: 32bit compilation with meson Building of 32bit mesa with meson causes issue: "implicit declaration of function ‘__builtin_ia32_clflush’". Fixed by adding msse2 compilation flag. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107843 Fixes: `314879f7fe` (i965: Fix asynchronous mappings on !LLC platforms.) Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-07 13:46:48 +01:00
Sergii Romantsov	d709f12792	intel: compiler option msse2 and mstackrealign Seems in case of 32-bit library, usage of msse2 makes some stack corruption or incorrect instructions. Usage with mstackrealign fixes that case. v2: Fixed meson. v3: Definition of c_sse2_args moved on the top (L.Landwerlin). Added mstackrealign for Android's mks where msee4.1 is used. v4: Added for Vulkan also. v5: Commit message correction. CC: <mesa-stable@lists.freedesktop.org> Fixes: `6b05c080f2` (i965: Compile with -msse3) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107779 Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-07 13:45:46 +01:00
Rob Clark	5404e0637f	freedreno: fix rast->depth_cleap_near/far Fixes: `daa19363de` gallium: split depth_clip into depth_clip_near & depth_clip_far Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-07 07:41:43 -04:00
Marek Olšák	fda7683726	gallium: enable GL_AMD_depth_clamp_separate on r600, radeonsi	2018-09-06 21:53:00 -04:00
Marek Olšák	daa19363de	gallium: split depth_clip into depth_clip_near & depth_clip_far for AMD_depth_clamp_separate.	2018-09-06 21:53:00 -04:00
Jason Ekstrand	7b26741806	anv/pipeline: Only consider double elements which actually exist The brw_vs_prog_data::double_inputs_read field comes directly from shader_info::double_inputs which may contain inputs which are not actually read. Instead of using it directly, AND it with inputs_read which is only things which are read. Otherwise, we may end up subtracting too many elements when computing elem_count. Cc: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103241 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-06 16:07:50 -05:00
Jason Ekstrand	44ec31cd75	nir: Drop the vs_inputs_dual_locations option It was very inconsistently handled; the only things that made use of it were glsl_to_nir, glspirv, and nir_gather_info. In particular, nir_lower_io completely ignored it so anyone using nir_lower_io on 64-bit vertex attributes was going to be in for a shock. Also, as of the previous commit, it's set by every driver that supports 64-bit vertex attributes. There's no longer any reason to have it be an option so let's just delete it. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-09-06 16:07:50 -05:00
Jason Ekstrand	0909a57b63	radeonsi/nir: Set vs_inputs_dual_locations and let NIR do the remap We were going out of our way to disable dual-location re-mapping in NIR only to then do the remapping in st_glsl_to_nir.cpp. Presumably, this was so that double_inputs would be correct for the core state tracker. However, now that we've it to gl_program::DualSlotInputs which is unaffected by NIR lowering, we can let NIR lower things for us. The one tricky bit here is that we have to remap the inputs_read bitfield back to the single-slot convention for the gallium state tracker to use. Since radeonsi is the only NIR-capable gallium driver that also supports GL_ARB_vertex_attrib_64bit, we only have to worry about radeonsi when making core gallium state tracker changes. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-09-06 16:07:50 -05:00
Jason Ekstrand	25efd787cf	compiler: Move double_inputs to gl_program::DualSlotInputs Previously, we had two field in shader_info: double_inputs_read and double_inputs. Presumably, the one was for all double inputs that are read and the other is all that exist. However, because nir_gather_info regenerates these two values, there is a possibility, if a variable gets deleted, that the value of double_inputs could change over time. This is a problem because double_inputs is used to remap the input locations to a two-slot-per-dvec3/4 scheme for i965. If that mapping were to change between glsl_to_nir and back-end state setup, we would fall over when trying to map the NIR outputs back onto the GL location space. This commit changes the way slot re-mapping works. Instead of the double_inputs field in shader_info, it adds a DualSlotInputs bitfield to gl_program. By having it in gl_program, we more easily guarantee that NIR passes won't touch it after it's been set. It also makes more sense to put it in a GL data structure since it's really a mapping from GL slots to back-end and/or NIR slots and not really a NIR shader thing. Tested-by: Alejandro Piñeiro <apinheiro@igalia.com> (ARB_gl_spirv tests) Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-09-06 16:07:50 -05:00
Marek Olšák	1285f71d3e	gallium: add PIPE_CAP_RASTERIZER_SUBPIXEL_BITS Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-09-06 16:07:40 -04:00
Eric Engestrom	3824c8e7cd	meson: disable asserts by default on release builds By the time Mesa 18.3 comes out (probably December '18), Meson 0.45 will be 9 months old (March '18), so I think this is reasonable. (btw, the currently-required Meson 0.44.1 was released less than 12 days before 0.45, so we're really not bumping by much.) Currently, the Meson versions in the major distributions are: Arch: ships 0.47.2 CentOS: 7 ships 0.47.1 Debian: stable ships 0.37.1, so it hasn't been usable in a long time. everything more recent ships 0.47.2 Fedora: 28 ships 0.45.1 FreeBSD: ships 0.46.1 (ports) Gentoo: ships 0.46.1 OpenSUSE: 15 ships 0.46 Ubuntu: 18.04 ships 0.45.1 Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-09-06 18:16:31 +01:00
Andrii Simiklit	2930b76cfe	mesa/util: add missing va_end() after va_copy() MSDN: "va_end must be called on each argument list that's initialized with va_start or va_copy before the function returns." Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107810 Fixes: `c6267ebd6c` "gallium/util: Stop bundling our snprintf implementation." Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com>	2018-09-06 17:33:27 +01:00
Andrii Simiklit	65cfe698b0	mesa/util: don't ignore NULL returned from 'malloc' We should exit from the function 'util_vasprintf' with error code -1 for case where 'malloc' returns NULL Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Fixes: `864148d69e` "util: add util_vasprintf() for Windows (v2)" Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com>	2018-09-06 17:33:27 +01:00
Andrii Simiklit	570cacba7a	mesa/util: don't use the same 'va_list' instance twice The first usage of the 'va_list' instance could change it. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Fixes: `864148d69e` "util: add util_vasprintf() for Windows (v2)" Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com>	2018-09-06 17:33:27 +01:00
Andrii Simiklit	267ed29288	apple/glx/log: added missing va_end() after va_copy() Each invocation of va_copy() must be matched by a corresponding invocation of va_end() Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Fixes: `51691f0767` "darwin: Use ASL for logging" Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com>	2018-09-06 17:33:27 +01:00
Eric Engestrom	6daba55aa1	meson: drop unnecessary llvm version hacks The current minimum meson version supported is 0.44.1, so we have met both the 0.43 and 0.44 requirement to not need these hacks anymore :) Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-09-06 17:16:58 +01:00
Danylo Piliaiev	2b98a023d9	mesa: add missing return statement for GL_RG_SNORM case Fixes: `0d356cf478` "mesa: enable EXT_render_snorm extension" Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-09-06 17:24:53 +03:00
Eric Engestrom	e67dadd3a9	meson: consolidate langs lists Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-09-06 15:22:24 +01:00
Eric Engestrom	07ff56791d	intel/compiler: remove unused get_image_base_type() Unused since `09f1de97a7` "anv,i965: Lower away image derefs in the driver". Cc: Jason Ekstrand <jason.ekstrand@intel.com> Signed-off-by: Eric Engestrom <eric@engestrom.ch> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-06 15:22:24 +01:00
Mathias Fröhlich	a6232b6932	tnl: Fix green gun regression in xonotic. Fix an other regression of mesa: Make gl_vertex_array contain pointers to first order VAO members. The regression showed up with drivers using the tnl module and was reproducible using xonotic-glx -benchmark demos/the-big-keybench.dem. Fixes: `64d2a20480` mesa: Make gl_vertex_array contain pointers to first order VAO members. Tested-by: Ville Syrjälä <ville.syrjala@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2018-09-06 14:35:12 +02:00
Lionel Landwerlin	2dce1175c1	Revert "i965/tools: 32bit compilation with meson" This reverts commit `4aec44c0d9`. Unfortunately this patch needed a another one to be committed first.	2018-09-06 12:25:07 +01:00
Sergii Romantsov	4aec44c0d9	i965/tools: 32bit compilation with meson Building of 32bit mesa with meson causes issue: "implicit declaration of function ‘__builtin_ia32_clflush’". Fixed by adding msse2 compilation flag. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107843 Fixes: `314879f7fe` (i965: Fix asynchronous mappings on !LLC platforms.) Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-06 11:55:57 +01:00
Timothy Arceri	b9fe8ff23d	glsl: fixer lexer for unreachable defines If we have something like: #ifdef NOT_DEFINED #define A_MACRO(x) \ if (x) #endif The # on the #define is not skipped but the define itself is so this then gets recognised as #if. Until `28a3731e3f` this didn't happen because we ended up in <HASH>{NONSPACE} where BEGIN INITIAL was called stopping the problem from happening. This change makes sure we never call RETURN_TOKEN_NEVER_SKIP for if/else/endif when processing a define. Cc: Ian Romanick <idr@freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107772 Tested-By: Eero Tamminen <eero.t.tamminen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-09-06 10:13:21 +10:00
Hyunjun Ko	2454742a84	freedreno/ir3: insert mov if same instruction in the outputs. For example, result0 = texture(sampler[indexBase + 5], coords); result1 = texture(sampler[indexBase + 0], coords); result2 = texture(sampler[indexBase + 0], coords); out_result0 = result0; out_result1 = result1; out_result2 = result2; In this kind of case we need to insert an extra mov to the outputs so that the result could be assigned to each register respectively. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Hyunjun Ko	b4da2f6667	freedreno/ir3: make immediates array dynamic Since most shaders wouldn't need that large array of immediates, making the array dynamic could save unnecessary spaces. In addition, sometimes we can potentially have a much larger array of immediates to be lowered, which might be more than 64. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Rob Clark	c3d9f29b78	freedreno: allocate ctx's batch on demand Don't fall over when app wants more than 32 contexts. Instead allocate contexts on demand. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Rob Clark	a122118c14	freedreno: add fd_context_batch() accessor For cases in which (after the following commit) ctx->batch may be null. Prep work for following commit. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Rob Clark	a45e1802db	freedreno/a6xx: fix mem2gmem for zsbuf Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Rob Clark	c77e0948c7	freedreno/batch: fix crash in !reorder case We aren't using the batch-cache if reorder==false. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Rob Clark	2c623e7071	freedreno/ir3: better compile_error() printing Try to show the error at the appropriate line of nir Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Rob Clark	ca758251ba	freedreno/a6xx: bordercolor fixes Port fixes from a5xx (`f0715442`) TODO maybe this should move to shared code, since it seems to be the same. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Rob Clark	73378013d7	freedreno: fix context teardown harder The border_color_uploaders need to be torn down before the transfer_pool is destroyed. Fixes: `e11e9d6394` freedreno: fix context teardown race Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Rob Clark	1a24f51966	freedreno/ir3: ignore unused inputs We could end up w/ inputs larger than vec4, simply because unused inputs are not split. Fixes things like dEQP-GLES31.functional.separate_shader.random.77 (and probably a handful of others) Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Rob Clark	6b4397feab	freedreno/a6xx: fix debug build crash Porting `0c8d9e923a` to a6xx. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-05 13:38:43 -04:00
Dylan Baker	d25a27ec56	meson: Print a message about why a libdrm version was selected We require a single version of libdrm for all of our libdrm dependencies (core and driver), but the way this is structured can make the error message less than helpful, as one driver might be the one setting the libdrm requirement, while another might be the one that generates the version failure. This adds a simple message to the output announcing which libdrm module set the version, which might be more helpful. v2: - Use message suggested by Eric Engstrom Fixes: `c445b1d56f` ("meson: Use the same version for all libdrm checks") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-09-05 10:32:51 -07:00
Charmaine Lee	af104ad799	svga: rename face to layer_face Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-05 11:22:42 -06:00
Brian Paul	e334e104d0	svga: encode sample count in resource declarations No regressions before the corresponding host-side change. Reviewed-by: Neha Bhende <bhenden@vmware.com>	2018-09-05 11:22:42 -06:00
Charmaine Lee	49678e9e49	svga: sync with upstream changes to surface flags SVGA device now supports 64 bits surface flags. This patch updates the winsys interface to allow 64 bits surface flags. The linux winsys layer will for now only honor the lower 32 bits of the surface flags. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-05 11:22:42 -06:00
Neha Bhende	4310649ccb	svga: avoid try_blit() for some depth formats on non vgpu10. On non vgpu10, driver doesn't support util_blitter_blit for SVGA3D_Z_D16, SVGA3D_Z_D24x8, SVGA3D_Z_D24S8. Patch fixes following piglit tests regression on hwv8 caused by commit 27bf35caea5e: spec@arb_depth_texture@fbo-depth-gl-depth-component16-blit spec@arb_depth_texture@fbo-depth-gl-depth-component24-blit spec@arb_depth_texture@fbo-depth-gl-depth-component32-blit Tested with mtt-piglit on hw 8,9,10,11,13 and mtt-glretrace on windows and linux. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-05 11:22:42 -06:00
Neha Bhende	53091a0312	svga: convert dst format to linear when blending is enabled. When blending is enabled, framebuffer colorspace has to be linear. Previously, we never hit this case because we were not supporting sRGB drawable. Previous patch added that support. Tested with mtt glretrace, viewperf, piglit, conform. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-05 11:22:42 -06:00
Neha Bhende	dfab1289e8	winsys/svga: Avoid cap2 code path for now CAP2 functionality is not yet part of vmwgfx. This is causing unnecessary dmesg error messages. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-09-05 11:22:42 -06:00
Neha Bhende	8449c33a27	svga: start using SVGA3dCmdIntraSurfaceCopy command for svga_blit. Basically, SVGA3dCmdIntraSurfaceCopy command allow copying when source and destination are same. Tested with MTT piglit, glretrace, viewperf, conform v2: changes as per Charmaine's comment v3: changes as per Charmaine's comment Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-05 11:22:42 -06:00
Neha Bhende	4639ef3763	svga/winsys: Add cap2 support in winsys Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-05 11:22:42 -06:00
Neha Bhende	6b3627da08	svga: Add SVGA3dCmdIntraSurfaceCopy command support in OpenGL driver v2: changes as per Charmaine's comment Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-05 11:22:42 -06:00
Brian Paul	bac94dfefa	svga: update device header files from upstream This is a squash commit of several earlier patches. Signed-off-by: Brian Paul <brianp@vmware.com>	2018-09-05 11:22:42 -06:00
Charmaine Lee	f4f39fa5d9	winsys/drm: Fix assert when try to accumulate an invalid fd This patch makes sure there is a valid fd before merging it to the context's fd in vmw_svga_winsys_fence_server_sync(). This fixes the assert running webot. No regression running kmscube. Reviewed-by: Sinclair Yeh <syeh@vmware.com>	2018-09-05 11:22:42 -06:00
Eric Anholt	16f17e3a3c	loader: Drop unused argument from dri3_update_drawable(). The argument has never been used since the function was added. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-09-05 10:11:27 -07:00
Alejandro Piñeiro	4e1f8d82c2	i965/fs: include multisamplers on image_intrinsic_coord_components This is the second patch needed to fix the following piglit tests: tests/spec/arb_gl_spirv/linker/uniform/multisampler.shader_test tests/spec/arb_gl_spirv/linker/uniform/multisampler-array.shader_test Although in this case it doesn't affect so many borrowed tests, as there aren't too many tests using multisamplers on Intel. It is worth to note that this patch is also needed when those tests are run on GLSL mode (using the --glsl option). Although most Intel drivers would not be able to run/execute tests using multisamplers, as GL_MAX_IMAGE_SAMPLES is zero, technically those tests are expected to link correctly, so linking tests should pass. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-05 17:02:28 +02:00
Alejandro Piñeiro	8969777686	i965: move brw_nir_lower_gl_images call At this moment that lowering is using info coming from the UniformStorage, so for the ARB_gl_spirv codepath, it needs to be done after calling gl_nir_link_uniforms. As for the GLSL codepath it can also be called later, we just move the call on both cases, to avoid adding several shader->spirv_data checks, and keep the patch as small as possible. This is the first patch needed to fix the following piglit tests: tests/spec/arb_gl_spirv/linker/uniform/multisampler.shader_test tests/spec/arb_gl_spirv/linker/uniform/multisampler-array.shader_test but fixes thousands of tests when borrowing the tests from other specs (that needs to be done manually right now). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-05 17:02:28 +02:00
Alejandro Piñeiro	2a6182fe06	intel/compiler: rename brw_nir_lower_glsl_images To brw_nir_lower_gl_images, as it will be also used on the ARB_gl_spirv codepath, that doesn't involves GLSL at all. So the lowering is about images following the OpenGL semantics. In any case "brw_nir_lower_opengl_images" seemed too long to me, so I just used gl. That shortening is already used on other parts of the code. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-05 17:02:28 +02:00
Alejandro Piñeiro	960f6459be	intel/compiler: remove unused variable num_images Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-05 17:02:28 +02:00
Gert Wollny	218ff0d510	winsys/virgl/vtest: Correct off-by-one error in resource allocation The resource bo array must already extended when the target index is equal to the current size of the array. Signed-off-by: Gert Wollny <gert.wollny@collabora.com>	2018-09-05 13:54:01 +02:00
Gert Wollny	5341260f62	winsys/virgl: Initialize value to silence valgrind Silences: Conditional jump or move depends on uninitialised value(s) at 0xB72F2C0: virgl_drm_winsys_create (virgl_drm_winsys.c:854) by 0xB72F2C0: virgl_drm_screen_create (virgl_drm_winsys.c:926) by 0xB21C885: pipe_virgl_create_screen (drm_helper.h:275) by 0xB7201F0: pipe_loader_create_screen (pipe_loader.c:137) by 0xB639C91: dri2_init_screen (dri2.c:2112) by 0xB634F68: driCreateNewScreen2 (dri_util.c:153) by 0x63023E6: dri3_create_screen (dri3_glx.c:893) by 0x62D35BD: AllocAndFetchScreenConfigs (glxext.c:820) by 0x62D35BD: __glXInitialize (glxext.c:946) by 0x62CECB3: GetGLXPrivScreenConfig (glxcmds.c:174) by 0x62CF69C: glXQueryExtensionsString (glxcmds.c:1304) by 0x60AA7D9: ??? (in /usr/lib/x86_64-linux-gnu/libwaffle-1.so.0.5.2) by 0x4F81450: wfl_checked_display_connect (piglit-util-waffle.h:74) by 0x4F829E0: piglit_wfl_framework_init (piglit_wfl_framework.c:627) Signed-off-by: Gert Wollny <gert.wollny@collabora.com>	2018-09-05 13:54:01 +02:00
Gert Wollny	9b0e8d8723	winsys/virgl: correct resource and handle allocation (v2) Fixes crash with piglit/bin/map_buffer_range-invalidate CopyBufferSubData \ increment-offset -auto -fbo * Resize the resource storage already when the count is equal to the allocated size, fixes: Invalid write of size 8 at 0xB72E4CF: virgl_drm_add_res (virgl_drm_winsys.c:629) by 0xB72E4CF: virgl_drm_emit_res (virgl_drm_winsys.c:663) by 0xB72A44A: virgl_encode_resource_copy_region (virgl_encode.c:776) by 0xB40CD12: st_copy_buffer_subdata (st_cb_bufferobjects.c:585) by 0xB244A3B: _mesa_CopyBufferSubData (bufferobj.c:2940) by 0x109A1E: upload (invalidate.c:169) by 0x109C2F: piglit_display (invalidate.c:215) by 0x4F80FBE: run_test (piglit_fbo_framework.c:52) by 0x4F66E5F: piglit_gl_test_run (piglit-framework-gl.c:229) by 0x10949D: main (invalidate.c:47) Address 0xbe07d30 is 0 bytes after a block of size 4,096 alloc'd at 0x4C31B25: calloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) by 0xB72DAAF: virgl_drm_cmd_buf_create (virgl_drm_winsys.c:567) * Also resize the space allocated for the handles, fixes: Invalid write of size 4 at 0xB72E4F0: virgl_drm_add_res (virgl_drm_winsys.c:631) by 0xB72E4F0: virgl_drm_emit_res (virgl_drm_winsys.c:663) by 0xB72A44A: virgl_encode_resource_copy_region (virgl_encode.c:776) by 0xB40CD12: st_copy_buffer_subdata (st_cb_bufferobjects.c:585) by 0xB244A3B: _mesa_CopyBufferSubData (bufferobj.c:2940) by 0x109A1E: upload (invalidate.c:169) by 0x109C2F: piglit_display (invalidate.c:215) by 0x4F80FBE: run_test (piglit_fbo_framework.c:52) by 0x4F66E5F: piglit_gl_test_run (piglit-framework-gl.c:229) by 0x10949D: main (invalidate.c:47) Address 0xbe08570 is 0 bytes after a block of size 2,048 alloc'd at 0x4C2FB0F: malloc ( in /usr/lib/valgrind/vgpreload_memcheck-amd64- linux.so) by 0xB72DAC8: virgl_drm_cmd_buf_create (virgl_drm_winsys.c:572) Fixes: `4b15b5e803` ("virgl: resize resource bo allocation if we need to.") v2: - Use REALLOC macro and avoid memory leak when re-allocation fails - add Fixes tag (both Emil Velikov) - reorder commit message Signed-off-by: Gert Wollny <gert.wollny@collabora.com>	2018-09-05 13:54:01 +02:00
Tomeu Vizoso	f13de57edb	virgl: use hw-atomics instead of in-ssbo ones Emulating atomics on top of ssbos can lead to too small max SSBO count, so let's use the hw-atomics mechanism to expose atomic buffers instead. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-09-05 05:46:58 +01:00
Erik Faye-Lund	1bd927d997	virgl: update minor differences to upstream header virgl_protocol.h is considered to have it's upstream in the virglrenderer repository, and somehow these minor differences has crept in. Let's sync with the upstream to avoid this. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-09-05 05:46:52 +01:00
Erik Faye-Lund	5a587d18d5	gallium: add PIPE_CAP_MAX_COMBINED_HW_ATOMIC_COUNTER{S,_BUFFERS} This moves the evergreen-specific max-sizes out as a driver-cap, so other drivers with less strict requirements also can use hw-atomics. Remove ssbo_atomic as it's no longer needed. We should now be able to use hw-atomics for some stages and not for other, if needed. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-09-05 05:46:46 +01:00
Erik Faye-Lund	d641d3f48b	gallium: add PIPE_CAP_MAX_COMBINED_SHADER_BUFFERS This gets rid of a r600 specific hack in the state-tracker, and prepares for other drivers to be able to use hw-atomics. While we're at it, clean up some indentation in the various drivers. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-09-05 05:46:37 +01:00
Erik Faye-Lund	84795f8c64	st/mesa: simplify MaxAtomicBufferSize-logic MaxAtomicCounters has already been assigned in the loop above in the ssbo_atomic = true case, so this will calculate the same value as the default. While we're at it, fixup indentation on the MaxAtomicBufferBindings assign. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-09-05 05:46:33 +01:00
Erik Faye-Lund	38f0c078de	st/mesa: clean up atomic vs ssbo code This makes the code a bit easier to follow; we first set up MaxShaderStorageBlocks, then we either set up a dedicated MaxAtomicBuffers, or we split MaxShaderStorageBlocks in two. While we're at it, also make the SSBO-splitting code tolerate the hypothetical case of having an odd number of SSBOs without incorrectly dropping the last SSBO. This has the nice result that the SSBOs and atomic buffers are dealt with almost completely orthogonally, easing some upcoming patches. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-09-05 05:46:27 +01:00
Erik Faye-Lund	a805e4e9de	st/mesa: use real bool for can_ubo We're doing full c99 now, so there's no point in using the old boolean type. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2018-09-05 05:46:09 +01:00
Marek Olšák	28e542dcdb	gallium/u_threaded: increase batch size to increase performance This reduces mutex overhead. radeonsi: +4.4% performance with piglit/drawoverhead, DrawElements, Ryzen X1700 iris_dri.so: +14% with piglit/drawoverhead, DrawArrays, i7 7700HQ. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2018-09-04 14:31:56 -04:00
Marek Olšák	ebd5806e0f	st/vdpau: silence an unitialized-variable warning	2018-09-04 14:01:43 -04:00
Marek Olšák	725e8ad559	st/mesa: help fix stencil border color for GL_DEPTH_STENCIL textures GL_STENCIL_INDEX uses GL_INTENSITY for the border color, which is nicer to hardware that doesn't read the stencil border value from the X channel. This fixes a bunch of dEQP tests on Vega & Raven. Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org>	2018-09-04 14:01:43 -04:00
Ernestas Kulik	d49904085a	glsl_to_tgsi: Fix potential leak Reported by Coverity: arr_live_ranges is freed in a different branch than the one in which it was allocated. Signed-off-by: Ernestas Kulik <ernestas.kulik@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-09-04 14:01:43 -04:00
Ernestas Kulik	ea1e50cc16	u_vbuf: Fix leak Reported by Coverity: data is heap-allocated, but only freed in the info->index_size != 0 branch. Signed-off-by: Ernestas Kulik <ernestas.kulik@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Cc: 18.2 <mesa-stable@lists.freedesktop.org>	2018-09-04 14:01:43 -04:00
Eric Anholt	2e59b88903	freedreno: Drop a bunch of duplicated gallium PIPE_CAP default code. Now that we have the util function for the default values, we can get rid of the boilerplate. v2: Rebase on new gallium caps Reviewed-by: Rob Clark <robdclark@gmail.com> (v1)	2018-09-04 08:08:22 -07:00
Eric Anholt	492b74b445	v3d: Drop a bunch of duplicated gallium PIPE_CAP default code. Now that we have the util function for the default values, we can get rid of the boilerplate. v2: Rebase on new gallium caps	2018-09-04 08:08:18 -07:00
Eric Anholt	c311e00000	vc4: Drop a bunch of duplicated gallium PIPE_CAP default code. Now that we have the util function for the default values, we can get rid of the boilerplate. v2: drop GLSL level in favor of defaults. v3: Rebase on new gallium caps	2018-09-04 08:08:10 -07:00
Eric Anholt	ad782a7020	gallium: Add a helper for implementing PIPE_CAP_* default values. One of the pains of implementing a gallium driver is filling in a million pipe caps you don't know about yet when you're just starting out. One of the pains of working on gallium is copy-and-pasting your new PIPE_CAP into each driver. We can fix both of these by having each driver call into the default helper from their default case, so that both sides can ignore each other until they need to. v2: fix i915g build, revert swr change to avoid breaking scons build (https://travis-ci.org/anholt/mesa/jobs/419739857) v3: Rebase on 3 new gallium caps. Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1) Cc: Bruce Cherniak <bruce.cherniak@intel.com> Cc: George Kyriazis <george.kyriazis@intel.com> Cc: Kenneth Graunke <kenneth@whitecape.org>	2018-09-04 08:07:52 -07:00
Jason Ekstrand	67571ae796	intel/compiler: Remove redundant nir_remove_dead_variables call As of `07a2098a70`, brw_nir_optimize calls nir_remove_dead_variables as the last optimization. Doing it again is just pointless. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-09-04 09:03:16 -05:00
Lionel Landwerlin	07a2098a70	intel: compiler: remove dead local variables at optimization pass We're hitting an assert in gfxbench because one of the local variable is a sampler (according to Jason this isn't valid) : testfw_app: ../src/compiler/nir_types.cpp:551: void glsl_get_natural_size_align_bytes(const glsl_type, unsigned int, unsigned int*): Assertion `!"type does not have a natural size"' failed. Since this particular variable isn't used, it can be eliminated by removing unused local variables at the end of the optimization loop. This makes sense also for valid local variables. v2: Move additional local variable removal out of optimization loop, but before large constant removal (Jason/Lionel) v3: Move the removal at the end of brw_nir_optimize() Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107806 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-09-03 17:24:19 +01:00
Andrii Simiklit	095600dad6	intel/decoder: fix the possible out of bounds group_iter The "gen_group_get_length" function can return a negative value and it can lead to the out of bounds group_iter. v2: printing of "unknown command type" was added v3: just the asserts are added Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-09-03 11:14:30 +01:00
Bas Nieuwenhuizen	233718a199	radv: Fix CMASK dimensions. Mirrors `1e40f69483` "ac/surface: fix CMASK fast clear for NPOT textures with mipmapping on SI/CI/VI" CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-09-03 09:24:30 +02:00
Bas Nieuwenhuizen	ab64891f4c	radv: Use a lower max offchip buffer count. No clue what gets fixed by this but both radeonsi and amdvlk do it. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-09-03 09:24:30 +02:00
Bas Nieuwenhuizen	4dc244eb44	radv: Add VEGA20 support. Just mirror the radeonsi bits. Since this is just adding the extra switch entries for new HW I think this should be fine for stable. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-09-03 09:24:30 +02:00
Dave Airlie	c1ba33c34b	radv: don't expose linear depth surfaces on SI/CIK/VI either. ac_surface.c: gfx6_compute_surface says /* DB doesn't support linear layouts. */ Now if we expose linear depth and create a linear depth image and use CmdCopyImage to copy into it, we can't map the underlying memory and read it linearly which I think should work. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-09-03 11:38:00 +10:00
Mauro Rossi	ac0856ae41	egl/android: do not indent HAVE_DRM_GRALLOC preprocessor directive Fixes: `3f7bca44d9` ("egl/android: #ifdef out flink name support") Fixes: `c7bb82136b` ("egl/android: Add DRM node probing and filtering") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Mauro Rossi <issor.oruam@gmail.com>	2018-09-02 11:27:08 +02:00
Jason Ekstrand	2ad9917e18	anv/blorp: Fix a comment as per Nanley's review feedback This accidentally didn't make it into `62378c5e9e`	2018-09-01 09:12:08 -05:00
Jason Ekstrand	62378c5e9e	anv/blorp: Do more flushing around HiZ clears We make the flush after a HiZ clear unconditional and add a flush/stall before the clear as well. Cc: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107760 Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2018-09-01 09:08:36 -05:00
Ian Romanick	82530ce1b5	i965/vec4: Clamp indirect tes input array reads with 0x0fffffff Page 190 of "Volume 7: 3D Media GPGPU Engine (Haswell)" says the valid range of the offset is [0, 0FFFFFFFh]. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable@lists.freedesktop.org	2018-09-01 00:23:45 -07:00
Ian Romanick	75666605c9	i965/vec4: Correctly handle uniform sources in generate_tes_add_indirect_urb_offset Fixes failure in the new piglit test tes-patch-input-array-vec2-index-invalid-rd.shader_test. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable@lists.freedesktop.org	2018-09-01 00:23:43 -07:00
Andres Gomez	adad7e3aa8	docs: update calendar to extended the 18.1 cycle by one more release Due to having 2 additional RCs for 18.2. Cc: Dylan Baker <dylan.c.baker@intel.com> Cc: Juan A. Suarez <jasuarez@igalia.com> Cc: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Acked-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Juan A. Suarez <jasuarez@igalia.com> Acked-by: Emil Velikov <emil.velikov@collabora.com>	2018-09-01 02:23:14 +03:00
Rodrigo Vivi	e8c42ed4ab	intel: Introducing Amber Lake platform Amber Lake uses the same gen graphics as Kaby Lake, including a id that were previously marked as reserved on Kaby Lake, but that now is moved to AML page. This follows the ids and approach used on kernel's commit e364672477a1 ("drm/i915/aml: Introducing Amber Lake platform") Reported-by: Timo Aaltonen <timo.aaltonen@canonical.com> Cc: José Roberto de Souza <jose.souza@intel.com> Cc: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-31 13:57:52 -07:00
Rodrigo Vivi	886a048feb	intel: aubinator: Adding missed platforms to the error message. Many new platforms got added to gen_device_name_to_pci_device_id() but the error message inside aubinator didn't reflected those changes. So syncing on the same order to be sure that we are not missing any now. Cc: Anuj Phogat <anuj.phogat@gmail.com> Cc: Matt Turner <mattst88@gmail.com> Cc: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-31 13:57:41 -07:00
Nanley Chery	904c2a617d	i965/gen7_urb: Re-emit PUSH_CONSTANT_ALLOC on some gen9 According to internal docs, some gen9 platforms have a pixel shader push constant synchronization issue. Although not listed among said platforms, this issue seems to be present on the GeminiLake 2x6's we've tested. We consider the available workarounds to be too detrimental on performance. Instead, we mitigate the issue by applying part of one of the workarounds. Re-emit PUSH_CONSTANT_ALLOC at the top of every batch (as suggested by Ken). Fixes ext_framebuffer_multisample-accuracy piglit test failures with the following options: * 6 depth_draw small depthstencil * 8 stencil_draw small depthstencil * 6 stencil_draw small depthstencil * 8 depth_resolve small * 6 stencil_resolve small depthstencil * 4 stencil_draw small depthstencil * 16 stencil_draw small depthstencil * 16 depth_draw small depthstencil * 2 stencil_resolve small depthstencil * 6 stencil_draw small * all_samples stencil_draw small * 2 depth_draw small depthstencil * all_samples depth_draw small depthstencil * all_samples stencil_resolve small * 4 depth_draw small depthstencil * all_samples depth_draw small * all_samples stencil_draw small depthstencil * 4 stencil_resolve small depthstencil * 4 depth_resolve small depthstencil * all_samples stencil_resolve small depthstencil v2: Include more platforms in WA (Ken). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106865 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93355 Cc: <mesa-stable@lists.freedesktop.org> Tested-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-31 13:19:17 -07:00
Christian Gmeiner	773d6ea6e7	imx: make use of loader_open_render_node(..) helper Gets rid of hard-coded gpu device path. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-31 21:47:13 +02:00
Christian Gmeiner	b05a8f4f41	tegra: make use loader_open_render_node(..) helper Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-31 21:46:32 +02:00
Christian Gmeiner	ab348885eb	loader: add loader_open_render_node(..) This helper is almost a 1:1 copy of tegra_open_render_node(). Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-31 21:46:03 +02:00
Christian Gmeiner	d0b09e2dfe	tegra: fix memory leak Fixes: `1755f608f5` ("tegra: Initial support") Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-31 21:45:16 +02:00
Daniel Stone	01c0aa9f05	st/dri: Don't expose sRGB formats to clients Though the SARGB8888 format is used internally through its FourCC value, it is not a real format as defined by drm_fourcc.h; it cannot be used with KMS or other interfaces expecting drm_fourcc.h format codes. Ensure we don't advertise it through the dmabuf format/modifier query interfaces, preventing us from tripping over an assert. Signed-off-by: Daniel Stone <daniels@collabora.com> Reported-by: Michel Dänzer <michel.daenzer@amd.com> Fixes: `8c1b9882b2` ("egl/dri2: Guard against invalid fourcc formats") Acked-by: Jason Ekstrand <jason.ekstrand@intel.com>	2018-08-31 18:02:42 +01:00
Samuel Pitoiset	686ec97cfb	radv: add missing support for protected memory properties Fixes Vulkan CTS CL#2849. Similar to the ANV driver. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-31 17:35:13 +02:00
Samuel Pitoiset	7355e9326b	radv: remove dead code in scan_shader_output_decl() Never used. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-31 17:34:41 +02:00
Samuel Pitoiset	e9acf069b2	radv: remove radv_shader_context::num_output_{clips,culls} Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-31 17:34:41 +02:00
Samuel Pitoiset	a6a6441c75	radv: adjust the cull dist mask in scan_shader_output_decl() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-31 17:34:41 +02:00
Samuel Pitoiset	ea778e760c	radv: get length of the clip/cull distances array from usage mask Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-31 17:34:41 +02:00
Samuel Pitoiset	732679c25e	radv: do not recompute the output usage mask for clipdist twice The shader info pass takes care of this now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-31 17:34:41 +02:00
Samuel Pitoiset	730c704f86	radv: gather the output usage mask for clip/cull distances correctly It's a special case because both are combined into a single array. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-31 17:34:41 +02:00
Samuel Pitoiset	ffe3a2a298	radv: add set_output_usage_mask() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-31 17:34:41 +02:00
Samuel Pitoiset	6f47df3129	radv: fix passing clip/cull distances from VS to PS CTS doesn't test input clip/cull distances for the fragment shader stage, which explains why this was totally broken. I wrote a simple test locally that works now. This fixes a crash with GTA V and DXVK. Note that we are exporting unused parameters from the vertex shader now, but this can't be optimized easily because we don't keep the fragment shader info... Cc: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107477 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-31 17:34:36 +02:00
Juan A. Suarez Romero	54a9622dd5	egl/wayland: do not leak wl_buffer when it is locked If color buffer is locked, do not set its wayland buffer to NULL; otherwise it can not be freed later. Rather, flag it in order to destroy it later on the release event. v2: instruct release event to unlock only or free wl_buffer too (Daniel) This also fixes dEQP-EGL.functional.swap_buffers_with_damage.* tests. CC: Daniel Stone <daniel@fooishbar.org> Reviewed-by: Daniel Stone <daniels@collabora.com>	2018-08-31 16:29:36 +02:00
Dave Airlie	2c1f249f2b	ac/radeonsi: fix CIK copy max size While adding transfer queues to radv, I started writing some tests, the first test I wrote fell over copying a buffer larger than this limit. Checked AMDVLK and found the correct limit. Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-31 15:11:49 +10:00
Dave Airlie	c9f5448695	radeonsi: fix regression in indirect input swizzles. This fixes: tests/spec/arb_enhanced_layouts/execution/component-layout/vs-fs-array-dvec3.shader_test since I reworked the 64-bit swizzles. Fixes: `bb17ae49ee` (gallivm: allow to pass two swizzles into fetches.) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-31 06:08:24 +01:00
Dave Airlie	750b829daf	radeonsi: fix tess/gs fetchs for new swizzle. I have piglit results from my machine, but I must have messed up, and not built mesa in between properly. Fixes: `bb17ae49ee` (gallivm: allow to pass two swizzles into fetches.) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-31 06:08:21 +01:00
Marek Olšák	355ed029b0	mesa: ignore VAO IDs equal to 0 in glDeleteVertexArrays This fixes a firefox crash. Fixes: `781a78914c` Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-30 22:30:28 -04:00
Kenneth Graunke	b147254d36	Revert "intel/tools/aubwrite: Always use physical addresses for traces." This reverts commit `f8cfc77660`. This appears to break intel_dump_gpu for Gen9 systems - I can load them in the simulator, but nothing happens. Reverting the patch makes the simulator properly execute our commands and shaders again.	2018-08-30 14:36:28 -07:00
Jason Ekstrand	a0f18f2142	intel/nir: Lowering image loads and stores trashes all metadata This fixes the GL_ARB_fragment_shader_interlock piglit test on gen8 platforms where the lack of metadata dirtying was causing another pass to accidentally delete a much needed loop. https://bugs.freedesktop.org/show_bug.cgi?id=107745 Fixes: `37f7983bcc` "intel/compiler: Do image load/store lowering..." Jason Ekstrand <jason@jlekstrand.net> writes: Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-30 14:06:31 -05:00
Jason Ekstrand	d9cf4308ce	i965/screen: Allow modifiers on sRGB formats This effectively reverts `a266934935` which was a misguided attempt at protecting intel_query_dma_buf_modifiers from invalid formats. Unfortunately, in some internal EGL cases, we can get an SRGB format validly in this function. Rejecting such formats caused us to not allow CCS in some cases where we should have been allowing it. This regressed the performance of some SynMark tests as well as GfxBench ALU2, Tessellation and Manhattan 3.0 tests There's some question of whether or not we really should be using SRGB "fourcc" formats that aren't actually in drm_foucc.h but there's not much harm in allowing them through here. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107223 Fixes: `a266934935` "i965/screen: Return false for unsupported..." Tested-By: Eero Tamminen <eero.t.tamminen@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-30 11:41:50 -05:00
Jason Ekstrand	8c1b9882b2	egl/dri2: Guard against invalid fourcc formats We already reject attempts to import images with invalid fourcc formats but don't really guard the queries all that well. This makes us error out in any calls to eglQueryDmaBufModifiersEXT if the given format is not a valid fourcc format. We also add an assert to ensure that drivers don't advertise any non-fourcc formats. Cc: mesa-stable@lists.freedesktop.org Tested-By: Eero Tamminen <eero.t.tamminen@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-30 11:41:50 -05:00
Jason Ekstrand	b95896f492	egl/dri2: Add a helper for the number of planes for a FOURCC format This also serves as a convenient "is this a fourcc format" check as well which we'll take advantage of in the next commit. Cc: mesa-stable@lists.freedesktop.org Tested-By: Eero Tamminen <eero.t.tamminen@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-30 11:41:50 -05:00
Jason Ekstrand	19bdc7dd0f	radv/meta: Set num_components on image_store intrinsics Now that image load/store intrinsics are variable-width, we need to set num_components accordingly. In `15d39f474b`, both glsl_to_nir and spirv_to_nir were updated to properly set num_components but radv meta was left behind. Fixes: `15d39f474b` "nir: Make image load/store intrinsics..." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-30 08:26:14 -05:00
Vicki Pfau	8c0e3f3822	gallivm: Detect VSX separately from Altivec Previously gallivm would attempt to use VSX instructions on all systems where it detected that Altivec is supported; however, VSX was added to POWER long after Altivec, causing lots of crashes on older POWER/PPC hardware, e.g. PPC Macs. By detecting VSX separately from Altivec we can automatically disable it on hardware that supports Altivec but not VSX Signed-off-by: Vicki Pfau <vi@endrift.com>	2018-08-30 06:09:49 +02:00
Ilia Mirkin	3e04c67950	nv50: bump compat glsl level to same as core Passes the compat piglits. I'm sure that there will be odd issues that aren't caught by them, but at least it should basically work. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-08-29 20:51:40 -04:00
Ilia Mirkin	a608e5cc9f	nvc0: bump compat GLSL version to match core This passes the handful of tests in piglit. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-08-29 20:51:40 -04:00
Ilia Mirkin	52a7297dc6	glsl: avoid lowering texcoord array except in simple cases With compat creeping up to geometry and tess shaders, lowering texcoord accesses/writes becomes more complicated. Since it's an optimization anyways, just avoid the complication for now. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-29 20:51:23 -04:00
Andres Gomez	3731233cba	docs: update calendar 18.2.0-rc5 is out, extend to 18.2.0-rc6 Signed-off-by: Andres Gomez <agomez@igalia.com>	2018-08-30 03:33:08 +03:00
Timothy Arceri	9c47c39687	st/mesa, gallium: add a workaround for No Mans Sky The spec seems clear this is not allowed but the Nvidia binary forces apps to add layout qualifiers so this works around the issue for No Mans Sky until the CTS can be sorted out. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-30 09:54:40 +10:00
Timothy Arceri	9ce7d79cdc	glsl: add a mechanism to allow layout qualifiers on function params The spec is quite clear this is not allowed: From Section 4.4. (Layout Qualifiers) of the GLSL 4.60 spec: "Layout qualifiers can appear in several forms of declaration. They can appear as part of an interface block definition or block member, as shown in the grammar in the previous section. They can also appear with just an interface-qualifier to establish layouts of other declarations made with that qualifier: layout-qualifier interface-qualifier ; Or, they can appear with an individual variable declared with an interface qualifier: layout-qualifier interface-qualifier declaration ;" From Section 4.10 (Memory Qualifiers) of the GLSL 4.60 spec: "Layout qualifiers cannot be used on formal function parameters, and layout qualification is not included in parameter matching." However on the Nvidia binary driver they actually fail to compile if image function params don't have a layout qualifier. This results in applications such as No Mans Sky using layout qualifiers on params. I've submitted a CTS test to expose this problem in the Nvidia driver but until that is resolved this patch will help Mesa drivers work around the issue. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-30 09:54:40 +10:00
Timothy Arceri	28a3731e3f	glsl: skip stringification in preprocessor if in unreachable branch This fixes compilation of some "No Mans Sky" shaders where the stringification happens in branches intended for DX12. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-30 09:51:57 +10:00
Bas Nieuwenhuizen	4738b6ac81	radv: Add missing checks in radv_get_image_format_properties. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-30 01:21:20 +02:00
Dave Airlie	bb17ae49ee	gallivm: allow to pass two swizzles into fetches. This hijacks the top 16-bits of swizzle, to pass in the swizzle for the second channel. This fixes handling .yx swizzles of 64-bit values. This should fixup radeonsi and llvmpipe. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107524 Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-30 00:15:40 +01:00
Timothy Arceri	3bcec6cf1c	radeonsi: enable radeonsi_zerovram for No Mans Sky Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-30 07:57:38 +10:00
Timothy Arceri	5566dd8a61	radeonsi: add radeonsi_zerovram driconfig option More and more games seem to require this so lets make it a config option. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-30 07:57:38 +10:00
Timothy Arceri	406c3d748d	radeonsi: enable GL 4.5 in compat profile Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-30 07:57:38 +10:00
Timothy Arceri	781a78914c	mesa: enable ARB_direct_state_access in compat for GL3.1+ We could enable it for lower versions of GL but this allows us to just use the existing version/extension checks that are already used by the core profile. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-30 07:57:38 +10:00
Marek Olšák	93b8b987d0	radeonsi: add a thorough clear/copy_buffer benchmark	2018-08-29 15:31:42 -04:00
Marek Olšák	5914f5bd4a	radeonsi: let internal compute dispatches tune WAVES_PER_SH	2018-08-29 15:31:42 -04:00
Marek Olšák	c5442c1165	radeonsi: add TGSI_SEMANTIC_CS_USER_DATA for reading up to 4 SGPRs with TGSI	2018-08-29 15:31:42 -04:00
Marek Olšák	d7250e4304	radeonsi: add SI_QUERY_TIME_ELAPSED_SDMA_SI for measuring DMA on SI DMA on SI doesn't support the timestamp packet, so it's emulated.	2018-08-29 15:31:42 -04:00
Marek Olšák	c359880d8b	radeonsi: add SI_QUERY_TIME_ELAPSED_SDMA for measuring SDMA performance	2018-08-29 15:31:42 -04:00
Marek Olšák	0c5429cc73	radeonsi: add flag L2_STREAM for minimal cache usage	2018-08-29 15:31:41 -04:00
Marek Olšák	8f6e06d160	gallium: add TGSI_MEMORY_STREAM_CACHE_POLICY For internal radeonsi shaders.	2018-08-29 15:31:41 -04:00
Jason Ekstrand	d8033d4083	intel/compiler: Remove surface_idx from brw_image_param Now that the drivers are lowering to surface indices themselves, we no longer need to push the surface index into the shader. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:03 -05:00
Jason Ekstrand	3cbc02e469	intel: Use TXS for image_size when we have a typed surface Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:03 -05:00
Jason Ekstrand	09f1de97a7	anv,i965: Lower away image derefs in the driver Previously, the back-end compiler turn image access into magic uniform reads and there was a complex contract between back-end compiler and driver about setting up and filling out those params. As of this commit, both drivers now lower image_deref_load_param_intel intrinsics to load_uniform intrinsics controlled by the driver and lower the other image_deref_* intrinsics to image_* intrinsics which take an actual binding table index. There are still "magic" uniforms but they are now added and controlled entirely by the driver and that contract no longer spans components. This also has the side-effect of making most image use compile-time binding table indices. Previously, all image access pulled the binding table index from a uniform. Part of the reason for this was that the magic uniforms made it difficult to decouple binding table indices from the uniforms and, since they are indexed completely differently (especially in Vulkan), it was hard to pull them apart. Now that the driver is handling both, it's trivial to decouple the two and provide actual binding table indices. Shader-db results on Kaby Lake: total instructions in shared programs: 15166872 -> 15164293 (-0.02%) instructions in affected programs: 115834 -> 113255 (-2.23%) helped: 191 HURT: 0 total cycles in shared programs: 571311495 -> 571196465 (-0.02%) cycles in affected programs: 4757115 -> 4642085 (-2.42%) helped: 73 HURT: 67 total spills in shared programs: 10951 -> 10926 (-0.23%) spills in affected programs: 742 -> 717 (-3.37%) helped: 7 HURT: 0 total fills in shared programs: 22226 -> 22201 (-0.11%) fills in affected programs: 1146 -> 1121 (-2.18%) helped: 7 HURT: 0 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:03 -05:00
Jason Ekstrand	0de003be03	nir: Add handle/index-based image intrinsics Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	3942943819	nir: Use a bitfield for image access qualifiers This commit expands the current memory access enum to contain the extra two bits provided for images. We choose to follow the SPIR-V convention of NonReadable and NonWriteable because readonly implies that you can read so readonly + writeonly doesn't make as much sense as NonReadable + NonWriteable. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	48e4fa7dd8	glsl/link,i965: Make ImageAccess four-state The GLSL spec allows you to set both the "readonly" and "writeonly" qualifiers on images to indicate that it can only be used with imageSize. However, we had no way of representing this int he linked shader and flagged it as GL_READ_ONLY. This is good from a "does it use this buffer?" perspective but not from a format and access lowering perspective. By using GL_NONE for if "readonly" and "writeonly" are both set, we can detect this case in the driver and handle it correctly. Nothing currently relies on the type of surface in the "readonly" + "writeonly" case but that's about to change. i965 is the only drier which uses the ImageAccess field and gl_bindless_image::access is currently unused. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	4289143899	intel/compiler: Use two components for 1D array image sizes Having the array length component stored in .z was a small convenience for the ISL image param filling code and an annoyance in the NIR lowering code. The only convenience of treating 1D arrays like 2D arrays in the lowering code is in the address calculation code so let's put all the complexity there as well. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	b1c414ef28	isl: Use the view array length for the image size Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	37f7983bcc	intel/compiler: Do image load/store lowering to NIR This commit moves our storage image format conversion codegen into NIR instead of doing it in the back-end. This has the advantage of letting us run it through NIR's optimizer which is pretty effective at shrinking things down. In the common case of rgba8, the number of instructions emitted after NIR is done with it is half of what it was with the lowering happening in the back-end. On the downside, the back-end's lowering is able to directly use predicates and the NIR lowering has to use IFs. Shader-db results on Kaby Lake: total instructions in shared programs: 15166910 -> 15166872 (<.01%) instructions in affected programs: 5895 -> 5857 (-0.64%) helped: 15 HURT: 0 Clearly, we don't have that much image_load_store happening in the shaders in shader-db.... Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	b217705dec	nir/types: Add a wrapper for coordinate_components Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	f2d0a2b110	anv/pipeline: Remove dead image loads in lower_input_attacnments Dead code will get rid of them eventually but it's better if they're just gone so we guarantee they won't trip up later passes. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	15d39f474b	nir: Make image load/store intrinsics variable-width Instead of requiring 4 components, this allows them to potentially use fewer. Both the SPIR-V and GLSL paths still generate vec4 intrinsics so drivers which assume 4 components should be safe. However, we want to be able to shrink them for i965. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	7cdf8f9339	nir/format_convert: Fix a bitmask in unpack_11f11f10f Fixes: `4e337b42f9` "nir/format_convert: Add pack/unpack for R11F_G11F_B10F" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	1f7be4968f	nir/format_convert: Rename pack_r11g11b10f to pack_11f11f10f This matches the unpack function. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	7bd0363d6f	nir/format_convert: Add [us]norm conversion helpers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	152fdeddbb	nir/format_convert: Rename nir_format_bitcast_uint_vec We have a name for that, it's called a uvec. This just makes the function name a bit shorter. While we're here, we also add an assert for one of the assumptions this function makes. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	7c5df52bdc	nir/format_convert: Add vec mask and sign-extend helpers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	ea4f200864	nir/format_convert: Add support for unpacking signed integers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	80c424148b	nir/opcodes: Make unpack_half_2x16_split_* variable-width There is nothing inherent about these opcodes that requires them to only take scalars. It's very convenient if we let them take vectors as well. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	d448fa3ae3	nir/algebraic: Add some max/min optimizations Found by inspection. This doesn't help much now but we'll see this pattern with images if you load UNORM and then store UNORM. Shader-db results on Kaby Lake: total instructions in shared programs: 15166916 -> 15166910 (<.01%) instructions in affected programs: 761 -> 755 (-0.79%) helped: 6 HURT: 0 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	4dd5263663	nir/algebraic: Add more extract_[iu](8\|16) optimizations This adds the "(a << N) >> M" family of mask or sign-extensions. Not a huge win right now but this pattern will soon be generated by NIR format lowering code. Shader-db results on Kaby Lake: total instructions in shared programs: 15166918 -> 15166916 (<.01%) instructions in affected programs: 36 -> 34 (-5.56%) helped: 2 HURT: 0 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Jason Ekstrand	116b47fe3c	nir/algebraic: Be more careful converting ushr to extract_u8/16 If it's not the right bit-size, it may not actually be the correct extraction. For now, we'll only worry about 32-bit versions. Fixes: `905ff86198` "nir: Recognize open-coded extract_u16" Fixes: `76289fbfa8` "nir: Recognize open-coded extract_u8" Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Sagar Ghuge	40fc4b5acd	intel/tools: new i965_disasm tool Adds a new i965 instruction disassemble tool v2: 1) fix a few nits (Matt Turner) 2) Remove i965_disasm header (Matt Turner) v3: 1) Redirect output to correct file descriptors (Matt Turner) 2) Refactor code (Matt Turner) 3) Use better formatting style (Matt Turner) Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>	2018-08-29 11:19:55 -07:00
Kenneth Graunke	8fb966688b	st/mesa: Disable blending for integer formats. Blending isn't valid for integer formats. Rather than having drivers worry about this, just disable blending in this case. This hopefully will increase hits in the CSO cache as well, by eliminating most of the meaningless fields in this case. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-29 10:51:11 -07:00
Brian Paul	18e9b4791b	svga: add missing switch cases for shadow textures This doesn't seem to make any difference in testing, but it fixes a failed assertion when dumping sm3 shaders. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-08-29 11:29:07 -06:00
Brian Paul	fb7e462c97	svga: fix vgpu9 sprite coordinate bug Setting GL_POINT_SPRITE_COORD_ORIGIN to GL_LOWER_LEFT did not work for vgpu9. We can use the rasterizer sprite_coord_enable bitfield as-is. We need to index into it using the TGSI semantic index, not the register index. This fixes the Piglit fbo-gl_pointcoord and glsl-fs-pointcoord tests. Testing done: Piglit, Mesa sprite demos Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-08-29 11:29:07 -06:00
Brian Paul	8331d69a87	svga: fix PIPE_TEXTURE_RECT/BUFFER const buffer issue The flag_rect and flag_buffer fields didn't sufficiently capture the state changes needed for those resource types. For example, if a texture binding was changed from a 500x500 rect texture to a 400x400 rect texture we didn't set SVGA_NEW_TEXTURE_CONSTS. But we need to do that to emit the new texcoord scale factors to the constant buffers. Rather than track the sizes of all bound resources, just set the flag if the resource is a rect. Same story with texture buffers. Also, since rect/buffer textures are usable with VS/GS shaders, add SVGA_NEW_TEXTURE_CONSTS to the flags we check for emitting VS/GS constants. This seems to help with XFCE / xfwm4 desktop scaling. VMware issue 2156696. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-08-29 11:29:07 -06:00
Brian Paul	46c7433da8	svga: minor improvements in svga_state_constants.c Add const qualifiers. Add 'f' suffix on floats to avoid double promotion. Remove unneeded shader type assertion since the switch statement handled it already. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-08-29 11:29:07 -06:00
Jason Ekstrand	cdea5d996e	anv: Free the app and engine name Fixes: `8c048af589` "anv: Copy the appliation info into the instance" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-29 11:24:57 -05:00
Rhys Kidd	f7d0c112cb	nv50/ir: silence partitionLoadStore() unused function warning Move this now-unused function into the existing comment block, which was its only prior use. ../../../../../src/gallium/drivers/nouveau/codegen/nv50_ir_from_tgsi.cpp:2645:1: warning: unused function 'partitionLoadStore' [-Wunused-function] partitionLoadStore(uint8_t comp[2], uint8_t size[2], uint8_t mask) Fixes: ("86e4440361 nouveau: codegen: Disable more old resource handling code") Signed-off-by: Rhys Kidd <rhyskidd@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-08-29 08:59:27 -04:00
vadym.shovkoplias	966a797e43	glsl/linker: Link all out vars from a shader objects on a single stage During intra stage linking some out variables can be dropped because it is not used in a shader with the main function. But these out vars can be referenced on later stages which can lead to further linking errors. Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105731	2018-08-29 20:03:56 +10:00
Lionel Landwerlin	5a1c23d150	anv: blorp: support multiple aspect blits Newer blit tests are enabling depth&stencils blits. We currently don't support it but can do by iterating over the aspects masks (copy some logic from the CopyImage function). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9f44745eca` ("anv: Use blorp to implement VkBlitImage") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-29 10:31:06 +01:00
Tapani Pälli	a72dbc461b	mesa: allow GL_UNSIGNED_BYTE type for SNORM reads OpenGL ES spec states: "For normalized fixed-point rendering surfaces, the combination format RGBA and type UNSIGNED_BYTE is accepted." This fixes following failing VK-GL-CTS tests: KHR-GLES3.packed_pixels.pbo_rectangle.rgba8_snorm KHR-GLES3.packed_pixels.rectangle.rgba8_snorm KHR-GLES3.packed_pixels.varied_rectangle.rgba8_snorm Signed-off-by: Tapani Pälli <tapani.palli@intel.com> https://bugs.freedesktop.org/show_bug.cgi?id=107658 Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Andres Gomez <agomez@igalia.com>	2018-08-29 09:26:23 +03:00
Timothy Arceri	5db981952a	nir: add loop unroll support for wrapper loops This adds support for unrolling the classic do { // ... } while (false) that is used to wrap multi-line macros. GLSL IR also wraps switch statements in a loop like this. shader-db results IVB: total loops in shared programs: 2515 -> 2512 (-0.12%) loops in affected programs: 33 -> 30 (-9.09%) helped: 3 HURT: 0 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-29 16:02:05 +10:00
Timothy Arceri	0f450b57a1	nir/opt_loop_unroll: Remove unneeded phis if we make progress Now that SSA values can be derefs and they have special rules, we have to be a bit more careful about our LCSSA phis. In particular, we need to clean up in case LCSSA ended up creating a phi node for a deref. This avoids validation issues with some CTS tests with the following patch, but its possible this we could also see the same problem with the existing unrolling passes. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-29 16:02:05 +10:00
Timothy Arceri	5a6b04d94b	nir: add complex_loop bool to loop info In order to be sure loop_terminator_list is an accurate representation of all the jumps in the loop we need to be sure we didn't encounter any other complex behaviour such as continues, nested breaks, etc during analysis. This will be used in the following patch. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-29 16:02:05 +10:00
Timothy Arceri	fef6325e58	nir: always attempt to find loop terminators This will help later patches with unrolling loops that end with a break i.e. loops the always exit on their first interation. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-29 16:02:05 +10:00
Marek Olšák	1e40f69483	ac/surface: fix CMASK fast clear for NPOT textures with mipmapping on SI/CI/VI This fixes VM faults and corruption. Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-28 19:51:51 -04:00
Ian Romanick	c836326a29	i965/vec4: Emit BRW_AOP_INC or BRW_AOP_DEC for atomicAdd of +1 or -1 No shader-db changes on any Intel platform. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-28 15:35:50 -07:00
Ian Romanick	c856403868	i965/fs: Emit BRW_AOP_INC or BRW_AOP_DEC for imageAtomicAdd of +1 or -1 v2: Refactor selection of atomic opcode to a separate function. Suggested by Jason. No changes on any other Intel platforms. Skylake total instructions in shared programs: 14304261 -> 14304241 (<.01%) instructions in affected programs: 1625 -> 1605 (-1.23%) helped: 4 HURT: 0 helped stats (abs) min: 1 max: 8 x̄: 5.00 x̃: 5 helped stats (rel) min: 1.01% max: 14.29% x̄: 5.86% x̃: 4.07% 95% mean confidence interval for instructions value: -10.66 0.66 95% mean confidence interval for instructions %-change: -15.91% 4.19% Inconclusive result (value mean confidence interval includes 0). total cycles in shared programs: 527531226 -> 527531194 (<.01%) cycles in affected programs: 92204 -> 92172 (-0.03%) helped: 2 HURT: 0 Haswell and Broadwell had similar results. (Broadwell shown) total instructions in shared programs: 14615730 -> 14615710 (<.01%) instructions in affected programs: 1838 -> 1818 (-1.09%) helped: 4 HURT: 0 helped stats (abs) min: 1 max: 8 x̄: 5.00 x̃: 5 helped stats (rel) min: 0.89% max: 13.04% x̄: 5.37% x̃: 3.78% 95% mean confidence interval for instructions value: -10.66 0.66 95% mean confidence interval for instructions %-change: -14.59% 3.85% Inconclusive result (value mean confidence interval includes 0). Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-28 15:35:46 -07:00
Ian Romanick	b6e247cf0e	i965/fs: Refactor image atomics to be a bit more like other atomics This greatly simplifies the next patch. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-28 15:35:46 -07:00
Ian Romanick	fabe3ead57	i965/fs: Emit BRW_AOP_INC or BRW_AOP_DEC for atomicAdd of +1 or -1 Funny story... a single shader was hurt for instructions, spills, fills. That same shader was also the most helped for cycles. #GPUsAreWeird No changes on any other Intel platform. v2: Refactor selection of atomic opcode to a separate function. Suggested by Jason. Haswell, Broadwell, and Skylake had similar results. (Skylake shown) total instructions in shared programs: 14304116 -> 14304261 (<.01%) instructions in affected programs: 12776 -> 12921 (1.13%) helped: 19 HURT: 1 helped stats (abs) min: 1 max: 16 x̄: 2.32 x̃: 1 helped stats (rel) min: 0.05% max: 7.27% x̄: 0.92% x̃: 0.55% HURT stats (abs) min: 189 max: 189 x̄: 189.00 x̃: 189 HURT stats (rel) min: 4.87% max: 4.87% x̄: 4.87% x̃: 4.87% 95% mean confidence interval for instructions value: -12.83 27.33 95% mean confidence interval for instructions %-change: -1.57% 0.31% Inconclusive result (value mean confidence interval includes 0). total cycles in shared programs: 527552861 -> 527531226 (<.01%) cycles in affected programs: 1459195 -> 1437560 (-1.48%) helped: 16 HURT: 2 helped stats (abs) min: 2 max: 21328 x̄: 1353.69 x̃: 6 helped stats (rel) min: 0.01% max: 5.29% x̄: 0.36% x̃: 0.03% HURT stats (abs) min: 12 max: 12 x̄: 12.00 x̃: 12 HURT stats (rel) min: 0.03% max: 0.03% x̄: 0.03% x̃: 0.03% 95% mean confidence interval for cycles value: -3699.81 1295.92 95% mean confidence interval for cycles %-change: -0.94% 0.30% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 8025 -> 8033 (0.10%) spills in affected programs: 208 -> 216 (3.85%) helped: 1 HURT: 1 total fills in shared programs: 10989 -> 11040 (0.46%) fills in affected programs: 444 -> 495 (11.49%) helped: 1 HURT: 1 Ivy Bridge total instructions in shared programs: 11709181 -> 11709153 (<.01%) instructions in affected programs: 3505 -> 3477 (-0.80%) helped: 3 HURT: 0 helped stats (abs) min: 1 max: 23 x̄: 9.33 x̃: 4 helped stats (rel) min: 0.11% max: 1.16% x̄: 0.63% x̃: 0.61% total cycles in shared programs: 254741126 -> 254738801 (<.01%) cycles in affected programs: 919067 -> 916742 (-0.25%) helped: 3 HURT: 0 helped stats (abs) min: 21 max: 2144 x̄: 775.00 x̃: 160 helped stats (rel) min: 0.03% max: 0.90% x̄: 0.32% x̃: 0.03% total spills in shared programs: 4536 -> 4533 (-0.07%) spills in affected programs: 40 -> 37 (-7.50%) helped: 1 HURT: 0 total fills in shared programs: 4819 -> 4813 (-0.12%) fills in affected programs: 94 -> 88 (-6.38%) helped: 1 HURT: 0 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> [v1] Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-28 15:35:38 -07:00
Ian Romanick	41399f4bc7	intel/compiler: Silence unused parameter warnings in brw_eu.h All of the other brw__desc functions take a devinfo parameter, and all of the others at least have an assert that uses it. Keep the parameter, but mark it as unused. Silences 37 warnings like: In file included from src/intel/common/gen_disasm.c:27:0: src/intel/compiler/brw_eu.h: In function ‘brw_pixel_interp_desc’: src/intel/compiler/brw_eu.h:377:53: warning: unused parameter ‘devinfo’ [-Wunused-parameter] brw_pixel_interp_desc(const struct gen_device_info devinfo, ^~~~~~~ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-28 15:35:38 -07:00
Sagar Ghuge	56574f4df3	i965: enable AMD_depth_clamp_separate Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-28 12:57:27 -07:00
Sagar Ghuge	e6adea0dc0	i965: add functional changes for AMD_depth_clamp_separate Gen >= 9 have ability to control clamping of depth values separately at near and far plane. z_w is clamped to the range [min(n,f), 0] if clamping at near plane is enabled, [0, max(n,f)] if clamping at far plane is enabled and [min(n,f) max(n,f)] if clamping at both plane is enabled. v2: 1) Use better coding style (Ian Romanick) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-28 12:57:27 -07:00
Sagar Ghuge	2765749e0f	mesa: add EXTRA_EXT for AMD_depth_clamp_separate Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-28 12:57:27 -07:00
Sagar Ghuge	2770446740	mesa: add support for GL_AMD_depth_clamp_separate tokens _mesa_set_enable() and _mesa_IsEnabled() extended to accept new two tokens GL_DEPTH_CLAMP_NEAR_AMD and GL_DEPTH_CLAMP_FAR_AMD. v2: Remove unnecessary parentheses (Marek Olsak) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-28 12:57:27 -07:00
Sagar Ghuge	5650d39978	mesa: Add support for AMD_depth_clamp_separate Enable _mesa_PushAttrib() and _mesa_PopAttrib() to handle GL_DEPTH_CLAMP_NEAR_AMD and GL_DEPTH_CLAMP_FAR_AMD tokens. Remove DepthClamp, because DepthClampNear + DepthClampFar replaces it, as suggested by Marek Olsak. Driver that enables AMD_depth_clamp_separate will only ever look at DepthClampNear and DepthClampFar, as suggested by Ian Romanick. v2: 1) Remove unnecessary parentheses (Marek Olsak) 2) if AMD_depth_clamp_separate is unsupported, TEST_AND_UPDATE GL_DEPTH_CLAMP only (Marek Olsak) 3) Clamp against near and far plane separately (Marek Olsak) 4) Clip point separately for near and far Z clipping plane (Marek Olsak) v3: Clamp raster position zw to the range [min(n,f), 0] for near plane and [0, max(n,f)] for far plane (Marek Olsak) v4: Use MIN2 and MAX2 instead of CLAMP (Marek Olsak) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-28 12:57:27 -07:00
Sagar Ghuge	379949b967	mesa: Add types for AMD_depth_clamp_separate. Add some basic types and storage for the AMD_depth_clamp_separate extension. v2: 1) Drop unnecessary definition (Marek Olsak) 2) Expose extension in compatibility profile (Marek Olsak) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-28 12:57:27 -07:00
Sagar Ghuge	f663fb5487	glapi: define AMD_depth_clamp_separate Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-28 12:57:27 -07:00
Jason Ekstrand	c92a463d23	anv: Claim to support depthBounds for ID games Cc: "18.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-28 13:05:54 -05:00
Jason Ekstrand	8c048af589	anv: Copy the appliation info into the instance Cc: "18.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-28 13:05:54 -05:00
Jason Ekstrand	4ffb575da5	vulkan/alloc: Add a vk_strdup helper Cc: "18.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-28 13:05:54 -05:00
Dylan Baker	7c00db9527	meson: Actually load translation files Currently we run the script but don't actually load any files, even in a tarball where they exist. Fixes: `3218056e0e` ("meson: Build i965 and dri stack") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-28 08:51:05 -07:00
Caio Marcelo de Oliveira Filho	f172a77dd8	nir: Remove outdated comment Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-28 08:11:03 -07:00
Kevin Rogovin	03ecec9ed2	i965: Add INTEL_fragment_shader_ordering support. Adds suppport for INTEL_fragment_shader_ordering. We achieve the fragment ordering by using the same instruction as for beginInvocationInterlockARB() which is by issuing a memory fence via sendc. Signed-off-by: Kevin Rogovin <kevin.rogovin@intel.com> Reviewed-by: Plamena Manolova <plamena.manolova@intel.com>	2018-08-28 17:15:10 +03:00
Kevin Rogovin	119435c877	mesa: Add GL/GLSL plumbing for INTEL_fragment_shader_ordering This extension provides new GLSL built-in function beginFragmentShaderOrderingIntel() that guarantees (taking wording of GL_INTEL_fragment_shader_ordering extension) that any memory transactions issued by shader invocations from previous primitives mapped to same xy window coordinates (and same sample when per-sample shading is active), complete and are visible to the shader invocation that called beginFragmentShaderOrderingINTEL(). One advantage of INTEL_fragment_shader_ordering over ARB_fragment_shader_interlock is that it provides a function that operates as a memory barrie (instead of a defining a critcial section) that can be called under arbitary control flow from any function (in contrast the begin/end of ARB_fragment_shader_interlock may only be called once, from main(), under no control flow. Signed-off-by: Kevin Rogovin <kevin.rogovin@intel.com> Reviewed-by: Plamena Manolova <plamena.manolova@intel.com>	2018-08-28 17:15:10 +03:00
Andrii Simiklit	1b0df8a460	i965/gen6/xfb: handle case where transform feedback is not active When the SVBI Payload Enable is false I guess the register R1.4 which contains the Maximum Streamed Vertex Buffer Index is filled by zero and GS stops to write transform feedback when the transform feedback is not active. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107579 Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-08-28 15:32:45 +02:00
Rhys Perry	743e11c10b	docs: add forgotten features to 18.2.0 release notes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewied-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 18.2: <mesa-stable@lists.freedesktop.org>	2018-08-28 13:50:51 +01:00
Erik Faye-Lund	a4e60ccb56	virgl: add debug-switch to output TGSI This is quite useful for debugging shader-transpiling issues in virglrenderer. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2018-08-28 14:13:43 +02:00
Erik Faye-Lund	4ab06cc56e	virgl: introduce $VIRGL_DEBUG=verbose This adds an environment-varaible that can be used for driver-specific flags, as well as a flag for it to enable verbose output. While we're at it, quiet some overly chatty debug-output by default. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2018-08-28 14:13:43 +02:00
Erik Faye-Lund	1b2444dffc	virgl: replace fprintf-call with debug_printf This is the only direct call-site for fprintf in virgl; all other call-sites call debug_printf instead. So let's follow in style here. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2018-08-28 14:13:43 +02:00
Erik Faye-Lund	2ebfa90abe	virgl: delete commented out fprintf-call This is just debug-cruft left over. Let's just get rid of it. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2018-08-28 14:13:43 +02:00
Guido Günther	9de34b4dde	meson: Don't enable any vulkan drivers on arm, aarch64 There's no Vulkan support for arm atm. Signed-off-by: Guido Günther <guido.gunther@puri.sm> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-27 11:32:04 -07:00
Guido Günther	05e2fc6860	meson: Be a bit more helpful when arch or OS is unknown V2: Add one missing @0@ Signed-off-by: Guido Günther <guido.gunther@puri.sm> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-27 11:31:52 -07:00
Sagar Ghuge	a1e3305f75	intel/eu: print bytes instead of 32 bit hex value INTEL_DEBUG=hex prints 32 bit hex value and due to endianness of CPU byte order is reversed. In order to disassemble binary files, print each byte instead of 32 bit hex value. v2: Print blank spaces in order to vertically align output of compacted instructions hex value with uncompacted instructions hex value. (Matt Turner) v3: Fix line wrap at correct length Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-08-27 11:07:39 -07:00
Lionel Landwerlin	440a988bd1	intel: decoder: handle 0 sized structs Gen7.5 has a BLEND_STATE of size 0 which includes a variable length group. We did not deal with that very well, leading to an endless loop. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107544 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-27 18:33:18 +01:00
Rhys Perry	e56e600bd3	nv50/ir,nvc0: use constant buffers for compute when possible on Kepler+ Gives a +7.79% increase in FPS with Hitman on lowest quality settings on my GTX 1060. total instructions in shared programs : 5787979 -> 5748677 (-0.68%) total gprs used in shared programs : 669901 -> 669373 (-0.08%) total shared used in shared programs : 548832 -> 548832 (0.00%) total local used in shared programs : 21068 -> 21064 (-0.02%) local shared gpr inst bytes helped 1 0 152 274 274 hurt 0 0 0 0 0 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-08-27 14:23:42 +01:00
Rhys Perry	d27c791891	nv50/ir: optimize multiplication by 16-bit immediates into two xmads Rather than the usual three that would be created. total instructions in shared programs : 5796385 -> 5786560 (-0.17%) total gprs used in shared programs : 670103 -> 669968 (-0.02%) total shared used in shared programs : 548832 -> 548832 (0.00%) total local used in shared programs : 21164 -> 21068 (-0.45%) local shared gpr inst bytes helped 1 0 64 1040 1040 hurt 0 0 27 0 0 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-08-27 13:57:11 +01:00
Rhys Perry	400a4eb964	nv50/ir: optimize near power-of-twos into shladd total instructions in shared programs : 5819319 -> 5796385 (-0.39%) total gprs used in shared programs : 670571 -> 670103 (-0.07%) total shared used in shared programs : 548832 -> 548832 (0.00%) total local used in shared programs : 21164 -> 21164 (0.00%) local shared gpr inst bytes helped 0 0 318 1758 1758 hurt 0 0 63 0 0 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-08-27 13:57:01 +01:00
Rhys Perry	2f52925f5c	nv50/ir: move a * b -> a << log2(b) code into createMul() With this commit, OP_MAD is handled on nv50 too. This commit is also useful for later commits. Also, instead of creating a shladd, it relies on LateAlgebraicOpt to create one. This simplifies the code and helps shader-db slightly overall. total instructions in shared programs : 5820882 -> 5819319 (-0.03%) total gprs used in shared programs : 670595 -> 670571 (-0.00%) total shared used in shared programs : 548832 -> 548832 (0.00%) total local used in shared programs : 21164 -> 21164 (0.00%) local shared gpr inst bytes helped 0 0 18 230 230 hurt 0 0 8 263 263 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-08-27 13:56:47 +01:00
Rhys Perry	b60bc7a4ab	nv50/ir: optimize imul/imad to xmads This hits the shader-db numbers a good bit, though a few xmads is way faster than an imul or imad and the cost is mitigated by the next commit, which optimizes many multiplications by immediates into shorter and less register heavy instructions than the xmads. total instructions in shared programs : 5768871 -> 5820882 (0.90%) total gprs used in shared programs : 669919 -> 670595 (0.10%) total shared used in shared programs : 548832 -> 548832 (0.00%) total local used in shared programs : 21068 -> 21164 (0.46%) local shared gpr inst bytes helped 0 0 38 0 0 hurt 1 0 365 3076 3076 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-08-27 13:56:44 +01:00
Rhys Perry	bcbcdf8448	gm107/ir: add support for OP_XMAD on GM107+ v4: make the immediate field 16 bits v5: don't ever emit h1 flags for immediates Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-08-27 13:56:41 +01:00
Rhys Perry	5d6952d2de	nv50/ir: add preliminary support for OP_XMAD v4: remove uint16_t(...) v4: don't allow immediates outside [0,65535] in insnCanLoad() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-08-27 13:56:36 +01:00
vadym.shovkoplias	4a8444d5bc	glsl/linker: Allow unused in blocks which are not declated on previous stage >From Section 4.3.4 (Inputs) of the GLSL 1.50 spec: "Only the input variables that are actually read need to be written by the previous stage; it is allowed to have superfluous declarations of input variables." Fixes: * interstage-multiple-shader-objects.shader_test v2: Update comment in ir.h since the usage of "used" field has been extended. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101247 Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-27 12:13:53 +02:00
Jason Ekstrand	07a227f543	nir: Pull block_ends_in_jump into nir.h We had two different implementations in different files. May as well have one and put it in nir.h. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-27 02:15:38 -05:00
Samuel Iglesias Gonsálvez	59a8e0dbf8	anv: Add support for protected memory properties on anv_GetPhysicalDeviceProperties2() VkPhysicalDeviceProtectedMemoryProperties structure is new on Vulkan 1.1. Fixes Vulkan CTS CL#2849. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-27 09:07:52 +02:00
Jason Ekstrand	aad501f15e	intel/tools: Add 0x in front of a couple of hex values Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-25 18:47:08 -05:00
Jason Ekstrand	76b0e4d8c9	anv: Fill holes in the VF VUE to zero This fixes a GPU hang in DOOM 2016 running under wine. Cc: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104809 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-25 18:47:08 -05:00
Kai Wasserbäch	b2313ef4a8	intel: tools: Fix aubinator_error's fprintf call (format-security) The recent commit `4616639b49` introduced the new function aubinator_error() which is a trivial wrapper around fprintf() to STDERR. The call to fprintf() however is passed the message msg directly: fprintf(stderr, msg); This is a format-security violation and leads to an FTBFS with -Werror=format-security (GCC 8): ../../../src/intel/tools/aubinator.c: In function 'aubinator_error': ../../../src/intel/tools/aubinator.c:74:4: error: format not a string literal and no format arguments [-Werror=format-security] fprintf(stderr, msg); ^~~~~~~ This patch fixes this trivially by introducing a catch-all "%s" format argument. Fixes: `4616639b49` ("intel: tools: split aub parsing from aubinator") Cc: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-25 16:52:12 +01:00
Jason Ekstrand	70de31d0c1	intel/batch_decoder: Print blend states properly Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-25 07:50:45 -05:00
Jason Ekstrand	cbd4bc1346	intel/batch_decoder: Fix dynamic state printing Instead of printing addresses like everyone else, we were accidentally printing the offset from state base address. Also, state_map is a void pointer so we were incrementing in bytes instead of dwords and every state other than the first was wrong. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-25 07:50:43 -05:00
Jason Ekstrand	d1971be6ea	intel/decoder: Print ISL formats for vertex elements Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-25 07:50:40 -05:00
Jason Ekstrand	2abd7ae189	intel/decoder: Clean up field iteration and fix sub-dword fields First of all, setting iter->name in advance_field is unnecessary because it gets set by gen_decode_field which gets called immediately after gen_decode_field in the one call-site. Second, we weren't properly initializing start_bit and end_bit in the initial condition of gen_field_iterator_next so the first field of a struct would get printed wrong if it doesn't start on the first bit. This is fixed by adding a iter_start_field helper which sets the field and also sets up the other bits we need. This fixes decoding of 3DSTATE_SBE_SWIZ. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-25 07:50:36 -05:00
Kenneth Graunke	1281608849	gallium: Split out PIPE_CAP_TEXTURE_MIRROR_CLAMP_TO_EDGE. Some hardware can do PIPE_TEX_WRAP_MIRROR_REPEAT but not PIPE_TEX_WRAP_MIRROR_CLAMP and PIPE_TEX_WRAP_MIRROR_CLAMP_TO_BORDER. Drivers for such hardware would like to advertise support for ARB_texture_mirror_clamp_to_edge but not EXT_texture_mirror_clamp. This commit adds a new PIPE_CAP_TEXTURE_MIRROR_CLAMP_TO_EDGE bit, changes the extension enable to be based on that, and enables it in all upstream drivers which supported PIPE_CAP_TEXTURE_MIRROR_CLAMP (so they continue supporting this mode).	2018-08-24 17:25:36 -07:00
Lionel Landwerlin	f430a37fa7	intel: decoder: unify MI_BB_START field naming The batch decoder looks for a field with a particular name to decide whether an MI_BB_START leads into a second batch buffer level. Because the names are different between Gen7.5/8 and the newer generation we fail that test and keep on reading (invalid) instructions. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107544 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-24 23:10:08 +01:00
Dylan Baker	7f745c19c1	docs: Update calendar, news, relnotes for 18.1.7	2018-08-24 09:35:24 -07:00
Dylan Baker	82c2e7bf9e	docs: Add mesa 18.1.7 notes	2018-08-24 09:34:03 -07:00
Dylan Baker	2d8569073e	docs: Add mesa 18.1.7 docs	2018-08-24 09:33:59 -07:00
Andres Gomez	0d3bb146a8	docs: update calendar 18.2.0-rc4 is out, extend to 18.2.0-rc5 Signed-off-by: Andres Gomez <agomez@igalia.com>	2018-08-24 18:58:00 +03:00
Kevin Rogovin	e345247092	docs/relnotes: Mark NV_fragment_shader_interlock support in i965 Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-24 08:59:54 -05:00
Emil Velikov	081395e99d	egl/drm: use gbm_dri_bo() wrapper Remove the explicit cast, using the appropriate wrapper instead. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Daniel Stone <daniels@collabora.com>	2018-08-24 11:53:24 +01:00
Emil Velikov	7b4269a5e0	egl/drm: use gbm_dri_surface() wrapper Remove the explicit cast, using the appropriate wrapper instead. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Daniel Stone <daniels@collabora.com>	2018-08-24 11:53:20 +01:00
Emil Velikov	7eb4a28d41	egl/drm: use gbm_dri_device() wrapper Remove the explicit cast, using the appropriate wrapper instead. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Daniel Stone <daniels@collabora.com>	2018-08-24 11:52:48 +01:00
Emil Velikov	2c049384b1	egl/android: simplify device open/probe Currently droid_probe_device, does not do any 'probing' but filtering out a device if it doesn't match the vendor string given. Rename the function, straighten the return type and call it only as needed - an actual vendor string is provided. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Tomasz Figa <tfiga@chromium.org>	2018-08-24 11:52:44 +01:00
Emil Velikov	2f8403a4ca	egl/android: remove drmVersion::name NULL check The name string is guaranteed to be non-NULL. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Tomasz Figa <tfiga@chromium.org>	2018-08-24 11:52:41 +01:00
Emil Velikov	d1211f3112	egl/android: remove droid_probe_driver() The function name is misleading - it effectively checks if loader_get_driver_for_fd fails. Which can happen only only on strdup error - a close to impossible scenario. Drop the function - we call the loader API at at later stage. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Tomasz Figa <tfiga@chromium.org>	2018-08-24 11:52:39 +01:00
Emil Velikov	9b5bf7afce	egl/android: use strcmp with drmVersion::name The name string is guaranteed to be NULL terminated. Drop the explicit length check that comes with strncmp(). Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Tomasz Figa <tfiga@chromium.org>	2018-08-24 11:52:37 +01:00
Emil Velikov	3827966643	egl/android: use drmDevice instead of the manual /dev/dri iteration Replace the manual handling of /dev/dri in favor of the drmDevice API. The latter provides a consistent way of enumerating the devices, providing device details as needed. v2: - Use ARRAY_SIZE (Frank) - s/famour/favor/ typo (Frank) - Make MAX_DRM_DEVICES a macro - fix vla errors (RobF) - Remove left-over dev_path instance (RobF) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Robert Foss <robert.foss@collabora.com> (v1) Reviewed-by: Tomasz Figa <tfiga@chromium.org>	2018-08-24 11:50:36 +01:00
Emil Velikov	cff80b6c15	Revert "configure: allow building with python3" This reverts commit `ae7898dfdb`. Turns out the python scripts are _not_ fully python 3 compatible. As Ilia reported using get_xmlpool.py with LANG=C produces some weird output - see the link for details. Even though the issue was spotted with the autoconf build, it exposes a genuine problem with the script (and lack of lang handling of the meson build.) https://lists.freedesktop.org/archives/mesa-dev/2018-August/203508.html	2018-08-24 11:14:15 +01:00
Emil Velikov	7a4d2d1fdf	Revert "travis: use python3 for the autoconf builds" This reverts commit `855af9a5a2`. Turns out the python scripts are _not_ fully python 3 compatible. As Ilia reported using get_xmlpool.py with LANG=C produces some weird output - see the link for details. Even though the issue was spotted with the autoconf build, it exposes a genuine problem with the script (and lack of lang handling of the meson build.) https://lists.freedesktop.org/archives/mesa-dev/2018-August/203508.html	2018-08-24 11:10:24 +01:00
Kenneth Graunke	93e8e17fa4	Revert "mesa: bump GL_MAX_ELEMENTS_INDICES and GL_MAX_ELEMENTS_VERTICES" This reverts commit `095515e16c`. This breaks KHR-GL46.map_buffer_alignment.functional on i965. This code was apparently not reviewed and I don't know why we would move from a driver configurable constant to a hardcoded value for all drivers. This really looks like an accidental hack push.	2018-08-24 00:36:01 -07:00
Kenneth Graunke	9d670fd86c	Revert recent changes about not including compute in combined limits. As far as I can tell, no one reviewed these changes, they made i965 assert fail on driver load, and I am not certain they are correct. (Hopefully reverting these does not break radeonsi too badly...) The uniform related changes seem fine and reasonable, but the texture image units change is possibly incorrect. According to the OES_tessellation_shader spec issue 5: (5) How are aggregate shader limits computed? RESOLVED: Following the GL 4.4 model, but we restrict uniform buffer bindings to 12/stage instead of 14, this results in MAX_UNIFORM_BUFFER_BINDINGS = 72 This is 12 bindings/stage * 6 shader stages, allowing a static partitioning of the bindings even though at most 5 stages can appear in a program object). MAX_COMBINED_UNIFORM_BLOCKS = 60 This is 12 blocks/stage * 5 stages, since compute shaders can't be mixed with other stages. MAX_COMBINED_TEXTURE_IMAGE_UNITS = 96 This is 16 textures/stage * 6 stages. which definitely is including compute shaders in that last limit. Not including compute shaders breaks the following test: dEQP-GLES31.functional.state_query.integer.max_combined_texture_image_units_getinteger There was enough breakage that I figured we should just send this back to the drawing board. Revert "i965: don't include compute resources in "Combined" limits" Revert "st/mesa: don't include compute resources in "Combined" limits" Revert "mesa: don't include compute resources in MAX_COMBINED_* limits" This reverts commit `b03dcb1e5f`. This reverts commit `cff290df4c`. This reverts commit `45f87a48f9`.	2018-08-24 00:36:01 -07:00
Roland Scheidegger	8e1be9a34a	gallivm: don't use saturated unsigned add/sub intrinsics for llvm 8.0 These have been removed. Unfortunately auto-upgrade doesn't work for jit. (Worse, it seems we don't get a compilation error anymore when compiling the shader, rather llvm will just do a call to a null function in the jitted shaders making it difficult to detect when intrinsics vanish.) Luckily the signed ones are still there, I helped convincing llvm removing them is a bad idea for now, since while the unsigned ones have sort of agreed-upon simplest patterns to replace them with, this is not the case for the signed ones, and they require _significantly_ more complex patterns - to the point that the recognition is IMHO probably unlikely to ever work reliably in practice (due to other optimizations interfering). (Even for the relatively trivial unsigned patterns, llvm already added test cases where recognition doesn't work, unsaturated add followed by saturated add may produce atrocious code.) Nevertheless, it seems there's a serious quest to squash all cpu-specific intrinsics going on, so I'd expect patches to nuke them as well to resurface. Adapt the existing fallback code to match the simple patterns llvm uses and hope for the best. I've verified with lp_test_blend that it does produce the expected saturated assembly instructions. Though our cmp/select build helpers don't use boolean masks, but it doesn't seem to interfere with llvm's ability to recognize the pattern. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106231 Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-08-24 07:50:13 +02:00
Marek Olšák	45b5f5fa25	st/mesa: expose KHR_texture_compression_astc_sliced_3d This is ASTC 2D LDR allowing texture arrays and 3D, compressing each slice as a separate 2D image. Tested by piglit. Trivial.	2018-08-24 00:36:18 -04:00
Marek Olšák	dae4cf397d	st/mesa: expose EXT_disjoint_timer_query same cap as ARB_timer_query, no changes needed, tested by piglit	2018-08-24 00:36:18 -04:00
Marek Olšák	263c962cfd	mesa: expose EXT_vertex_attrib_64bit because the closed driver exposes it. It's the same as the ARB extension. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-24 00:36:18 -04:00
Marek Olšák	5c90091036	mesa: expose AMD_query_buffer_object it's a subset of the ARB extension. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-24 00:36:18 -04:00
Marek Olšák	056b9a5a36	mesa: expose AMD_multi_draw_indirect because the closed driver exposes it. This is equivalent to the ARB extension. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-24 00:36:18 -04:00
Marek Olšák	b3c17330e6	mesa: expose AMD_gpu_shader_int64 because the closed driver exposes it. It's equivalent to ARB_gpu_shader_int64. In this patch, I did everything the same as we do for ARB_gpu_shader_int64. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-24 00:36:18 -04:00
Marek Olšák	1cf3631b9c	mesa: expose ARB_post_depth_coverage in the Compatibility profile It only contains GLSL changes. v2: allow the layout qualifier on GLSL <= 1.30	2018-08-24 00:36:18 -04:00
Jason Ekstrand	8d8222461f	intel/nir: Enable nir_opt_find_array_copies We have to be a bit careful with this one because we want it to run in the optimization loop but only in the first brw_nir_optimize call. Later calls assume that we've lowered away copy_deref instructions and we don't want to introduce any more. Shader-db results on Kaby Lake: total instructions in shared programs: 15176942 -> 15176942 (0.00%) instructions in affected programs: 0 -> 0 helped: 0 HURT: 0 In spite of the lack of any shader-db improvement, this patch completely eliminates spilling in the Batman: Arkham City tessellation shaders. This is because we are now able to detect that the temporary array created by DXVK for storing TCS inputs is a copy of the input arrays and use indirect URB reads instead of making a copy of 4.5 KiB of input data and then indirecting on it with if-ladders. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-23 21:47:51 -05:00
Jason Ekstrand	53072582dc	nir: Add an array copy optimization This peephole optimization looks for a series of load/store_deref or copy_deref instructions that copy an array from one variable to another and turns it into a copy_deref that copies the entire array. The pattern it looks for is extremely specific but it's good enough to pick up on the input array copies in DXVK and should also be able to pick up the sequence generated by spirv_to_nir for a OpLoad of a large composite followed by OpStore. It can always be improved later if needed. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-23 21:47:47 -05:00
Jason Ekstrand	a4a9c07549	intel/nir: Use nir_shrink_vec_array_vars Shader-db results on Kaby Lake: total instructions in shared programs: 15177605 -> 15176765 (<.01%) instructions in affected programs: 4259 -> 3419 (-19.72%) helped: 1 HURT: 0 total spills in shared programs: 10954 -> 10855 (-0.90%) spills in affected programs: 295 -> 196 (-33.56%) helped: 1 HURT: 0 total fills in shared programs: 22222 -> 22117 (-0.47%) fills in affected programs: 417 -> 312 (-25.18%) helped: 1 HURT: 0 The helped shader is from the OglCSDof synmark test. On my Kaby Lake laptop, the actual framerate of the benchmark didn't appear to improve beyond the noise. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-23 21:46:56 -05:00
Jason Ekstrand	be8d009908	nir: Add a array-of-vector variable shrinking pass This pass looks for variables with vector or array-of-vector types and narrows the type to only the components used. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-23 21:46:56 -05:00
Jason Ekstrand	02a5442dd7	intel/nir: Use the new structure and array splitting passes We call structure splitting once because it is guaranteed to split all the structures in the entire shader in one go. We call array splitting in the loop in case future optimizations turn indirects into direct dereferences and we can split more arrays. Shader-db results on Kaby Lake: total instructions in shared programs: 15177605 -> 15177605 (0.00%) instructions in affected programs: 0 -> 0 helped: 0 HURT: 0 This is unsurprising because nir_lower_vars_to_ssa already effectively does structure and array splitting internally. It doesn't actually split the variables but it's ability to reason about aliasing in the presence of arrays and structures and pick out scalars or vectors to be lowered to SSA values is fairly advanced. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-23 21:44:14 -05:00
Jason Ekstrand	fa6417495c	nir: Add an array splitting pass This pass looks for array variables where at least one level of the array is never indirected and splits it into multiple smaller variables. This pass doesn't really do much now because nir_lower_vars_to_ssa can already see through arrays of arrays and can detect indirects on just one level or even see that arr[i][0][5] does not alias arr[i][1][j]. This pass exists to help other passes more easily see through arrays of arrays. If a back-end does implement arrays using scratch or indirects on registers, having more smaller arrays is likely to have better memory efficiency. v2 (Jason Ekstrand): - Better comments and naming (some from Caio) - Rework to use one hash map instead of two v2.1 (Jason Ekstrand): - Fix a couple of bugs that were added in the rework including one which basically prevented it from running Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-23 21:44:14 -05:00
Jason Ekstrand	26eb077ec4	nir: Add a structure splitting pass This pass doesn't really do much now because nir_lower_vars_to_ssa can already see through structures and considers them to be "split". This pass exists to help other passes more easily see through structure variables. If a back-end does implement arrays using scratch or indirects on registers, having more smaller arrays is likely to have better memory efficiency. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-23 21:44:14 -05:00
Jason Ekstrand	b489998e63	nir/types: Add array_or_matrix helpers Reviewed-by: Thomas Helland<thomashelland90@gmail.com>	2018-08-23 21:44:14 -05:00
Kenneth Graunke	b03dcb1e5f	i965: don't include compute resources in "Combined" limits The combined limits should only include shader stages that can be active at the same time. We don't need to include compute. See also `cff290df4c` for st/mesa. Unbreaks i965 from assert failing on driver load since Marek's `45f87a48f9`, which dropped the core Mesa capabilities before adjusting driver limits down to match.	2018-08-23 17:27:27 -07:00
Marek Olšák	9176703788	radeonsi: increase the maximum UBO size to 2 GB Same as the closed driver. This causes a failure in GL45-CTS.compute_shader.max, which has a trivial bug. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	5693ca865d	radeonsi: bump MAX_GS_INVOCATIONS same as the closed driver Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	d3c1b212bc	gallium: add PIPE_CAP_MAX_SHADER_BUFFER_SIZE Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	f6ccd594e7	gallium: add PIPE_CAP_MAX_GS_INVOCATIONS Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	8c71b70f07	tgsi/ureg: don't call tgsi_sanity when it's too slow Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	80aecad0ca	st/mesa: fix up uniform limits to be able to expose large UBOs Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	cff290df4c	st/mesa: don't include compute resources in "Combined" limits The combined limits should only include shader stages that can be active at the same time. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	d36af3a9d9	st/mesa: set ctx->Const.SubPixelBits Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	3867af39f9	glsl: fix error checking against MAX_UNIFORM_LOCATIONS Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	f01338118c	mesa: make MaxCombinedUniformComponents 64-bit to allow large UBOs Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	a8b71f2db8	mesa: add ctx->Const.MaxGeometryShaderInvocations radeonsi wants to report a different value Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	45f87a48f9	mesa: don't include compute resources in MAX_COMBINED_* limits 5 is the maximum number of shader stages that can be used by 1 execution call at the same time (e.g. a draw call). The limit ensures that each stage can use all of its binding points. Compute is separate and doesn't need the 5x multiplier. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	095515e16c	mesa: bump GL_MAX_ELEMENTS_INDICES and GL_MAX_ELEMENTS_VERTICES same number as our closed GL driver v2: don't use MaxArrayLockSize Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	356ff963ec	mesa: remove incorrect change for EXT_disjoint_timer_query Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-08-23 16:56:17 -04:00
Marek Olšák	37eee90df7	glapi: actually implement GL_EXT_robustness for GLES The extension was exposed but not the functions. This fixes: dEQP-GLES31.functional.debug.negative_coverage.get_error.buffer.readn_pixels dEQP-GLES31.functional.debug.negative_coverage.get_error.state.get_nuniformfv dEQP-GLES31.functional.debug.negative_coverage.get_error.state.get_nuniformiv Cc: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-23 16:54:30 -04:00
Kenneth Graunke	578e45ab7b	intel/decoder: Decode SFIXED values. This lets us example SAMPLER_STATE's LOD Bias field, among other things. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-23 13:04:53 -07:00
Emil Velikov	855af9a5a2	travis: use python3 for the autoconf builds Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-23 17:00:28 +01:00
Emil Velikov	ae7898dfdb	configure: allow building with python3 Pretty much all of the scripts are python2+3 compatible. Check and allow using python3, while adjusting the PYTHON2 refs. Note: - python3.4 is used as it's the earliest supported version - python3 chosen prior to python2 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-23 17:00:13 +01:00
Emil Velikov	c51e7486d9	bin/git_sha1_gen.py: remove execute bit/shebang The script is executed explicitly via the build system, that uses PYTHON/prog_python and equivalent. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-23 17:00:04 +01:00
Eric Engestrom	993a456360	vk/wsi: avoid reading uninitialised memory It will be ignored by x11_swapchain_result() anyway (because reaching the `fail` label without setting `result` means the swapchain status was already a hard error), but the compiler still complains about reading uninitialised memory. While at it, drop the unused assignment right before returning. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-23 14:47:59 +01:00
Eric Engestrom	a0f6a11944	egl: drop unused _EGL_BUILT_IN_DRIVER_DRI2 Unused since `b174a1ae72` "egl: Simplify the "driver" interface". Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-08-23 14:47:59 +01:00
Samuel Pitoiset	87fbc16e34	radv/gfx9: implement coherent shaders for VK_ACCESS_SHADER_READ_BIT Single-sample color and single-sample depth (not stencil) are coherent with shaders. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl	2018-08-23 15:42:56 +02:00
Mathieu Bridon	6027d354d1	bin/install_megadrivers.py: Remove shebang and executable bit Since the script is never executed directly, but launched by Meson as an argument to the Python interpreter, those are not needed any more. In addition, they are the reason this script was missed when I moved the Meson buildsystem to Python 3, so removing them helps avoiding future confusion. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-23 12:12:06 +01:00
Mathieu Bridon	8c8fd0bb8e	meson: Run the install script with Python 3 The script was being run directly as an executable, and it has a Python 2 shebang. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-23 12:12:06 +01:00
Emil Velikov	48820ed8da	glsl: remove execute bit and shebang from python tests Just like the rest of the tree - these should be run either as part of the build system check target, or at the very least with an explicitly versioned python executable. Fixes: `db8cd8e367` ("glcpp/tests: Convert shell scripts to a python script") Fixes: `97c28cb082` ("glsl/tests: Convert optimization-test.sh to pure python") Fixes: `3b52d29227` ("glsl/tests: reimplement warnings-test in python") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-23 12:02:45 +01:00
Emil Velikov	e39b916d0c	docs: update required mako version The requirement was bumped a while back, but we forgot to update the docs. Fixes: `ed871af91c` ("configure.ac: raise Mako required version to 0.8.0") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-23 12:02:45 +01:00
Emil Velikov	e7149369bd	configure: use distutils in ax_check_python_mako_module Handling the version comparison by hand is a bad idea. Python has a handy module distutils for that - use it. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-23 11:59:48 +01:00
Emil Velikov	df2042d99a	configure: enforce python 2.7 with AM_PATH_PYTHON Currently we use AC_CHECK_PROGS looking for python2.7, python2 and finally python. That is due to the varying names used across the different OS. Use the handy AM_PATH_PYTHON which finds the correct name and checks for the version. Note: python2.7 has been an unofficial requirement for quite some time. Update the docs to reflect that. Cc: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-23 11:55:55 +01:00
Ian Romanick	c7c0b391ef	i965: Enable INTEL_shader_atomic_float_minmax on Gen9+ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	59c17dbc6c	i965: Sort Gen9+ extension enables This is a strictly alphabetic sort, as is done in extensions_table.h There are other options. We should pick one and document it. Right now, this file is chaos. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	d515c75463	intel/compiler: Implement untyped atomic float min, max, and compare-swap dataport messages v2: Split changes to the message type field to another patch. Suggested by Caio. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	f347348f8a	intel/compiler: Expand untyped atomic message type field by a bit This is necessary for a new Gen9 message type that will be added in the next patch. There are also Gen8 message types that need the extra bit (mostly for bindless). v2: Split off from the next patch. Suggested by Caio. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	d628642a34	intel/compiler: Silence unused parameter warnings src/intel/compiler/brw_disasm_info.c: In function ‘nir_print_instr’: src/intel/compiler/brw_disasm_info.c:30:61: warning: unused parameter ‘instr’ [-Wunused-parameter] __attribute__((weak)) void nir_print_instr(const nir_instr instr, FILE fp) {} ^~~~~ src/intel/compiler/brw_disasm_info.c:30:74: warning: unused parameter ‘fp’ [-Wunused-parameter] __attribute__((weak)) void nir_print_instr(const nir_instr instr, FILE fp) {} ^~ src/intel/compiler/brw_disasm.c: In function ‘src_ia1’: src/intel/compiler/brw_disasm.c:850:18: warning: unused parameter ‘_reg_file’ [-Wunused-parameter] unsigned _reg_file, ^~~~~~~~~ src/intel/compiler/brw_fs_surface_builder.cpp: In function ‘void brw::surface_access::emit_byte_scattered_write(const brw::fs_builder&, const fs_reg&, const fs_reg&, const fs_reg&, unsigned int, unsigned int, unsigned int, brw_predicate)’: src/intel/compiler/brw_fs_surface_builder.cpp:193:57: warning: unused parameter ‘size’ [-Wunused-parameter] unsigned dims, unsigned size, ^~~~ v2: Update commit message. brw_fs_generator.cpp warnings were already fixed by another patch. Noticed by Caio. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	0842655ac6	nir: Add floating point atomic min, max, and compare-swap instrinsics Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	69ce7baa9e	nir: Add floating point atomic add instrinsics Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	a390158d10	glsl: Add support for lowering shared-variable float atomics Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	39bf3100ac	glsl: Add support for lowering SSBO float atomics Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	280ab4afa8	glsl: Add built-in functions for INTEL_shader_atomic_float_minmax Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	c9d52c83a4	mesa: Extension boilerplate for INTEL_shader_atomic_float_minmax Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	346321a836	docs: Initial version of INTEL_shader_atomic_float_minmax spec v2: Describe interactions with the capabilities added by SPV_INTEL_shader_atomic_float_minmax v3: Remove 64-bit float support. v4: Explain NaN issues. Explain issues with atomicMin(-0, +0) and atomicMax(-0, +0). v5: Fix whitespace issues noticed by Caio. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	88b6c7bc14	glsl: Add built-in functions for NV_shader_atomic_float Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Ian Romanick	9527bb4e70	mesa: Extension boilerplate for NV_shader_atomic_float Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 20:31:32 -07:00
Gurchetan Singh	c731508b98	meson: fix egl build for android Haven't tested this, but we do include loader.h in platform_android.c Fixes: `c5ec155685` ("meson: wire up egl/android") Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-22 16:47:19 -07:00
Gurchetan Singh	ec6cb01e21	meson: fix egl build for surfaceless Without this, I get: > platform_surfaceless.c:38:10: fatal error: 'loader.h' file not found > #include "loader.h" > ^~~~~~~~~~ > 1 error generated. Fixes: `108d257a16` ("meson: build libEGL") Reviewed-by: Dylan Baker <dylan@pnwbakers.com> v2: Split up patches, modify commit message (Dylan)	2018-08-22 16:47:09 -07:00
Caio Marcelo de Oliveira Filho	410de0e3f1	nir: Give end_block its own index Since there's no particular reason for the index to be 0, choose an index that is not used by other block. This is convenient when we store "per-block" data in an array AND look for the successors data (e.g. any kind of backwards data-flow analysis). v2: Add a note about end_block's index. (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-22 14:41:26 -07:00
Caio Marcelo de Oliveira Filho	8364ec3fce	nir: Skip common instructions when comparing deref paths Deref paths may share the same deref instructions in their chains, e.g. ssa_100 = deref_var A ssa_101 = deref_struct "array_field" of ssa_100 ssa_102 = deref_array "[1]" of ssa_101 ssa_103 = deref_struct "field_a" of ssa_102 ssa_104 = deref_struct "field_a" of ssa_103 when comparing the two last deref instructions, their paths will share a common sequence ssa_100, ssa_101, ssa_102. This patch skips to next iteration if the deref instructions are the same. Path[0] (the var) is still handled specially, so in the case above, only ssa_101 and ssa_102 will be skipped. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-22 14:41:26 -07:00
Caio Marcelo de Oliveira Filho	5196041e93	nir: Export deref comparison functions Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-22 14:41:26 -07:00
Caio Marcelo de Oliveira Filho	7f8ecedced	util/dynarray: add a clone function v2: Fix mem_ctx parameter type. (Thomas) Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-22 14:41:26 -07:00
Mariusz Ceier	61b84b8c14	amd/addrlib: Fix include path for c99_compat.h Without this patch mesa doesn't compile: In file included from ../mesa-9999/src/amd/addrlib/addrinterface.cpp:39: ../mesa-9999/src/util/macros.h:29:10: fatal error: c99_compat.h: No such file or directory #include "c99_compat.h" ^~~~~~~~~~~~~~ compilation terminated. Fixes: `15ca5ce99a` ("amd/addrlib: mark returnCode as MAYBE_UNUSED in") Signed-off-by: Mariusz Ceier <mceier+mesa-dev@gmail.com> Acked-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-22 14:39:02 -07:00
Grazvydas Ignotas	0076ea92a9	vulkan/wsi: fix pointer-integer conversion warnings For 32bit build. Trivial. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-23 00:34:32 +03:00
Grazvydas Ignotas	9177074524	radv: use different builtin shader cache for 32bit Currently if 64bit and 32bit programs are used interchangeably, radv will keep overwriting the cache. Use separate cache files to avoid that. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-23 00:34:32 +03:00
Grazvydas Ignotas	356f6673d6	radv: place pointer length into cache uuid Thanks to reproducible builds, binary file timestamps may be identical for both 32bit and 64bit packages when built from the same source. This means radv will use the same cache for both 32 and 64 bit processes, which leads to crashes. Conveniently there is a spare byte in cache_uuid, let's place the pointer size there. Fixes: `f4e499ec79` "radv: add initial non-conformant radv vulkan driver" CC: 18.1 18.2 <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107601 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105904 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-23 00:34:32 +03:00
Grazvydas Ignotas	2edf47edf0	llvmpipe: add cc clobber to inline asm The bsr instruction modifies flags, so that needs to be indicated to the compiler. No effect on generated code, but still needed for correctness. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-08-23 00:34:32 +03:00
Nanley Chery	6d80b0b4ba	intel/isl: Avoid tiling some 16K-wide render targets Fix rendering issues on BDW and SKL. Fixes: `0288fe8d04` ("i965/miptree: Use the correct BLT pitch") Fixes the following regressions seen exclusively on SKL: * KHR-GL46.texture_barrier_ARB.disjoint-texels * KHR-GL46.texture_barrier_ARB.overlapping-texels * KHR-GL46.texture_barrier.disjoint-texels * KHR-GL46.texture_barrier.overlapping-texels and both on BDW and SKL: * GTF-GL46.gtf21.GL2FixedTests.buffer_corners.buffer_corners * GTF-GL46.gtf21.GL2FixedTests.stencil_plane_corners.stencil_plane_corners v2: Note the fixed tests (Andres). Don't cause failures with multisampled buffers (Andres). Don't hamper SKL GT4 (Ken). v3: Fix the Fixes tag (Dylan). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107359 Cc: <mesa-stable@lists.freedesktop.org> Tested-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-22 13:53:19 -07:00
Nanley Chery	b041fc0649	i965/miptree: Fix can_blit_slice() Check the destination's row pitch against the BLT engine's row pitch limitation as well. Fixes: `0288fe8d04` ("i965/miptree: Use the correct BLT pitch") v2: Fix the Fixes tag (Dylan). Check the destination row pitch (Chris). Reported-by: Dylan Baker <dylan@pnwbakers.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-22 13:53:02 -07:00
Nanley Chery	030b6efcfd	i965/miptree: Use miptree_map in map_blit functions This struct contains all the data of interest. can_blit_slice() will use it in the next patch to calculate the correct pitch. Suggested-by: Chris Wilson <chris@chris-wilson.co.uk> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-22 13:23:17 -07:00
Rafael Antognolli	f8cfc77660	intel/tools/aubwrite: Always use physical addresses for traces. It looks like we can't rely on the simulator to always translate virtual addresses to physical ones correctly. So let's use physical everywhere. Since our current GGTT maps virtual to physical addresses in a 1:1 way, no further changes are required. Additionally, we have other address spaces not in use right now. So let's make it easier to switch which one we are using but putting the default one into the aub_file struct. Cc: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-22 12:52:41 -07:00
Rafael Antognolli	e82d8fa964	intel/tools/aubwrite: Rename "legacy" to "Trace Block". Hopefully it's a little more descriptive, and more accurate. Cc: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-22 12:52:41 -07:00
Jason Ekstrand	68ae66542a	nir/vars_to_ssa: Don't build deref nodes for non-local variables Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-08-22 14:17:38 -05:00
Marek Olšák	e80e8d7adc	ac: fix WAITCNT flags for GFX9 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-22 14:34:43 -04:00
Kai Wasserbäch	c836a751bc	amd/addrlib: mark physicalSliceSize as MAYBE_UNUSED in Addr::V1::EgBasedLib::HwlGetSizeAdjustmentMicroTiled Only used, when asserts are enabled. Fixes an unused-but-set-variable warning with GCC 8: ../../../src/amd/addrlib/r800/egbaddrlib.cpp: In member function 'virtual long long unsigned int Addr::V1::EgBasedLib::HwlGetSizeAdjustmentMicroTiled(unsigned int, unsigned int, ADDR_SURFACE_FLAGS, unsigned int, unsigned int, unsigned int, unsigned int, unsigned int) const': ../../../src/amd/addrlib/r800/egbaddrlib.cpp:4111:13: warning: variable 'physicalSliceSize' set but not used [-Wunused-but-set-variable] UINT_64 physicalSliceSize; ^~~~~~~~~~~~~~~~~ Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-22 14:33:21 -04:00
Kai Wasserbäch	2e0586e379	amd/addrlib: mark numPipes as MAYBE_UNUSED in Addr::V1::EgBasedLib::SanityCheckMacroTiled (v2) Only used, when asserts are enabled. Fixes an unused-variable warning with GCC 8: ../../../src/amd/addrlib/r800/egbaddrlib.cpp: In member function 'int Addr::V1::EgBasedLib::SanityCheckMacroTiled(ADDR_TILEINFO*) const': ../../../src/amd/addrlib/r800/egbaddrlib.cpp:982:13: warning: unused variable 'numPipes' [-Wunused-variable] UINT_32 numPipes = HwlGetPipes(pTileInfo); ^~~~~~~~ v2: Don't realign other variable definitions, to keep in line with file style (Marek) Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-22 14:33:21 -04:00
Kai Wasserbäch	6a7ef7c7dc	amd/addrlib: mark pEqToCheck as MAYBE_UNUSED in Addr::V2::Gfx9Lib::ComputeStereoInfo (v2) Only used, when asserts are enabled. Fixes an unused-variable warning with GCC 8: ../../../src/amd/addrlib/gfx9/gfx9addrlib.cpp: In member function 'ADDR_E_RETURNCODE Addr::V2::Gfx9Lib::ComputeStereoInfo(const ADDR2_COMPUTE_SURFACE_INFO_INPUT, ADDR2_COMPUTE_SURFACE_INFO_OUTPUT, unsigned int) const': ../../../src/amd/addrlib/gfx9/gfx9addrlib.cpp:3879:34: warning: unused variable 'pEqToCheck' [-Wunused-variable] const ADDR_EQUATION *pEqToCheck = &m_equationTable[eqIndex]; ^~~~~~~~~~ v2: Don't realign other variable definitions, to keep in line with file style (Marek) Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-22 14:33:21 -04:00
Kai Wasserbäch	556f89a715	amd/addrlib: mark microBlockDim as MAYBE_UNUSED in Addr::V2::Gfx9Lib::HwlComputeBlock256Equation Only used, when asserts are enabled. Fixes an unused-but-set-variable warning with GCC 8: ../../../src/amd/addrlib/gfx9/gfx9addrlib.cpp: In member function 'virtual ADDR_E_RETURNCODE Addr::V2::Gfx9Lib::HwlComputeBlock256Equation(AddrResourceType, AddrSwizzleMode, unsigned int, ADDR_EQUATION*) const': ../../../src/amd/addrlib/gfx9/gfx9addrlib.cpp:2473:15: warning: variable 'microBlockDim' set but not used [-Wunused-but-set-variable] Dim2d microBlockDim = Block256_2d[elementBytesLog2]; ^~~~~~~~~~~~~ Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-22 14:33:21 -04:00
Kai Wasserbäch	15ca5ce99a	amd/addrlib: mark returnCode as MAYBE_UNUSED in ElemGetExportNorm Only used, when asserts are enabled. Fixes an unused-but-set-variable warning with GCC 8: ../../../src/amd/addrlib/addrinterface.cpp: In function 'int ElemGetExportNorm(ADDR_HANDLE, const ELEM_GETEXPORTNORM_INPUT*)': ../../../src/amd/addrlib/addrinterface.cpp:835:23: warning: variable 'returnCode' set but not used [-Wunused-but-set-variable] ADDR_E_RETURNCODE returnCode = ADDR_OK; ^~~~~~~~~~ Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-22 14:33:21 -04:00
Lionel Landwerlin	8b0e48887f	intel: aubinator_viewer: add urb view This is available through a "Show URB" button on the 3DPRIMITIVE instructions. v2: Fix urb allocation end value in tooltip (Rafael) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-08-22 18:02:11 +01:00
Lionel Landwerlin	d1c4a62bf8	intel: aubinator_viewer: store urb state during decoding Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-08-22 18:02:11 +01:00
Lionel Landwerlin	38f10d5a03	intel: tools: add aubinator viewer A graphical user interface version of aubinator. Allows you to : - simultaneously look at multiple points in the aub file (using all the goodness of the existing decoding in aubinator) - edit an aub file v2: Switch from GLFW to GTK+3 v3: Fix warning when exiting Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Rafael Antognolli <rafael.antognolli@intel.com> (v1)	2018-08-22 18:02:11 +01:00
Lionel Landwerlin	ea83a1d304	intel: tools: import ImGui We want to add a new UI tool to decode aub files. This will use the Dear ImGui library to render its interface. The build of this UI toolkit is conditional to -Dwith_tools=intel-ui which superseeds -Dwith_tools=intel. The main way to use ImGui is to embed its source code at a particular revision. Most embedding projects have to do a bit of integration which is really specific to one's project. In our case the only modification is to include libepoxy. We also choose to use Gtk+3 for the window system integration. As oppose to the previous previous version of this patch using GLFW, Gtk+ is able to handle X11/Wayland session as well as property DPI scaling on retina monitors. The import was done at this commit (https://github.com/ocornut/imgui) : commit 6211f40f3d903dd9df961256e044029c49793aa3 Author: omar <omarcornut@gmail.com> Date: Fri Jul 27 12:29:33 2018 +0200 Internals: Drag and Drop: default drop preview use a narrower clipping rectangle (no effect here, but other branches uses a narrow clipping rectangle that was too small so this is a fix for it) + Comments v2: Switch from GLFW to GTK+ (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-08-22 18:02:11 +01:00
Lionel Landwerlin	4ba12e8c54	intel: tools: aub_mem: reuse already mapped ppgtt buffers When we map a PPGTT buffer into a continous address space of aubinator to be able to inspect it, we currently add it to the list of BOs to unmap once we're finished. An optimization we can apply it to look up that list before trying to remap PPGTT buffers again (we already do this for GGTT buffers). We need to take some care before doing this because the list also contains GGTT BOs. As GGTT & PPGTT are 2 different address spaces, we can have matching addresses in both that point to different physical locations. This changes adds a flag on the elements of the list of mapped BOs to differenciate between GGTT & PPGTT, which allows use to reuse that list when looking up both address spaces. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-08-22 18:02:11 +01:00
Lionel Landwerlin	8fd78b4eea	intel: tools: aubmem: map gtt data to aub file This will allow the aubinator viewer tool to modify the aub data that was loaded at a particular gtt address. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-08-22 18:02:11 +01:00
Lionel Landwerlin	ebb145ee12	intel: tools: create libaub Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-22 18:02:11 +01:00
Lionel Landwerlin	475d670ef7	intel: tools: aubwrite: wrap function declarations for c++ Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-08-22 18:02:11 +01:00
Lionel Landwerlin	ed21007a6a	intel: tools: split memory management out of aubinator Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-08-22 18:02:11 +01:00
Lionel Landwerlin	14a1cb37eb	util: rb_tree: add safe iterators v2: Add helper to make iterators more readable (Rafael) Fix rev iterator bug (Rafael) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-08-22 17:49:36 +01:00
Lionel Landwerlin	4616639b49	intel: tools: split aub parsing from aubinator v2: add parsing error callback (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> (v1)	2018-08-22 17:49:36 +01:00
Mathieu Bridon	e15686567c	meson: Run the test with Python 3 This is a patch from me and a patch from Mathieu Bridon squashed together. Signed-off-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Mathieu Bridon <bochecha@daitauha.fr>	2018-08-22 08:41:01 -07:00
Mathieu Bridon	ff0ce31e2a	python: Disable universal newlines We are testing the behaviour of a tool, for different input files, each one using a different newline sequence. ('\n' on UNIX, '\r\n' on Windows, …) Unfortunately, when opening a file in text mode, Python 3 will by default enable the "universal newlines" mode, which means it replaces all the known newline sequences by '\n'. This (usually useful) behaviour breaks the tests, which are specifically trying to handle files with newline sequences different from '\n'. Disabling the universal newlines mode fixes the tests. However, to keep the script compatible with both Python 2 and 3, we must use the io.open() function instead of the open() builtin, as the latter only knows about the `newline` argument on Python 3. Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-22 08:41:01 -07:00
Mathieu Bridon	fc708069f7	python: difflib prefers unicode strings Python 3 does not automatically convert from bytes to unicode strings like Python 2 used to do. This commit makes sure we pass unicode strings to difflib.unified_diff, so that the script works on both Python 2 and 3. Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-22 08:41:01 -07:00
Dylan Baker	477d4b9960	compiler/glsl/tests: Make tests python3 safe v2: - explicitly decode the output of subprocesses - handle bytes and string types consistently rather than relying on python 2's coercion for bytes and ignoring them in python 3 v3: - explicitly set encode as well as decode - python 2.7 and 3.x `bytes` instead of defining an alias Reviewed-by: Mathieu Bridon <bochecha@daitauha.fr>	2018-08-22 08:41:01 -07:00
Juan A. Suarez Romero	6ea5718318	travis: SWR requires LLVM 6.0 v2: update clarification why ubuntu-toolchain-r-test is required (Emil) Fixes: `0cef0cccf5` ("swr: bump minimum supported LLVM version to 6.0") Cc: Dylan Baker <dylan@pnwbakers.com> Cc: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-22 17:29:20 +02:00
Samuel Pitoiset	4c43ec461d	ac/nir: fix getting GLSL type of array of samplers for TG4 This fixes a crash in build_tex_intrinsic() when trying to launch the Basemark GPU benchmark on GFX8. It looks like there is still something wrong because some frames are black. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106980 CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-22 15:23:11 +02:00
Samuel Pitoiset	24ee53231d	radv: remove dead variables after splitting per member structs Otherwise, nir_lower_clip_cull_distance_arrays might report wrong number of output clips/culls because it relies on shader output variables and some of them might be dead. This fixes a rendering issue with Dolphin and Super Mario Sunshine. Fixes: `b0c643d8f5` ("spirv: Use NIR per-member splitting") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107610 CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-22 13:57:18 +02:00
Yunchao He	bea4d4c78c	anv: add VK_EXT_sampler_filter_minmax support This extension can be supported on SKL+. With this patch, all corresponding tests (6K+) in CTS can pass. No test fails. I verified CTS with the command below: deqp-vk --deqp-case=dEQP-VK.pipeline.sampler.view_type.reduce v2: 1) support all depth formats, not depth-only formats, 2) fix a wrong indention (Jason). v3: fix a few nits (Lionel). v4: fix failures in CI: disable sampler reduction when sampler reduction mode is not specified via this extension (Lionel). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-22 11:56:19 +01:00
Samuel Pitoiset	0608349232	radv: use ac_build_imad() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-22 09:17:40 +02:00
Marek Olšák	d87fe1f0fd	ac,radeonsi: use ac_build_gather_values more Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-21 20:50:37 -04:00
Marek Olšák	60beac9efc	ac,radeonsi: use ac_build_fmad Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-21 20:50:37 -04:00
Marek Olšák	c401ead68a	radeonsi: use ac_build_imad Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-21 20:50:37 -04:00
Marek Olšák	659f2e0fcb	ac: add imad & fmad helpers Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-21 20:50:37 -04:00
Marek Olšák	2276f8f064	ac: add ac_build_s_barrier Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-21 20:50:37 -04:00
Marek Olšák	6224144b6d	radeonsi: print the shader stage name when printing LLVM IR Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-21 20:50:37 -04:00
Marek Olšák	5d20b9be90	radeonsi: use is_merged shader in si_prolog_get_rw_buffers needed to change the input type to si_shader_context Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-21 20:50:37 -04:00
Marek Olšák	a4a104fc81	ac: completely remove +auto-waitcnt-before-barrier it causes corruption on several different GPU generations. Cc: 18.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-21 20:50:37 -04:00
Anuj Phogat	2383ddace1	anv/icl: Allow headerless sampler messages for pre-emptable contexts It fixes simulator warnings in vulkancts tests complaining about missing support for headerless sampler messages for pre-emptable contexts. Bit 5 in SAMPLER MODE register is newly introduced for ICLLP. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-21 12:50:05 -07:00
Anuj Phogat	81b74b5d96	anv/icl: Disable binding table prefetching Gen 11 workarounds table #2056 WABTPPrefetchDisable suggests to disable prefetching of binding tables for ICLLP A0 and B0 steppings. We have a similar patch for i965 driver in Mesa commit `a5889d70`. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-21 12:50:05 -07:00
Anuj Phogat	482f328f3b	i965/icl: Allow headerless sampler messages for pre-emptable contexts It fixes simulator warnings in piglit tests complaining about missing support for headerless sampler messages for pre-emptable contexts. Bit 5 in SAMPLER MODE register is newly introduced for ICLLP. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-21 12:50:05 -07:00
Dave Airlie	32529e6084	r600/eg: rework atomic counter emission with flushes With the current code, we didn't do the space checks prior to atomic counter setup emission, but we also didn't add atomic counters to the space check so we could get a flush later as well. These flushes would be bad, and lead to problems with parallel tests. We have to ensure the atomic counter copy in, draw emits and counter copy out are kept in the same command submission unit. This reworks the code to drop some useless masks, make the counting separate to the emits, and make the space checker handle atomic counter space. [airlied: want this in 18.2] Fixes: `06993e4ee` (r600: add support for hw atomic counters. (v3))	2018-08-21 20:45:38 +01:00
Dave Airlie	41d58e2098	virgl: ARB_enhanced_layouts support We need to handle the gaps in the streamout bindings on the guest side and enable if it the host has the rest enabled. Reviewed-by: Jakob Bornecrantz <jakob@collabora.com>	2018-08-22 05:05:21 +10:00
Chad Versace	aa79cc2bc8	i965: Implement EGL_KHR_mutable_render_buffer Testing: - Manually tested a low-latency handwriting demo that toggles EGL_RENDER_BUFFER. Toggling changed the display latency as expected. Used Android on Chrome OS, Kabylake GT2. - No change in dEQP-EGL.functional.* on Fedora 27, Wayland, Skylake GT2. Used deqp at tag android-p-preview-5. - No regressions in dEQP-EGL.functional., ran on Android on Chrome OS, Kabylake GT2. Some dEQP-EGL.functional.mutable_render_buffer. test change from NotSupported to Pass. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-21 09:56:20 -07:00
Chad Versace	ed7c694688	egl/android: Implement EGL_KHR_mutable_render_buffer Specifically, implement the extension DRI_MutableRenderBufferLoader. However, the loader enables EGL_KHR_mutable_render_buffer only if the DRI driver implements its half of the extension, DRI_MutableRenderBufferDriver. Testing: - No change in dEQP-EGL.functional.* on Fedora 27, Wayland, Skylake GT2. Used deqp at tag android-p-preview-5. - No change in dEQP-EGL.functional.*, ran on Android on Chrome OS, Kabylake GT2. - Manually inspected Android apps on same Chrome OS device. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-21 09:56:20 -07:00
Eric Engestrom	317c460a4d	util/xmlpool: make indentation coherent Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-21 17:36:13 +01:00
Eric Engestrom	2de9e841e7	egl: add helper to combine two u32 into one u64 Use a helper to avoid the common issues of upcasting after the right shift (losing the upper bits) and shifting signed values (sign gets shifted too). Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-21 15:50:02 +01:00
Eric Engestrom	1ca23420c1	docs: trivial s/>/>/ html fix Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-21 15:41:41 +01:00
Eric Engestrom	6ff1c47996	autotools: don't ship the git_sha1.h generated in git in the tarballs This file is regenerated at build time anyway, so this would just get overwritten anyway. No reason to ship it in the tarball. Fixes: `44df06211c` "autotools: include git_sha1.h in dist tarball" Fixes: `471f708ed6` "git_sha1: simplify logic" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-21 15:30:56 +01:00
Eric Engestrom	81fe9bdf6d	intel/genxml: minor python style fix Suggested-by: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-21 15:30:55 +01:00
Jose Fonseca	9e5e3a8ead	appveyor: Set git core.autocrlf setting to true. The git core.autocrlf setting defaults to true (ie, all text files get checked out as CRLF on Windows), except on Appveyor where's set to "input" (ie, all text files get checked out with the upstream repository's line endings, which for us typically means LF.) And this was masking on Appveyor a regression in gen_xmlpool.py processing t_options.h with CRLF line endings. This change makes core.autocrlf to be true, which would have enabled to immediately catch the issue, as seen in https://ci.appveyor.com/project/jrfonseca/mesa/build/51 Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-08-21 09:46:19 +01:00
Timothy Arceri	797cd198ae	mesa: move legacy hyperz option from dri config Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-21 09:19:02 +10:00
Timothy Arceri	02062ab1e1	mesa: remove unused dri config option disable_shader_bit_encoding This was added as a workaround for Heaven 3.0 but was later removed by `5ead448719` to allow Heaven 4.0 to work correctly. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-21 09:19:02 +10:00
Timothy Arceri	c5f863f2fd	mesa: drop legacy no_rast dri option Add enviroment var overrides to legacy drivers instead. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-21 09:19:01 +10:00
Timothy Arceri	02e32c92a2	i965: remove unused no_rast bool Forcing software fallbacks for i965 hasn't been an option since `5e3c093ff8`. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-21 09:19:01 +10:00
Timothy Arceri	7867c1078a	i915: remove early_z dri option This driver is in maintenance mode so lets remove this hidden unsafe option. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-21 09:19:01 +10:00
Kevin Rogovin	7ec308d978	Add NV_fragment_shader_interlock support. The main purpose for having NV_fragment_shader_interlock extension is because that extension is also for GLES31 while the ARB extension is for GL only. Reviewed-by: Plamena Manolova <plamena.manolova@intel.com>	2018-08-20 13:32:43 -07:00
Juan A. Suarez Romero	44df06211c	autotools: include git_sha1.h in dist tarball This fixes `make distcheck`. Fixes: `471f708ed6` ("git_sha1: simplify logic") CC: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-20 18:43:50 +02:00
Juan A. Suarez Romero	0cef0cccf5	swr: bump minimum supported LLVM version to 6.0 RADV now requires LLVM 6.0 or greater, and thus we can't build dist tarball because swr requires LLVM 5.0. Let's bump required LLVM to 6.0 in swr too. v2: bump also in meson.build (Eric) Fixes: `fd1121e839` ("amd: remove support for LLVM 5.0") Cc: Tim Rowley <timothy.o.rowley@intel.com> Cc: Emil Velikov <emil.velikov@collabora.com> Cc: Dylan Baker <dylan@pnwbakers.com> Cc: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-08-20 16:13:37 +02:00
Danylo Piliaiev	25ec806eb2	i965: Advertise 8 bits subpixel precision for viewport bounds on gen6+ We use floating-points for viewport bounds so VIEWPORT_SUBPIXEL_BITS should reflect this. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105975 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-20 15:11:57 +01:00
Rob Clark	e11e9d6394	freedreno: fix context teardown race We could still have batches queued up to flush, so fd_context_destroy() (which will kill and sync on the flush_queue) before deleting buffers that might be referenced from fdN_gmem() from context of flush_queue. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-20 10:03:05 -04:00
Kai Wasserbäch	5fab32ddad	intel/decoder: mark total_length as MAYBE_UNUSED in gen_spec_load Only used, when asserts are enabled. Fixes an unused-variable warning with GCC 8: ../../../src/intel/common/gen_decoder.c: In function 'gen_spec_load': ../../../src/intel/common/gen_decoder.c:535:47: warning: variable 'total_length' set but not used [-Wunused-but-set-variable] uint32_t text_offset = 0, text_length = 0, total_length; ^~~~~~~~~~~~ Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-20 11:08:52 +01:00
Kai Wasserbäch	4228e052b3	intel/tools: initialise bo_addr to 0 in main Supresses a maybe-uninitialized warning with GCC 8. Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-20 11:08:52 +01:00
Kai Wasserbäch	ccdefbb559	intel: aubinator: mark ftruncate_res as MAYBE_UNUSED in ensure_phys_mem Only used, when asserts are enabled. Fixes an unused-variable warning with GCC 8: ../../../src/intel/tools/aubinator.c: In function 'ensure_phys_mem': ../../../src/intel/tools/aubinator.c:209:11: warning: unused variable 'ftruncate_res' [-Wunused-variable] int ftruncate_res = ftruncate(mem_fd, mem_fd_len += 4096); ^~~~~~~~~~~~~ Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-20 11:08:52 +01:00
Kai Wasserbäch	64c2bca59f	intel/aubinator_error_decode: mark ret as MAYBE_UNUSED in main Only used, when asserts are enabled. Fixes an unused-but-set-variable warning with GCC 8: ../../../src/intel/tools/aubinator_error_decode.c: In function 'main': ../../../src/intel/tools/aubinator_error_decode.c:759:11: warning: variable 'ret' set but not used [-Wunused-but-set-variable] int ret; ^~~ Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-20 11:08:52 +01:00
Samuel Pitoiset	0aacb5eab6	radv: do not use CP predication for DCC decompressions This fixes a regression with some Unity demos. Not sure what the root cause of the problem is, especially because the driver doesn't perform any fast color clears. So, it shouldn't be needed to decompress DCC. RadeonSI says that the decompression is relatively cheap if the surface has been decompressed already. One possible improvement is to two use predicates, one for DCC and one for FCE that could be cleared when DCC, FMASK or CMASK are performed by the driver. That might skip some unnecessary decompression passes (not DCC though). Fixes: `ff7daadca1` ("radv: enable/disable predication for the DCC decompression pass") CC: 18.2 <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107563 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-20 11:54:37 +02:00
Tapani Pälli	799b3d16d4	egl: implement EXT_surface_SMPTE2086_metadata and EXT_surface_CTA861_3_metadata Patch implements common bits for EXT_surface_SMPTE2086_metadata and EXT_surface_CTA861_3_metadata extensions by adding new required attributes and eglQuerySurface + eglSurfaceAttrib changes. Currently none of the drivers are utilizing this data but this patch is enabler in getting there. v2: don't enable extension globally, should be only enabled by EGL drivers that can transfer metadata to the window system (Jason) use EGLint instead of uint16_t (Eric) Signed-off-by: Ville Syrjälä <ville.syrjala@linux.intel.com> Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-20 09:44:53 +03:00
Timothy Arceri	5a0684d665	mesa: move legacy dri config option texture_units Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-20 13:53:59 +10:00
Timothy Arceri	8b4157d578	mesa: remove unused dri config option texture_heaps This seems to have only been used by DRI1 drivers which were removed with `e4344161bd`. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-20 13:53:59 +10:00
Timothy Arceri	fb277f504e	mesa: move legacy dri config option texture_blend_quality Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-20 13:53:59 +10:00
Timothy Arceri	c470db706a	util: remove unused S3TC translation for dri config Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-20 13:53:59 +10:00
Timothy Arceri	7d2474afb5	mesa: remove dri configs unused software-fallback options These seems to have only been used by DRI1 drivers which were removed with `e4344161bd`. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-20 13:53:58 +10:00
Timothy Arceri	24da2d162d	mesa: remove unused dri config option excess_mipmap This seems to have only been used by DRI1 drivers which were removed with `e4344161bd`. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-20 13:53:58 +10:00
Timothy Arceri	498831c7e6	mesa: remove unused dri config option performance_boxes This seems to have only been used by DRI1 drivers which were removed with `e4344161bd`. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-20 13:53:58 +10:00
Timothy Arceri	4a91d4ef0f	docs: update the default mesa shader cache dir We renamed the dir in commit `28b326238b`, this just updates the website to reflect the change.	2018-08-20 08:08:58 +10:00
Kai Wasserbäch	2c020dbf06	vulkan/wsi: initialise image_index to 0 in x11_manage_fifo_queues Supresses a maybe-uninitialized warning with GCC 8. Note: image_index should always be initialised due to the result check, but the compiler doesn't see that. Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-18 10:34:19 +10:00
Kai Wasserbäch	6f0647c0b2	nir: mark prev_block as MAYBE_UNUSED in opt_peel_loop_initial_if Only used, when asserts are enabled. Fixes an unused-variable warning with gcc-8: ../../../src/compiler/nir/nir_opt_if.c: In function 'opt_peel_loop_initial_if': ../../../src/compiler/nir/nir_opt_if.c:109:15: warning: unused variable 'prev_block' [-Wunused-variable] nir_block prev_block = ^~~~~~~~~~ Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-18 10:34:15 +10:00
Kai Wasserbäch	9387ca29ae	util: mark s as MAYBE_UNUSED in _mesa_half_to_unorm8 Only used, when asserts are enabled. Fixes an unused-variable warning with gcc-8: ../../../src/util/half_float.c: In function '_mesa_half_to_unorm8': ../../../src/util/half_float.c:189:14: warning: unused variable 's' [-Wunused-variable] const int s = (val >> 15) & 0x1; ^ Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-18 10:34:12 +10:00
Timothy Arceri	0da93de9c8	util: add drirc workarounds for RAGE This allows the game to run on wine (tested on radeonsi where we have compat profile support).	2018-08-18 09:26:51 +10:00
Timothy Arceri	3f9d8e9c88	util: better handle program names from wine For some reason wine will sometimes give us a windows style path for an application. For example when running the 64bit version of Rage wine gives a Unix style path, but when running the 32bit version is gives a windows style path. If we detect no '/' in the path at all it should be safe to assume we have a wine application and instead look for a '\'. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-18 09:20:39 +10:00
Timothy Arceri	d0803dea11	nir: allow more nested loops to be unrolled The innermost check was added to stop us from unrolling multiple loops in a single pass, and to stop outer loops from unrolling. When we successfully unroll a loop we need to run the analysis pass again before deciding if we want to go ahead an unroll a second loop. However the logic was flawed because it never tried to unroll any nested loops other than the first innermost loop it found. If this innermost loop is not unrolled we end up skipping all other nested loops. This unrolls a loop in a Deus Ex: MD shader on ultra settings and also unrolls a loop in a shader from the game Prey when running on DXVK. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-18 09:03:13 +10:00
Ray Strode	9baff597ce	gallium/winsys/kms: don't unmap what wasn't mapped At the moment, depending on pipe transfer flags, the dumb buffer map address can end up at either kms_sw_dt->ro_mapped or kms_sw_dt->mapped. When it's time to unmap the dumb buffer, both locations get unmapped, even though one is probably initialized to 0. That leads to the code segment getting unmapped at runtime and crashes when trying to call into unrelated code. This commit addresses the problem by using MAP_FAILED instead of NULL for ro_mapped and mapped when the dumb buffer is unmapped, and only unmapping mapped addresses at unmap time. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107098 Signed-off-by: Ray Strode <rstrode@redhat.com> Fixes: `d891f28df9` ("gallium/winsys/kms: Fix possible leak in map/unmap.") Cc: Lepton Wu <lepton@chromium.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-17 17:16:32 +01:00
Qiang Yu	0aa80abf25	loader: add dri_driver option to override dri driver to load drirc implementation of MESA_LOADER_DRIVER_OVERRIDE which can be used to override dri driver to load. Usage: override dri driver for device with spec kernel driver name: <device kernel_driver="kernel_driver_name"> <option name="dri_driver" value="new_dri_driver" /> </device> or <device driver="loader" kernel_driver="kernel_driver_name"> <option name="dri_driver" value="new_dri_driver" /> </device> v2: add kernel_driver device attribute to specify kernel driver name instead of reuse driver attribute v3: seperate loader_get_kernel_driver_name into another patch seperate add kernel_driver attribute into another patch Suggested-by: Michel Dänzer <michel@daenzer.net> Signed-off-by: Qiang Yu <Qiang.Yu@amd.com> Acked-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> [v4 Emil: add HAVE_LIBDRM guard around __driConfigOptionsLoader and loader_get_dri_config_driver] Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-17 17:16:32 +01:00
Qiang Yu	3bbe180b98	xmlconfig: add kernel_driver device attribute This attribute can be used by loader to apply different option to device use specific kernel driver. Signed-off-by: Qiang Yu <Qiang.Yu@amd.com> Acked-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-17 17:16:32 +01:00
Qiang Yu	e8b91e99e9	loader: abstract loader_get_kernel_driver_name for reuse This function can be shared by the following kernel_driver drirc patch. Signed-off-by: Qiang Yu <Qiang.Yu@amd.com> Acked-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-17 17:16:32 +01:00
Qiang Yu	30b10dbb7c	driconf: move ${sysconfdir}/drirc to ${datadir}/drirc.d/00-mesa-defaults.conf ${sysconfdir} is for store admin config files, so move this mesa default config file to ${datadir}/drirc.d. Signed-off-by: Qiang Yu <Qiang.Yu@amd.com> Acked-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-17 17:16:32 +01:00
Qiang Yu	04bdbbcab3	xmlconfig: read more config files from drirc.d/ Driver and application can put their drirc files in ${datadir}/drirc.d/ with name xxx.conf. Config files will be read and applied in file name alphabetic order. So there are three places for drirc listed in order: 1. /usr/share/drirc.d/ 2. /etc/drirc 3. ~/.drirc v4: fix meson build v3: 1. seperate driParseConfigFiles refine into another patch 2. fix entries[i] mem leak v2: drop /etc/drirc.d Signed-off-by: Qiang Yu <Qiang.Yu@amd.com> Acked-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-17 17:16:32 +01:00
Emil Velikov	0da417129e	xmlconfig: refine driParseConfigFiles to use parseOneConfigFile Also prepare for the usage of following parseConfigDir patch. Signed-off-by: Qiang Yu <Qiang.Yu@amd.com> Acked-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> [Emil: add #include <limits.h>] Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-17 17:16:32 +01:00
Jason Ekstrand	d9ea015ced	anv/pipeline: Lower pipeline layouts etc. after linking This allows us to use the link-optimized shader for determining binding table layouts and, more importantly, URB layouts. For apps running on DXVK, this is extremely important as DXVK likes to declare max-size inputs and outputs and this lets is massively shrink our URB space requirements. VkPipeline-db results (Batman pipelines only) on KBL: total instructions in shared programs: 820403 -> 790008 (-3.70%) instructions in affected programs: 273759 -> 243364 (-11.10%) helped: 622 HURT: 42 total spills in shared programs: 8449 -> 5212 (-38.31%) spills in affected programs: 3427 -> 190 (-94.46%) helped: 607 HURT: 2 total fills in shared programs: 11638 -> 6067 (-47.87%) fills in affected programs: 5879 -> 308 (-94.76%) helped: 606 HURT: 3 Looking at shaders by hand, it makes the URB between TCS and TES go from containing 32 per-vertex varyings per tessellation shader pair to a more reasonable 8-12. For a 3-vertex patch, that's at least half the URB space no matter how big the patch section is. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-17 10:50:28 -05:00
Jason Ekstrand	f210a5f4bb	anv/pipeline: Set tess IO read/written key fields in compile_* We want these to be set as close to the final compile as possible so that they are guaranteed to happen after nir_shader_gather_info is called. The next commit is going to move nir_shader_gather_info to after the linking step which makes this necessary. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-17 10:50:28 -05:00
Jason Ekstrand	2e4094cd8f	anv/pipeline: Use more fields from stage in compile_cs Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-17 10:50:28 -05:00
Jason Ekstrand	4af1a8c9e4	anv/apply_pipeline_layout: Add to the bind map instead of replacing it This commit makes three changes. One is to only walk the descriptors once and set bind map sizes at the same time as filling out the entries. The second is to make the pass additive so that we can put stuff in the bind map before applying the pipeline layout. Third, we switch to using designated initializers. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-17 10:50:28 -05:00
Jason Ekstrand	320dacb0a0	anv/lower_ycbcr: Use the binding array size for bounds checks Because lower_ycbcr gets called before apply_pipeline_layout, the indices are all logical and the binding layout HW size is actually too big for the bounds check. We should just use the regular logical array size instead. Fixes: `f3e91e78a3` "anv: add nir lowering pass for ycbcr textures" Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-17 10:50:28 -05:00
Mathieu Bridon	459ec5265c	python: Open the template as text, with an explicit encoding In commit `bd27203f4d` we changed this to open in binary mode, to then explicitly decode the lines with the right encoding. Unfortunately, that broke the build on Windows, where the template file can have '\r\n' as line terminators: opening in binary mode would keep those terminators and break the regexp. We need to go back to text mode, where the "universal newlines" mode takes care of this. However, to fix the initial issue, let's specify the encoding explicitly when opening the file, and make sure it is open in text mode, so we only get unicode strings. Reviewed-by: Jose Fonseca <jfonseca@vmware>	2018-08-17 09:34:49 -06:00
Mathieu Bridon	f9415d760a	python: Help Python 2 print the line Reviewed-by: Jose Fonseca <jfonseca@vmware>	2018-08-17 09:33:16 -06:00
Rob Clark	a8ef7f5e02	freedreno/a6xx: streamout Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-17 11:04:21 -04:00
Rob Clark	7fa2a8c3c4	freedreno/a6xx: fragz fixes Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-17 11:04:21 -04:00
Rob Clark	7c73d41160	freedreno/a6xx: scissor fixes Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-17 11:04:21 -04:00
Rob Clark	b7f18e49b7	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-17 11:04:21 -04:00
Rob Clark	a4754c245b	freedreno/a6xx: fix srgb Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-17 11:04:21 -04:00
Rob Clark	2658f63701	freedreno: fix dEQP-GLES3.functional.fence_sync.* Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-17 11:04:21 -04:00
Samuel Pitoiset	d27e1584ce	radv/winsys: fix creating the BO list for virtual buffers When the number of unique BO is 0, we optimize the list creation by copying all buffers of the current CS directly into it. But this is only valid if the CS doesn't have virtual buffers, otherwise they are not added and hw might report VM faults. This fixes VM faults with: dEQP-VK.sparse_resources.image_sparse_binding.2d.rgba8ui.1024_128_1 CC: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-17 15:00:21 +02:00
Kristian H. Kristensen	de3b34df97	freedreno: Add a6xx backend This adds a freedreno backend for the a6xx generation GPUs, which at the time of this commit is about 98% GLES2 conformant. Much remains to be done - both performance work and feature work towards more recent GLES versions, but this is a good start. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-16 19:13:36 -04:00
Rob Clark	6ee58e8257	freedreno: update generated headers pull in a6xx registers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-16 19:11:08 -04:00
Kristian H. Kristensen	e89683d5a2	freedreno: Fix warnings Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-16 19:11:08 -04:00
Dylan Baker	c782168751	scons: Check for mako 0.8.0 v2: - Use distutils to do the version checking Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107565 Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-08-16 13:53:10 -07:00
Dylan Baker	64e4638130	scons: Require python 2.7 less than 2.7 is not supported. v2: - Remove check for python >= 2.0, since we've already enforced 2.7 Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-08-16 13:52:56 -07:00
Dylan Baker	5a8f824d8c	meson: use python3 module to find python3 This handy helper is nice for OSes that are not linux or BSD like (mac and windows) as it knows how to find python3 in odd places. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-08-16 13:51:44 -07:00
Dylan Baker	52194ae4df	meson: Ensure that mako is >= 0.8.0 It's what autotools has required for a long time. v3: - Use distutils.version.StrictVersion instead of comparing strings Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-08-16 13:50:51 -07:00
Eric Engestrom	03ec672213	svga: simplify Mesa version string Suggested-by: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-16 17:38:31 +01:00
Eric Engestrom	bc8abc1adf	bin: always define MESA_GIT_SHA1 to make it directly usable in code Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-16 17:38:31 +01:00
Eric Engestrom	471f708ed6	git_sha1: simplify logic Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-16 17:38:31 +01:00
Eric Engestrom	9a6a631762	i965: drop unused assignment Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-16 17:38:31 +01:00
Eric Engestrom	7a1f4340b6	anv: drop cast-to-void of used variable `device` is used 2 lines below, even visible in the diff context printed. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-16 17:38:31 +01:00
Eric Engestrom	6cf0d4f91f	anv: use safer snprintf() to ensure NULL string-terminator Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-16 17:38:31 +01:00
Eric Engestrom	d6aea40326	intel/batch-decoder: replace local ARRAY_LENGTH() macro with global ARRAY_SIZE() Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-16 17:38:31 +01:00
Eric Engestrom	81c1989e4f	intel: various python cleanups Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-16 17:38:25 +01:00
Eric Engestrom	aa78b29eba	egl: check for buffer overflow before corrupting our memory Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-16 17:38:22 +01:00
Eric Engestrom	eb6b41749b	egl/wayland: remove sign from bitfield `formats` Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-16 17:38:18 +01:00
Eric Engestrom	c5d9b48a71	mailmap: add various typos of Emil's address from the log Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-16 17:38:04 +01:00
Eric Engestrom	882ed53946	egl: some spelling fixes Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-16 14:15:18 +01:00
Samuel Pitoiset	f9e8456c39	radv: initialize the DCC predicate correctly when it's compressed We have to do a fast-clear eliminate when clearing DCC metadata with 0x20202020. I don't know if that fixes anything but that seems correct to me. CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-16 14:11:51 +02:00
Samuel Pitoiset	f3a78a9da0	radv: fix missing initialization of the conditional rendering state This was missing when VK_EXT_conditional_rendering has been implemented. The predication type should be -1 to avoid restoring previous state when performing a decompression pass with DCC enabled. Note that we don't have to handle secondary command buffers because we don't support this feature currently. CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-16 14:11:48 +02:00
Eric Engestrom	c5dd02287f	bin: split `write_if_different()` out Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-16 12:33:35 +01:00
Eric Engestrom	c2e00f9eee	bin: whitespace cleanup Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-16 12:30:30 +01:00
Bas Nieuwenhuizen	011a811652	radv: Revert divisor = 0 case for vertex attribute extension. Seems like DXVK depends on that and it might get reverted upstream. Since apps are not supposed to use 0 in v2 anyway, we should be safe implementing the old behavior there. Fixes: `66e12451ac` "radv: Update to new VK_EXT_vertex_attribute_divisor to version 2." CC: 18.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-16 11:13:19 +02:00
Bas Nieuwenhuizen	3308db2dd7	radv: Possible on-demand compilation fix. Seems that in a single case we use the renderpass before checking the pipeline, so check the renderpass before we use it. Fixes: `fbcd167314` "radv: Add on-demand compilation of built-in shaders." Tested-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-16 11:13:19 +02:00
Gert Wollny	1560c58b12	mesa/st: fix array indices off-by-one error in remapping When moving the array sizes from the old list to the new one it was not taken into account that the array indices start with one, but the array_size array started at index zero, which resulted in incorrect array sizes when arrays were merged. Correct this by copying the array_size values of the retained arrays with an offset of -1. Also fix whitespaces for the replaced lines. Fixes: `d8c2119f9b` mesa/st/glsl_to_tgsi: Expose array live range tracking and merging Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-16 08:52:26 +02:00
Alexander Tsoy	9a96bf0ecd	meson: fix build for egl platform_x11 without dri3 and gbm Compiling EGL's platform_x11 without dri3 and gbm yields this compile failure: platform_x11 needs inc_loader: ../mesa-18.2.0-rc2/src/egl/drivers/dri2/platform_x11.c:48:10: fatal error: loader.h: No such file or directory #include "loader.h" ^~~~~~~~~~ Fixes: `108d257a16` ("meson: build libEGL") Bugzilla: https://bugs.gentoo.org/663534 Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-08-15 16:37:16 -07:00
Jason Ekstrand	10f44da775	Revert "intel/nir: Call nir_lower_io_to_scalar_early" Commit `4434591bf5` caused substantially more URB messages in geometry and tessellation shaders. Before we can really enable this sort of optimization, We either need some way of combining them back together into vectors or we need to do cross-stage vector element elimination without splitting everything into scalars. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107510 Fixes: `4434591bf5` "intel/nir: Call nir_lower_io_to_scalar_early" Acked-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Mark Janes <mark.a.janes@intel.com>	2018-08-15 17:56:50 -05:00
Erik Faye-Lund	da1f7c56da	i965: do not emit empty surface state If called with an empty size, brw_emit_buffer_surface_state asserts. We already have a dedicated helper for uploading nothing, so let's use that instead. Avoids an assert in dEQP-GLES31.functional.shaders.opaque_type_indexing.ssbo.const_literal_vertex when running a debug build of i965. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-15 23:23:16 +01:00
Sergii Romantsov	743dff1cca	intel/ppgtt: 4096 replaced by PAGE_SIZE Usage of number 4096 replaced by PAGE_SIZE. Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-15 23:23:16 +01:00
Sergii Romantsov	24839663a4	intel/ppgtt: memory address alignment Kernel (for ppgtt) requires memory address to be aligned to page size (4096). -v2: added marking that also fixes initial commit `01058a5522`. -v3: numbers replaced by PAGE_SIZE; buffer-object size is aligned instead of alignment of offsets (Chris Wilson). -v4: changes related to PAGE_SIZE moved to separate commit -v5: restored alignment to page-size for 0-size. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106997 Fixes: `a363bb2cd0` (i965: Allocate VMA in userspace for full-PPGTT systems.) Fixes: `01058a5522` (i965: Add virtual memory allocator infrastructure to brw_bufmgr.) Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-15 23:23:16 +01:00
Timothy Arceri	f0a8accb0d	radv: add Doom workaround Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-16 07:53:38 +10:00
Sergii Romantsov	efb28aa970	i965: Emitting 3DSTATE_SO_BUFFER of 0-size. Avoided filling of whole structure and bo-allocation if size of surface is 0. Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com>	2018-08-15 13:15:28 -07:00
Erik Faye-Lund	98b3b6367a	virgl: report actual max-texture sizes Instead of doing conservative guesses, we should report the max levels based on the max sizes we get from GL on the host. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jakob Bornecrantz <jakob@collabora.com>	2018-08-15 18:48:16 +02:00
Erik Faye-Lund	825aaeae39	virgl: do not use SP_MAX_TEXTURE_*_LEVELS defines These macro-names are also used for softpipe, so let's avoid confusion by avoiding them. Besides, they are just used in one place in virgl, so let's just inline them into the place they are used instead. While we're at it, fixup an error in the comment for the 3D version. Mesa subtracts computes max-size by doing by 2^(n-1), which means this should be 256 cubed, not 512 cubed. The other comments are correct. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jakob Bornecrantz <jakob@collabora.com>	2018-08-15 18:48:08 +02:00
Dylan Baker	ef7ae84daf	docs: Add news item for 18.1.6	2018-08-15 09:09:59 -07:00
Samuel Pitoiset	71d5b2fbf8	radv: disable the auto-waitcnt-before-barrier LLVM option This option allows us to remove additional s_waitcnt instructions because s_barrier internally does s_waitcnt 0. Though, apparently there is a problem with LDS accesses that causes rendering issues with FFXV and DXVK. Disable this optimization for now (RadeonSI still uses it). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107460 CC: 18.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-15 16:21:50 +02:00
Samuel Pitoiset	85113c4d05	radv: fix memory leaks in radv_load_meta_pipeline() Reported by Coverity. Fixes: `fbcd167314` ("radv: Add on-demand compilation of built-in shaders.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-15 16:20:58 +02:00
Samuel Pitoiset	17e79865cf	radv: drop wrong initialization of COMPUTE_RESOURCE_LIMITS The last parameter of radeon_set_sh_reg_seq() is the number of dwords to emit. We were lucky because WAVES_PER_SH(0x3) is 3 but it was initialized to 0. COMPUTE_RESOURCE_LIMITS is correctly set when generating compute pipelines, so we don't need to initialize it. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-15 16:20:38 +02:00
Andres Gomez	53b4701cb0	docs: update calendar 18.2.0-rc3 is out Signed-off-by: Andres Gomez <agomez@igalia.com>	2018-08-15 15:48:18 +03:00
Mauro Rossi	43318d5857	radv/meta_decompress: fix pointer to integer conversion VK_NULL_HANDLE replaces NULL to avoid following building error: external/mesa/src/amd/vulkan/radv_meta_decompress.c:365:54: error: incompatible pointer to integer conversion passing 'void ' to parameter of type 'VkShaderModule' (aka 'unsigned long long') [-Werror,-Wint-conversion] VkResult ret = create_pipeline(cmd_buffer->device, NULL, samples, ^~~~ prebuilts/clang/host/linux-x86/clang-4053586/lib64/clang/5.0.300080/include/stddef.h:105:16: note: expanded from macro 'NULL' # define NULL ((void)0) ^~~~~~~~~~ external/mesa/src/amd/vulkan/radv_meta_decompress.c:97:32: note: passing argument to parameter 'vs_module_h' here VkShaderModule vs_module_h, ^ 1 error generated. Fixes: `fbcd167314` ("radv: Add on-demand compilation of built-in shaders.") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-08-15 14:34:50 +02:00
Mauro Rossi	73b342c7a5	egl/android: fix regression in drm_gralloc path (v2) This patch fixes a regression in mesa 18.2 and mesa-dev branches for HAVE_DRM_GRALLOC code path which is causing black screen on Android and prevents boot due to SIGSEGV MAPERR crash related to unproper handling of drm_gralloc drm FD in new droid_open_device() path. Problem is due to `c7bb82136b` ("egl/android: Add DRM node probing and filtering") To avoid the crash the former existing working droid_open_device() is restored, renamed droid_open_device_drm_gralloc() and kept within HAVE_DRM_GRALLOC braces. Tested with mesa-dev and mesa 18.2 branch and oreo-x86 bootanimation and Androdi GUI booting is fixed with i965, nouveau, radeon. The changes are compatible with gbm_gralloc, I've tested build with hwc too. (v2) remove indentation from HAVE_DRM_GRALLOC pre-processor directive NOTE: Definition of enum{} for GRALLOC_MODULE_PERFORM_GET_DRM_FD is not necessary and it's actually causing a redefinition building error, because in HAVE_DRM_GRALLOC path gralloc_drm.h is already exported by libgralloc_drm which is currently still a dependency. Fixes: `c7bb82136b` ("egl/android: Add DRM node probing and filtering") Cc: "18.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Mauro Rossi <issor.oruam@gmail.com>	2018-08-15 14:07:49 +02:00
Tapani Pälli	656ccf4ef8	mesa: shader dump/read support for ARB programs Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106283 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2018-08-15 11:03:35 +03:00
Danylo Piliaiev	479a849ad6	glsl: Avoid calling get_array_element for scalar constants Accessing scalar constant as an array in function call or initializer list triggered assert in get_array_element. Examples: func(0[0]); vec2 t = { 0[0], 0 }; Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107550 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-15 10:01:43 +03:00
Marek Olšák	bffa025ada	radeonsi: enable 1 missing PS_SU perf counter on Polaris	2018-08-14 21:20:31 -04:00
Marek Olšák	df50099834	radeonsi: use radeon_info::name Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-14 21:20:31 -04:00
Marek Olšák	84652721b9	ac: add radeon_info::name Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-14 21:20:31 -04:00
Marek Olšák	de8d5edbc4	radeonsi: split si_clear_buffer to remove enum si_method Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:21:12 -04:00
Marek Olšák	4de92f2abb	radeonsi: replace CP_DMA_USE_L2 with enum si_cache_policy Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:21:10 -04:00
Marek Olšák	bc132d62f9	radeonsi: declare coher in si_copy_buffer Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:21:09 -04:00
Marek Olšák	cddd7ce325	radeonsi: make PFP_SYNC_ME an explicit CP DMA flag Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:21:07 -04:00
Marek Olšák	277295962c	radeonsi: don't use emit_data->args in load_emit Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:21:06 -04:00
Marek Olšák	8fb34050b5	radeonsi: don't use emit_data->args in store_emit Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:21:04 -04:00
Marek Olšák	a2c18bfbe3	radeonsi: don't use emit_data->args in atomic_emit Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:21:03 -04:00
Marek Olšák	297fb213b3	radeonsi: don't use emit_data->args in build_interp_intrinsic Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:21:01 -04:00
Marek Olšák	99ae440d4e	radeonsi: inline atomic_fetch_args Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:20:59 -04:00
Marek Olšák	267e92893c	radeonsi: inline store_fetch_args Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:20:58 -04:00
Marek Olšák	f15e55aa8a	radeonsi: inline load_fetch_args Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:20:56 -04:00
Marek Olšák	2c94f321eb	radeonsi: merge txq_emit and resq_emit Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:20:55 -04:00
Marek Olšák	a14c803166	radeonsi: inline resq_fetch_args Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:20:54 -04:00
Marek Olšák	347e52adcd	radeonsi: inline txq_fetch_args Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:20:52 -04:00
Marek Olšák	c9b2ce2672	radeonsi: use get_resinfo directly in lower_gather4_integer Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:20:36 -04:00
Marek Olšák	7804ddaf87	radeonsi: inline tex_fetch_args into build_tex_intrinsic The diff looks like it moves code that I didn't touch. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:20:34 -04:00
Marek Olšák	da1d8adc29	radeonsi: remove fetch_args callbacks for ALU instructions Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:20:33 -04:00
Marek Olšák	ac72a6bd0b	radeonsi: move internal TGSI shaders into si_shaderlib_tgsi.c Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:20:31 -04:00
Marek Olšák	0ca8294ece	radeonsi: implement EXT_window_rectangles Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:19:02 -04:00
Marek Olšák	465e929d6a	gallium/u_blitter: save/restore window rectangles Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:19:01 -04:00
Marek Olšák	15fc0f8d4a	noop: implement set_window_rectangles Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:18:59 -04:00
Marek Olšák	7c8716e4fb	ddebug: implement set_window_rectangles Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 21:18:51 -04:00
Rodrigo Vivi	44f1dcf9b3	i965: Add a new CFL PCI ID. One more CFL ID added to spec. Align with kernel commit d0e062ebb3a4 ("drm/i915/cfl: Add a new CFL PCI ID.") Cc: José Roberto de Souza <jose.souza@intel.com> Cc: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-14 15:46:56 -07:00
Rob Clark	70bf639328	freedreno/ir3: add support for a6xx 'merged' register set Starting with a6xx, half and full precision registers conflict. Which makes things a bit more efficient, ie. if some parts of the shader are heavy on half-precision and others on full precision, you don't have to allocate the worst case for both. But it means we need to setup some additional conflicts. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-14 17:59:02 -04:00
Rob Clark	4813060ed4	freedreno/ir3: small RA cleanup Collapse is_temp() into it's only callsite, and pass compiler object as struct rather than void. Just cleanups to reduce noise in next patch. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-14 17:59:02 -04:00
Rob Clark	fdd35f497b	freedreno/ir3: stop hard-coding FS input regs We originally did this because at the time we didn't know all the bitfields to configure where various frag shader sysval's went. But we do. So switch to using sysvals for all the frag shader inputs. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-14 17:59:02 -04:00
Rob Clark	e97b56172c	freedreno/ir3: use r63.x for unused inputs This way, unused sysval inputs, like frag_vcoord, get the correct regid value to disable the input. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-14 17:59:02 -04:00
Rob Clark	066930e54d	freedreno/ir3: create all inputs in first block create_input()/create_input_compmask() should take the ctx as arg, rather than block, to enforce that all inputs are created in the first block, so that RA sees them as live at the start of the shader. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-14 17:59:02 -04:00
Rob Clark	62da068fd3	freedreno/ir3: rename s/frag_pos/frag_vcoord/g Make it more clear that this is varying fetch related. Also fixup some comments. Just cleanup for next patches. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-14 17:59:02 -04:00
Rob Clark	4a7f9feada	compiler: add SYSTEM_VALUE_VARYING_COORD Used internally in freedreno/ir3 for the vec2 value that hw passes to shader to use as coordinate for bary.f (varying fetch) instruction. This is not the same as SYSTEM_VALUE_FRAG_COORD. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-14 17:59:02 -04:00
Rob Clark	b5a098b202	freedreno/ir3: move per-generation compiler config Move it from the compile ctx to the compiler object, before adding new things for a6xx. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-14 17:59:02 -04:00
Bas Nieuwenhuizen	66e12451ac	radv: Update to new VK_EXT_vertex_attribute_divisor to version 2. Behavior wrt firstInstance got changed, and a divisor of 0 has been disallowed. The new version of the ext got published in specification 1.1.81. Sending to stable since the only known user is DXVK, which needs this for correctness. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> CC: 18.2 <mesa-stable@lists.freedesktop.org>	2018-08-14 22:13:09 +02:00
Bas Nieuwenhuizen	4bb6c49375	radv: Allow ETC2 on RAVEN and VEGA10 instead of all GFX9. Follow radeonsi. Fixes: `3665f66ef2` "radv: Add support for ETC2 textures." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 22:11:04 +02:00
Bas Nieuwenhuizen	bf33ca7512	radv: Fix missing Android platform define. CC: <mesa-stable@lists.freedesktop.org> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-08-14 22:11:04 +02:00
Rob Clark	13b9d32fb1	freedreno: move free() into fdN_context_destroy() Following patches will be doing further cleanup after calling fd_context_destroy() so it is easier if we move the free() into the per-gen backend code. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-14 15:46:34 -04:00
Jonathan Marek	dc9705f30d	freedreno: a2xx: ir2 update this patch brings a number of changes to ir2: -ir2 now generates CF clauses as necessary during assembly. this simplifies fd2_program/fd2_compiler and is necessary to implement optimization passes -ir2 now has separate vector/scalar instructions. this will make it easier to implementing scheduling of scalar+vector instructions together. dst_reg is also now seperate from src registers instead of a single list -ir2 now implements register allocation. this makes it possible to compile shaders which have more than 64 TGSI registers -ir2 now implements the following optimizations: removal of IN/OUT MOV instructions generated by TGSI and removal of unused instructions when some exports are disabled -ir2 now allows full 8-bit index for constants -ir2_alloc no longer allocates 4 times too many bytes Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-14 12:46:25 -04:00
Andres Gomez	5406eb5513	docs: update calendar 18.2.0-rc1 and 18.2.0-rc2 are out Signed-off-by: Andres Gomez <agomez@igalia.com>	2018-08-14 17:07:09 +03:00
Bas Nieuwenhuizen	fbcd167314	radv: Add on-demand compilation of built-in shaders. In environments where we cannot cache, e.g. Android (no homedir), ChromeOS (readonly rootfs) or sandboxes (cannot open cache), the startup cost of creating a device in radv is rather high, due to compiling all possible built-in pipelines up front. This meant depending on the CPU a 1-4 sec cost of creating a Device. For CTS this cost is unacceptable, and likely for starting random apps too. So if there is no cache, with this patch radv will compile shaders on demand. Once there is a cache from the first run, even if incomplete, the driver knows that it can likely write the cache and precompiles everything. Note that I did not switch the buffer and itob/btoi compute pipelines to on-demand, since you cannot really do anything in Vulkan without them and there are only a few. This reduces the CTS runtime for the no caches scenario on my threadripper from 32 minutes to 8 minutes. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-14 10:26:24 +02:00
Bas Nieuwenhuizen	24a9033d6f	radv: Refactor blit pipeline creation. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-14 10:26:11 +02:00
Bas Nieuwenhuizen	806a792b43	radv: Make fs key exemplars ordered to be a reverse fs_key lookup. While at it, share the exemplars and account for a non-occurring fs key. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-08-14 10:26:06 +02:00
Dave Airlie	0be5e9f5a1	virgl: ARB_texture_barrier support Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2018-08-14 16:55:56 +10:00
Dylan Baker	6d61aed231	docs: update calendar, add news item and link release notes for 18.1.6 Signed-off-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-13 10:06:45 -07:00
Dylan Baker	973ae7a06b	docs: Add sha256 sums for 18.1.6	2018-08-13 10:05:44 -07:00
Dylan Baker	66c8a64e67	docs: Add release notes for 18.1.6	2018-08-13 10:05:42 -07:00
Alejandro Piñeiro	668ab8aeb1	mesa/glspirv: fix compilation with MSVC From AppVeyor #8582, it seems that MSVC doesn't like uint, so this patch replaces it with unsigned. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-08-13 18:57:18 +02:00
Eric Engestrom	f976d22759	travis: install correct version of mako for each build system Meson now uses python3, so let's add a block for Autotools, move that line into the buildsys-specific blocks, and set the correct version for Meson. Fixes: `2ee1c86d71` "meson: Build with Python 3" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-13 17:29:42 +01:00
Erik Faye-Lund	ae5770171c	mesa/st/glsl_to_tgsi: fixup copy-paste mistake This is clearly a copy-paste error; if we validate the reladdr2-pointer, we don't want to traverse to the reladdr-pointer. Especially since the check above shows that reladdr could be NULL here. Noticed by Coverity. CID: 1438389, 1438390 Fixes: `568bda2f2d` ("mesa/st/glsl_to_tgsi: Split arrays whose elements are only accessed directly") Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Gert Wollny <gw.fossdev@gmail.com>	2018-08-13 18:15:36 +02:00
Neil Roberts	c91a5f70fb	i965/nir: Use the nir copy of shader_info to handle gl_PatchVerticesIn Instead of using the copy of shader_info stored in gl_program, it now uses the one in nir_shader. This is needed for SPIR-V because the info.tess.tcs_vertices_out is filled in via _mesa_spirv_to_nir which happens much later than with a GLSL shader. The copy of shader_data in gl_program is only updated later via brw_shader_gather_info but that is too late. For GLSL this shouldn't create any problems because the nir copy of the shader_info is immediately copied from the gl_program in glsl_to_nir. v2: updated after commit "i965: Combine both gl_PatchVerticesIn lowering passes." (488972) (Alejandro Piñeiro) Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-13 16:28:27 +02:00
Neil Roberts	a105c1e6e5	mesa/glspirv: Set separate_shader on shader_info The value is copied from the gl_program. If we don’t do this then it will get reset back to zero in brw_shader_gather_info. This isn’t a problem for GLSL because in that case the nir_shader is initialised with a copy of the shader_info from the gl_program. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-13 16:28:27 +02:00
Iago Toral Quiroga	40947d4744	mesa/glspirv: pick off the only entry point we need This is the same we do for vulkan drivers This is needed to pass the following CTS test: KHR-GL45.gl_spirv.spirv_modules_shader_binary_multiple_shader_objects_test Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-13 16:28:27 +02:00
Alejandro Piñeiro	32e1d4c34b	mesa/glspirv: compute double inputs and remap attributes input locations used by input attributes are not handled in the same way in OpenGL vs Vulkan. There is a detailed explanation of such differences on the following commit: `c2acf97fcc` So with this commit, the same adjustment that is done after glsl_to_nir, is being done after spirv_to_nir, when it is used on OpenGL (ARB_gl_spirv). Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-13 16:28:27 +02:00
Alejandro Piñeiro	d6c8066663	nir/glsl: make nir_remap_attributes public As we plan to reuse it for ARB_gl_spirv implementation. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-13 16:28:27 +02:00
Alejandro Piñeiro	af194bd38e	nir/lower_samplers: don't assume a deref for both texture and sampler srcs After commit "nir: Use derefs in nir_lower_samplers" (`75286c2d08`) assumes one deref for both the texture and the sampler. However there are cases (on OpenGL, using ARB_gl_spirv) where SPIR-V is not providing a sampler, like for texture query levels ops. Although we could make spirv_to_nir to provide a sampler deref for those cases, it is not really needed, and wrong from the Vulkan point of view. This patch fixes the following (borrowed) tests run on SPIR-V mode: arb_compute_shader/execution/basic-texelFetch.shader_test arb_gpu_shader5/execution/sampler_array_indexing/fs-simple-texture-size.shader_test arb_texture_query_levels/execution/fs-baselevel.shader_test arb_texture_query_levels/execution/fs-maxlevel.shader_test arb_texture_query_levels/execution/fs-miptree.shader_test arb_texture_query_levels/execution/fs-nomips.shader_test arb_texture_query_levels/execution/vs-baselevel.shader_test arb_texture_query_levels/execution/vs-maxlevel.shader_test arb_texture_query_levels/execution/vs-miptree.shader_test arb_texture_query_levels/execution/vs-nomips.shader_test glsl-1.30/execution/fs-textureSize-compare.shader_test v2: merge lower_tex_src_to_offset and calc_sampler_offsets together, update texture/sampler index and texture_array_size directly on lower_tex_src_to_offset (Jason) v3: clarify one comment (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-13 16:28:27 +02:00
Alejandro Piñeiro	fe2de39fb2	nir/linker: take into account hidden uniforms So they are not exposed through the introspection API. It is worth to note that the number of hidden uniforms of GLSL linking vs SPIR-V linking would be somewhat different due the differen order of the nir lowerings/optimizations. For example: gl_FbWposYTransform. This is introduced as part of nir_lower_wpos_ytransform. On GLSL that is executed after the IR-based linking. So that means that on GLSL the UniformStorage will not include this uniform. With the SPIR-V linking, that uniform is already present, but marked as hidden. So it will be included on the UniformStorage, but as hidden. One alternative would create a special how_declared for that case, but seemed an overkill. Using hidden should be ok as far as it is used properly. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-13 16:28:27 +02:00
Alejandro Piñeiro	5332d7582d	nir: add how_declared to nir_variable.data Equivalent to the already existing how_declared at GLSL IR. The only difference is that we are not adding all the declaration_type available on GLSL, only the one that we will use on the short term. We would add more mode if needed on the future. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-13 16:28:26 +02:00
Neil Roberts	be6f472b23	spirv: Make VertexIndex and VertexId both non-zero-based GLSL has gl_VertexID which is supposed to be non-zero-based. SPIR-V has both VertexIndex and VertexId builtins whose meanings are defined by the APIs. Vulkan defines VertexIndex as being non-zero-based. In Vulkan VertexId and InstanceId have no meaning and are pretty much just reserved for OpenGL at this point. GL_ARB_spirv removes VertexIndex and defines VertexId to be the same as gl_VertexId (which is also non-zero-based). Previously in Mesa it was treating VertexIndex as non-zero-based and VertexId as zero-based, so it was breaking for GL. This behaviour was apparently based on Khronos bug 14255. However that bug doesn’t seem to have made a final decision for VertexId. Assuming there really is no other definition for VertexId for Vulkan it seems better to just make them both have the same value. v2: update comment and commit descriptions, based on Jason Ekstrand explanation of the meaning/rationale behind all those builtins (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-13 16:23:36 +02:00
Alejandro Piñeiro	624c00f1a6	spirv: fill info.gs.input_primitive too info.gs.output_primitive was already being filled. Not sure why this is not needed on Vulkan, but we found to be needed for ARB_gl_spirv. Specifically, this is needed to get the following test passing: KHR-GL45.gl_spirv.spirv_validation_builtin_variable_decorations_test Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-13 12:56:51 +02:00
Tapani Pälli	ed94a5799d	docs/features: mark GL_EXT_render_snorm as done for i965 Signed-off-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-13 13:08:22 +03:00
Tapani Pälli	fa9e6c235d	i965: enable EXT_render_snorm Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2018-08-13 12:03:17 +03:00
Tapani Pälli	0d356cf478	mesa: enable EXT_render_snorm extension Patch sets additional formats renderable and enables the extension when OpenGL ES 3.1 is supported. v2: instead of dummy_true, have a separate toggle for extension (Eric Anholt) v3: add missing checks, simplify some existing checks and fix glCopyTexImage2D check (Nanley Chery) add SHORT and BYTE support in read_pixels_es3_error_check Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2018-08-13 12:03:17 +03:00
Kenneth Graunke	de57926dc9	blorp: Properly handle Z24X8 blits. One of the reasons we didn't notice that R24_UNORM_X8_TYPELESS destinations were broken was that an earlier layer was swapping it out for B8G8R8A8_UNORM. That made Z24X8 -> Z24X8 blits work. However, R32_FLOAT -> R24_UNORM_X8_TYPELESS was still totally broken. The old code only considered one format at a time, without thinking that format conversion may need to occur. This patch moves the translation out to a place where it can consider both formats. If both are Z24X8, we continue using B8G8R8A8_UNORM to avoid having to do shader math workarounds. If we have a Z24X8 destination, but a non-matching source, we use our shader hacks to actually render to it properly. Fixes: `804856fa57` (intel/blorp: Handle more exotic destination formats) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-11 12:34:01 -07:00
Kenneth Graunke	8a29086285	blorp: Don't try to use R32_UNORM for R24_UNORM_X8_TYPELESS rendering. The hardware doesn't support rendering to R24_UNORM_X8_TYPELESS, so Jason decided to fake it with a bit of shader math and R32_UNORM RTs. The only problem is that R32_UNORM isn't renderable either...so we've just traded one bad format for another. This patch makes us use R32_UINT instead. Fixes: `804856fa57` (intel/blorp: Handle more exotic destination formats) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-11 12:33:27 -07:00
Jason Ekstrand	a9f7bcfdf9	intel: Switch the order of the 2x MSAA sample positions The Vulkan 1.1.82 spec flipped the order to better match D3D. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2018-08-11 10:58:12 -05:00
Gert Wollny	8a87138885	mesa/st/tests: Add array life range estimation and renumbering tests Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	0981fc84df	mesa/st/tests: Add array life range tests infrastructure to common test class Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	d8c2119f9b	mesa/st/glsl_to_tgsi: Expose array live range tracking and merging This patch ties in the array split, merge, and interleave code. shader-db changes in the TGSI code are: original code \| array-merge \| change mean max \| mean max \| best mean % worst ----------------------------------------------------------- arrays 0.05 2 \| 0.00 0 \| -2 -100 0 total temps 5.05 21 \| 4.92 20 \| -15 -2.59 1 instr 55.33 988 \| 55.20 988 \| -15 -0.24 0 Evaluation: Run shader-db in single thread mode (otherwise the output is not ordered and the best and worst column don't make sense) to get results pre-stats.txt and post-stats.txt. Then using python pandas: import pandas as pd old_stats = pd.read_csv('pre-stats.txt') new_stats = pd.read_csv('post-stats.txt') omean = old_stats.mean() omax = old_stats.max() nmean = new_stats.mean() nmax = new_stats.max() delta = new_stats - old_stats pd.concat([omean, omax, nmean, nmax, delta.min(), delta.mean()/old_stats.mean()*100, delta.max()], axis=1, keys=['mean', 'max', 'mean', 'max', 'best', 'avg change %', 'worst']) v4: - Correct typo and add bugs that are fixed by this series. - Update stats and describe stats evaluation Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105371 https://bugs.freedesktop.org/show_bug.cgi?id=100200 Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	c317d0ab54	mesa/st/glsl_to_tgsi: add array life range evaluation into tracking code v4: Also track the register given in inst->resource. (thanks: Benedikt Schemmer for testing the patches on radeonsi, which revealed that I was missing tracking this) Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	5e58eb37f1	mesa/st/glsl_to_tgsi: add class for array access tracking Because of the indirect access it is impossible to obtain an accurate per component and array element tracking. Therefore, the tracking is simplified to only track whether any element was accessed, whether this happend conditionally in a loop. In addition, while tracking of temporaries requires a per-componet tracking that is later fused, for arrays only the components access mask is neede. The resulting tracking code and evaluation of the array live range is sufficiently different from the evaluation of the live range of temporaries to justify implementing this in a different class instead of adding more complexity to the already existing code for temporary life range evaluation. v4: Update commit message to make it clearer why this class is seperate from the tracking of temporaries. Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	7d55d01b53	mesa/st/glsl_to_tgsi: move evaluation of read mask up in the call hierarchy In preparation of the array live range tracking the evaluation of the read mask is moved out the register live range tracking to the enclosing call of the generalized read access tracking. Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	f2a4636339	mesa/st/glsl_to_tgsi: rename access_record to register_merge_record and some more renames In preparartion of adding the tracking of the live range the classes that refer to temporary registers are renamed. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	8c89728889	mesa/st/tests: Add tests for array merge helper classes. v2: - Define tests also in the meson.build file. v4: - Check no-op mapping of all bits. - Convert tests to the new class layout used in the merge evaulation. - remove dependency on llvm in meson build (Thanks Dylan Baker for pointing out that this might not needed) Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	12316aa217	mesa/st/glsl_to_tgsi: Add array merge logic v4: - Update the code to use the new merge logic. - Use a cleaner, class-based approach for the evaluation of merges. Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	d097ef4204	mesa/st/glsl_to_tgsi: Add helper classes to apply array merging and interleaving v4: - Remove logic for evaluation of swizzles and merges since this was moved to array_live_range. This class now only handles the actual remapping. Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	d54c2f92f9	mesa/st/glsl_to_tgsi: Add helper class for array live range merging and interleaving This class holds the array length, live range, and accessed components, and it implements the logic for evaluating how arrays are merged and interleaved. v4: - Add logic to evaluate merge and interleave of a pair of arrays to the class array_live_range. - document class - update commit message Thanks Nicolai Hähnle for the pointers given. Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	331ae3cde5	mesa/st/glsl_to_tgsi:rename lifetime to register_live_range On one hand "live range" is the term used in the literature, and on the other hand a distinction is needed from the array live ranges. v4: Fix indentions and white spaces Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (v3) Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	f40c9d0225	mesa/st/glsl_to_tgsi: Properly resolve life times simple if/else + use constructs in constructs like below, currently the live range estimation extends the live range of t unecessarily to the whole loop because it was not detected that t is unconditional written and later read only in the "if (a)" scope. while (foo) { ... if (a) { ... if (b) t = ... else t = ... x = t; ... } ... } This patch adds a unit test for this case and corrects the minimal live range estimation accordingly. v4: update comments Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	568bda2f2d	mesa/st/glsl_to_tgsi: Split arrays whose elements are only accessed directly Array whose elements are only accessed directly are replaced by the according number of temporary registers. By doing so the otherwise reserved register range becomes subject to further optimizations like copy propagation and register merging. Thanks to the resulting reduced register pressure this patch makes the piglits spec/glsl-1.50/execution - variable-indexing/vs-output-array-vec3-index-wr-before-gs geometry/max-input-components pass on r600 (barts) where they would fail before with a "GPR limit exceeded" error (even with the spilling that was recently added). v2: * rename method dissolve_arrays to split_arrays * unify the tracking and remapping methods for src and dst registers * also track access to arrays via reladdr* v3: * enable this optimization only if the driver requests register merge v4: * Correct comments * Also update inst->resource if it is an array element (thanks: Benedikt Schemmer for testing the patches on radeonsi, which revealed that I was missing tracking this) Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	b1cead3add	mesa/st/glsl_to_tgsi: Add method to collect some TGSI statistics When mesa is compiled in debug mode then this adds the possibility to print out some statistics about the translated and optimized TGSI shaders to a file. The functionality is enabled by setting the environment variable GLSL_TO_TGSI_PRINT_STATS to the file name where the statistics should be collected. The file is opened in append mode so that statistics from various runs will be accumulated. v4: Make accress to log file thread save (thanks for pointing this out Nicolai Hähnle) Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-11 12:32:42 +02:00
Gert Wollny	be95ca9be7	Gallium/tgsi: Correct signdness of return value of bit operations The GLSL operations findLSB, findMSB, and countBits always return a signed integer type. Let TGSI reflect this. v2: Properly set values in infer_(src\|dst)_type (Thanks Roland Schneidegger for pointing out problems with my 1st approach) v2: Set values in the common infer_type code path, and only add the correct source type for UMSB (Roland Schneidegger) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-08-11 11:14:29 +02:00
Mathieu Bridon	2ee1c86d71	meson: Build with Python 3 Now that all the build scripts are compatible with both Python 2 and 3, we can flip the switch and tell Meson to use the latter. Since Meson already depends on Python 3 anyway, this means we don't need two different Python stacks to build Mesa. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-10 15:15:09 -07:00
Mathieu Bridon	bd27203f4d	python: Rework bytes/unicode string handling In both Python 2 and 3, opening a file without specifying the mode will open it for reading in text mode ('r'). On Python 2, the read() method of a file object opened in mode 'r' will return byte strings, while on Python 3 it will return unicode strings. Explicitly specifying the binary mode ('rb') then decoding the byte string means we always handle unicode strings on both Python 2 and 3. Which in turns means all re.match(line) will return unicode strings as well. If we also make expandCString return unicode strings, we don't need the call to the unicode() constructor any more. We were using the ugettext() method because it always returns unicode strings in Python 2, contrarily to the gettext() one which returns byte strings. The ugettext() method doesn't exist on Python 3, so we must use the right method on each version of Python. The last hurdles are that Python 3 doesn't let us concatenate unicode and byte strings directly, and that Python 2's stdout wants encoded byte strings while Python 3's want unicode strings. With these changes, the script gives the same output on both Python 2 and 3. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-10 15:14:48 -07:00
Mathieu Bridon	15ac05fd45	python: Fix inequality comparisons On Python 3, executing `foo != bar` will first try to call foo.__ne__(bar), and fallback on the opposite result of foo.__eq__(bar). Python 2 does not do that. As a result, those __eq__ methods were never called, when we were testing for inequality. Expliclty adding the __ne__ methods fixes this issue, in a way that is compatible with both Python 2 and 3. However, this means the __eq__ methods are now called when testing for `foo != None`, so they need to be guarded correctly. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-10 08:45:59 -07:00
Gert Wollny	e94095ec30	mesa/st: ETC2 now uses R8G8B8A8_SRGB as fallback The check for ETC2 compatibility was not updated when the fallback format was changed. Fixes: `71867a0a61` st/mesa: Fall back to R8G8B8A8_SRGB for ETC2 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-10 10:09:22 +02:00
Mathieu Bridon	08fe9b3e3a	python: Simplify list sorting Instead of copying the list, then sorting the copy in-place, we can just get a new sorted copy directly. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-09 16:49:19 -07:00
Mathieu Bridon	8d3ff6244c	python: Use key-functions when sorting containers In Python 2, the traditional way to sort containers was to use a comparison function (which returned either -1, 0 or 1 when passed two objects) and pass that as the "cmp" argument to the container's sort() method. Python 2.4 introduced key-functions, which instead only operate on a given item, and return a sorting key for this item. In general, this runs faster, because the cmp-function has to get run multiple times for each item of the container. Python 3 removed the cmp-function, enforcing usage of key-functions instead. This change makes the script compatible with Python 2 and Python 3. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-09 16:49:19 -07:00
Mathieu Bridon	1e668ca111	python: Better check for integer types Python 3 lost the long type: now everything is an int, with the right size. This commit makes the script compatible with Python 2 (where we check for both int and long) and Python 3 (where we only check for int). Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-09 16:49:19 -07:00
Mathieu Bridon	14f1ab998f	python: Do not mix bytes and unicode strings Mixing the two is a long-standing recipe for errors in Python 2, so much so that Python 3 now completely separates them. This commit stops treating both as if they were the same, and in the process makes the script compatible with both Python 2 and 3. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-09 16:49:19 -07:00
Mathieu Bridon	c644b2d7a7	python: Explicitly use a list On Python 2, the builtin functions filter() returns a list. On Python 3, it returns an iterator. Since we want to use those objects in contexts where we need lists, we need to explicitly turn them into lists. This makes the code compatible with both Python 2 and Python 3. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-09 16:49:18 -07:00
Mathieu Bridon	d9ca4a172e	python: Use the right function for the job The code was just reimplementing itertools.combinations_with_replacement in a less efficient way. This does change the order of the results slightly, but it should be ok. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-09 16:49:18 -07:00
Eric Anholt	b618d7ea59	egl: Fix leak of X11 pixmaps backing pbuffers in DRI3. This is basically copied from the DRI2 destroy path. Without this, Raspberry Pi would quickly run out of CMA during the EGL tests in the CTS due to all the pixmaps laying around. Fixes: `f35198bade` ("egl/x11: Implement dri3 support with loader's dri3 helper") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-09 13:12:13 -07:00
Kenneth Graunke	08a5c395ab	intel: Fix SIMD16 unaligned payload GRF reads on Gen4-5. When the SIMD16 Gen4-5 fragment shader payload contains source depth (g2-3), destination stencil (g4), and destination depth (g5-6), the single register of stencil makes the destination depth unaligned. We were generating this instruction in the RT write payload setup: mov(16) m14<1>F g5<8,8,1>F { align1 compr }; which is illegal, instructions with a source region spanning more than one register need to be aligned to even registers. This is because the hardware implicitly does (nr \| 1) instead of (nr + 1) when splitting the compressed instruction into two mov(8)'s. I believe this would cause the hardware to load g5 twice, replicating subspan 0-1's destination depth to subspan 2-3. This showed up as 2x2 artifact blocks in both TIS-100 and Reicast. Normally, we rely on the register allocator to even-align our virtual GRFs. But we don't control the payload, so we need to lower SIMD widths to make it work. To fix this, we teach lower_simd_width about the restriction, and then call it again after lower_load_payload (which is what generates the offending MOV). Fixes: `8aee87fe4c` (i965: Use SIMD16 instead of SIMD8 on Gen4 when possible.) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107212 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=13728 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Diego Viola <diego.viola@gmail.com>	2018-08-09 12:33:41 -07:00
Kenneth Graunke	11b9f63a74	i965: Only enable depth IZ signals if there's an actual depthbuffer. According to the G45 PRM Volume 2 Page 265 we're supposed to only set these signals when there is an actual depth buffer. Note that we already do this for the stencil buffer by virtue of brw->stencil_enabled invoking _mesa_is_stencil_enabled(ctx) which checks whether the current drawbuffer's visual has stencil bits (which is updated based on what buffers are bound). We just need to do it for depth as well. Not observed to fix anything. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-09 12:33:38 -07:00
Adam Jackson	63a6b719d9	glx: GLX_MESA_multithread_makecurrent is direct-only This extension is not defined for indirect contexts. Marking it as "client only", as the old code did here, would make the extension available in indirect contexts, even though the server would certainly not have it in its extension list. Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-09 12:33:14 -04:00
Eric Engestrom	fcf259ef97	anv: set error in all failure paths Cc: Jason Ekstrand <jason.ekstrand@intel.com> Fixes: `5b196f39bd` "anv/pipeline: Compile to NIR in compile_graphics" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-09 11:20:27 +01:00
Eric Engestrom	aac80f7597	intel/tools: add missing variable initialisation Fixes: `6a60beba40` "intel/tools: Add an error state to aub translator" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-08-09 11:20:18 +01:00
vadym.shovkoplias	e0de26eacc	drirc: Allow extension midshader for Metro Redux This fixes both Metro 2033 Redux and Metro Last Light Redux Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=99730 Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com> Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-09 13:13:20 +03:00
Tapani Pälli	03a5acec68	glsl: handle error case with ast_post_inc, ast_post_dec Return ir_rvalue::error_value with ast_post_inc, ast_post_dec if parser error was emitted previously. This way process_array_size won't see bogus IR generated like with commit `9c676a6427`. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98699 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-08-09 13:07:16 +03:00
Eric Anholt	fdfb689a48	vc4: Implement texture_subdata() to directly upload tiled data. This avoids a memcpy into a temporary in the upload path. Improves x11perf -putimage100 performance by 12.1586% +/- 1.38155% (n=145)	2018-08-08 18:14:31 -07:00
Eric Anholt	25bee5ef9e	vc4: Handle partial loads/stores of tiled textures. Previously, we would load out the tile-aligned area, update the raster copy, and store it back. This was a huge cost for XPutImage calls to the screen under glamor. Instead, implement a general load/store path that walks over the source x/y writing into the corresponding pixel of the destination (using clever math from https://fgiesen.wordpress.com/2011/01/17/texture-tiling-and-swizzling/). If things are aligned, we go through the previous utile-at-a-time loop. Improves x11perf -putimage10 performance by 139.777% +/- 2.83464% (n=5) Improves x11perf -putimage100 performance by 383.908% +/- 22.6297% (n=11) Improves x11perf -getimage10 performance by 2.75731% +/- 0.585054% (n=145)	2018-08-08 16:45:44 -07:00
Eric Anholt	3e06b918aa	vc4: Compile the LT image helper per cpp we might load/store. For the partial load/store support I'm about to add, we want the memcpy to be compiled out to a single load/store. This should also eliminate the calls to vc4_utile_width/height(). Improves x11perf -putimage100 performance by 3.76344% +/- 1.16978% (n=15)	2018-08-08 15:53:25 -07:00
Eric Anholt	d6a174669f	vc4: Refactor to reuse the LT tile walking code.	2018-08-08 12:34:48 -07:00
Juan A. Suarez Romero	a9fb331ea7	wayland/egl: update surface size on window resize According to EGL 1.5 spec, section 3.10.1.1 ("Native Window Resizing"): "If the native window corresponding to _surface_ has been resized prior to the swap, _surface_ must be resized to match. _surface_ will normally be resized by the EGL implementation at the time the native window is resized. If the implementation cannot do this transparently to the client, then eglSwapBuffers must detect the change and resize surface prior to copying its pixels to the native window." So far, resizing a native window in Wayland/EGL was interpreted in Mesa as a request to resize, which is not executed until the first draw call. And hence, surface size is not updated until executing it. Thus, querying the surface size with eglQuerySurface() after a window resize still returns the old values. This commit updates the surface size values as soon as the resize is done, even when the real resize is done in the draw call. This makes the semantics that any native window resize request take effect inmediately, and if user calls eglQuerySurface() it will return the new resized values. v2: update surface size if there isn't a back surface (Daniel) CC: Daniel Stone <daniel@fooishbar.org> CC: mesa-stable@lists.freedesktop.org Reviewed-by: Daniel Stone <daniels@collabora.com>	2018-08-08 18:29:58 +02:00
Juan A. Suarez Romero	1fe7cbdf05	wayland/egl: initialize window surface size to window size When creating a windows surface with eglCreateWindowSurface(), the width and height returned by eglQuerySurface(EGL_{WIDTH,HEIGHT}) is invalid until buffers are updated (like calling glClear()). But according to EGL 1.5 spec, section 3.5.6 ("Surface Attributes"): "Querying EGL_WIDTH and EGL_HEIGHT returns respectively the width and height, in pixels, of the surface. For a window or pixmap surface, these values are initially equal to the width and height of the native window or pixmap with respect to which the surface was created" This fixes dEQP-EGL.functional.color_clears.* CTS tests v2: - Do not modify attached_{width,height} (Daniel) - Do not update size on resizing window (Brendan) CC: Daniel Stone <daniel@fooishbar.org> CC: Brendan King <brendan.king@imgtec.com> CC: mesa-stable@lists.freedesktop.org Tested-by: Eric Engestrom <eric@engestrom.ch> Tested-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Daniel Stone <daniels@collabora.com>	2018-08-08 18:28:52 +02:00
Juan A. Suarez Romero	f9d0e7d3bc	travis: make drivers explicit in Meson targets Like in the autotools target, make the list of drivers to be built in each of the Meson targets explicit. This will help to identify missing dependencies and other issues more easily. CC: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-08 17:56:32 +02:00
Brian Paul	51e878cdb3	svga: use pipe_sampler_view::target in svga_set_sampler_views() instead of the underlying texture's target. This fixes an issue where the TGSI sampler type was not agreeing with the sampler view target/type. In particular, this fixes a Mint 19 XFCE desktop scaling issue because the TGSI code was using a RECT sampler but the sampler view's underlying texture was PIPE_TEXTURE_2D. We want to use the sampler view's type rather than the underlying resource, as we do for the view's surface format. No piglit regressions. VMware issue 2156696. Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-08-08 08:20:10 -06:00
Brian Paul	92e5dc94ac	svga: use SVGA3D_RS_FILLMODE for vgpu9 I'm not sure why we didn't support this in the past, but fillmode is supported by all renderers nowadays. Also fix the logic in svga_create_rasterizer_state() to avoid a few swtnl case. No piglit regressions Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-08-08 08:20:10 -06:00
Brian Paul	a45b495700	svga: add TGSI_SEMANTIC_FACE switch case in svga_swtnl_update_vdecl() Fixes failed assertion running Piglit polygon-mode-face test. Though, the test still does not pass. Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-08-08 08:20:10 -06:00
Brian Paul	92e7342a6f	xlib: remove unused Fake_glXGetAGPOffsetMESA() function To silence compiler warning. Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-08 08:20:09 -06:00
Brian Paul	6ff4795c62	gl.h: define GLeglImageOES depending on GL_EXT_EGL_image_storage To avoid duplicate typedef with the definition in glext.h V2: test for both GL_OES_EGL_image and GL_EXT_EGL_image_storage in case both the GL and GLES headers are included. Per Emil. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107488 Tested-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Mathias Fröhlich <mathias.froehlich@web.de>	2018-08-08 08:20:01 -06:00
Emil Velikov	32aa7ff647	Android: copy -fnomath options from the autotools build Add -fno-math-errno and -fno-trapping-math to the build. Mesa does not depend on the functionality provided, thus this should result in slightly faster code and smaller binaries. Cc: Tapani Pälli <tapani.palli@intel.com> Cc: Rob Herring <robh@kernel.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-08 13:45:55 +01:00
Emil Velikov	315c46cfdc	autotools: use correct gl.pc LIBS when using glvnd This is more of a hack, since glvnd itself should be providing the file. Until that happens, ensure the libs is correctly set to -lGL CC: <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Adam Jackson <ajax@redhat.com>	2018-08-08 13:37:09 +01:00
Emil Velikov	8dc96416c9	glx: automake: add egl.pc/headers TODO when using glvnd Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Adam Jackson <ajax@redhat.com>	2018-08-08 13:37:09 +01:00
Emil Velikov	94ed4c4a16	egl: automake: add egl.pc/headers TODO when using glvnd Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Adam Jackson <ajax@redhat.com>	2018-08-08 13:37:09 +01:00
Emil Velikov	25a9450a44	autotools: error out when building with mangling and glvnd It's not a thing that can work, nor is a wise idea to attempt. v2: Tweak error message (Dylan) CC: <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Adam Jackson <ajax@redhat.com> (v1)	2018-08-08 13:37:09 +01:00
Emil Velikov	d5ac236471	autotools: error out when using the broken --with-{gl, osmesa}-lib-name The toggles were broken with the introduction of --enable-mangling. Fixing that up might be possible, but it's not worth the complexity since one can rename the libraries at any point. CC: <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Adam Jackson <ajax@redhat.com>	2018-08-08 13:37:09 +01:00
Emil Velikov	4f2b73d9fd	meson: recommend building the surfaceless platform It has no special requirements, size and build-time is effectively zero. v2: Rebase Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Adam Jackson <ajax@redhat.com>	2018-08-08 13:37:09 +01:00
Emil Velikov	a7ea7511ba	automake: require shared glapi when using DRI based libGL This has been a requirement for ages, yet it seems like we never explicitly errored out during configure. CC: <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Adam Jackson <ajax@redhat.com>	2018-08-08 13:37:09 +01:00
Emil Velikov	834036500c	ttn: remove {varying_slot, frag_result}_to_tgsi_semantic helpers The respective drivers have been updated and the helpers are no longer needed. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-08-08 13:33:07 +01:00
Juan A. Suarez Romero	db432194a1	travis: remove libedit-dev dependency in LLVM 6.0 targets In LLVM <6.0 we added explicitly libedit-dev, as it was required to satisfy apt dependencies. In LLVM 6.0, this is not required anymore, so let's remove it. CC: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-08 13:00:33 +02:00
Erik Faye-Lund	0f450e0cbe	glsl_to_tgsi: plumb image writable through to driver The virgl driver cares about the writable-flag on image definitions, because it re-emits GLSL from the TGSI. However, so far it was hardcoded to true in glsl_to_tgsi, which cause problems when virglrenderer is running on top of GLES 3.1, where not all formats are supported for writable images. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-08 09:35:09 +02:00
Eric Anholt	cfe69d0aaa	vc4: Fix vc4_fence_server_sync() on pre-syncobj kernels. We won't have an FD if we're just having the server wait on a fence created by eglCreateSyncKHR(). Our seqno fences will happen in order, so server-side waits are no-ops in that case. Fixes dEQP-EGL.functional.sharing.gles2.multithread.simple_egl_server_sync.buffers.gen_delete Fixes: `b0acc3a562` ("broadcom/vc4: Native fence fd support")	2018-08-07 17:00:49 -07:00
Eric Anholt	69158c452b	vc4: Ignore samplers for finding uniform offsets. Fixes: dEQP-GLES2.shaders.struct.uniform.sampler_array_fragment dEQP-GLES2.shaders.struct.uniform.sampler_array_vertex dEQP-GLES2.shaders.struct.uniform.sampler_nested_fragment dEQP-GLES2.shaders.struct.uniform.sampler_nested_vertex Cc: mesa-stable@lists.freedesktop.org	2018-08-07 17:00:22 -07:00
Eric Anholt	e24a8e5232	vc4: Extend dumping of uniforms in QIR and in the command stream. Similar to what I did for V3D, provide some description of the uniforms.	2018-08-07 17:00:22 -07:00
Eric Anholt	3954331aff	vc4: Pull uinfo->data[i] dereference out to the top of the loop. Reduces the size of vc4_uniforms.o by about 10%. We would basically always end up loading the cachline of uinfo->data[i] anyway, so it should be good for performance as well as making the code a bit cleaner.	2018-08-07 17:00:22 -07:00
Eric Anholt	550e9c917c	vc4: Make sure to emit a tile coordinates between two MSAA loads. The HW only executes a load once the tile coordinates packet happens, and only tracks one at a time, so by emitting our two MSAA loads back to back we would end up with an undefined color or Z buffer. The simulator doesn't seem to care, but sync up the RCL generation with the kernel anyway. Fixes dEQP-EGL.functional.render.multi_context.gles2.rgb888_window	2018-08-07 17:00:22 -07:00
Eric Anholt	9ab6912a00	vc4: Respect a sampler view's first_layer field. Fixes texturing from EGL images created from cubemap faces, as in dEQP-EGL.functional.image.create.gles2_cubemap_negative_x_rgba_texture Cc: mesa-stable@lists.freedesktop.org	2018-08-07 17:00:22 -07:00
Dave Airlie	fe0a3a45bb	virgl: add ARB_shader_clock support Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2018-08-08 08:36:40 +10:00
Mathieu Bridon	ba1ebf2ee1	python: Specify the template output encoding We're trying to write a unicode string (i.e decoded) to a file opened in binary (i.e encoded) mode. In Python 2 this works, because of the automatic conversion between byte and unicode strings. In Python 3 this fails though, as no automatic conversion is attempted. This change makes the scripts compatible with both versions of Python. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-07 13:28:35 -07:00
Mathieu Bridon	e1b88aee68	python: Fix rich comparisons Python 3 doesn't call objects __cmp__() methods any more to compare them. Instead, it requires implementing the rich comparison methods explicitly: __eq__(), __ne(), __lt__(), __le__(), __gt__() and __ge__(). Fortunately Python 2 also supports those. This commit only implements the comparison methods which are actually used by the build scripts. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-07 13:10:34 -07:00
Mathieu Bridon	9b6746b7c0	python: Use explicit integer divisions In Python 2, divisions of integers return an integer: >>> 32 / 4 8 In Python 3 though, they return floats: >>> 32 / 4 8.0 However, Python 3 has an explicit integer division operator: >>> 32 // 4 8 That operator exists on Python >= 2.2, so let's use it everywhere to make the scripts compatible with both Python 2 and 3. In addition, using __future__.division tells Python 2 to behave the same way as Python 3, which helps ensure the scripts produce the same output in both versions of Python. Signed-off-by: Mathieu Bridon <bochecha@daitauha.fr> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (v2) Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-07 13:07:44 -07:00
Chad Versace	3dc22381fa	egl/main: Add bits for EGL_KHR_mutable_render_buffer A follow-up patch enables EGL_KHR_mutable_render_buffer for Android. This patch is separate from the Android patch because I think it's easier to review the platform-independent bits separately. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-07 11:11:05 -07:00
Chad Versace	5c6d6eedb3	dri: Add param driCreateConfigs(mutable_render_buffer) If set, then the config will have __DRI_ATTRIB_MUTABLE_RENDER_BUFFER, which translates to EGL_MUTABLE_RENDER_BUFFER_BIT_KHR. Not used yet. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-07 11:11:05 -07:00
Chad Versace	bbe2d50b58	dri: Define DRI_MutableRenderBuffer extensions Define extensions DRI_MutableRenderBufferDriver and DRI_MutableRenderBufferLoader. These are the two halves for EGL_KHR_mutable_render_buffer. Outside the DRI code there is one additional change. Add gl_config::mutableRenderBuffer to match __DRI_ATTRIB_MUTABLE_RENDER_BUFFER. Neither are used yet. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-07 11:11:05 -07:00
Chad Versace	eabf59791e	egl/dri2: In dri2_make_current, return early on failure This pulls an 'else' block into the function's main body, making the code easier to follow. Without this change, the upcoming EGL_KHR_mutable_render_buffer patch transforms dri2_make_current() into spaghetti. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-07 11:11:05 -07:00
Chad Versace	f48f9a78da	egl: Simplify queries for EGL_RENDER_BUFFER There exist two queryable EGL_RENDER_BUFFER states in EGL: eglQuerySurface(EGL_RENDER_BUFFER) and eglQueryContext(EGL_RENDER_BUFFER). These changes eliminate potentially very fragile code in the upcoming EGL_KHR_mutable_render_buffer implementation. * eglQuerySurface(EGL_RENDER_BUFFER) The implementation of eglQuerySurface(EGL_RENDER_BUFFER) contained abstruse logic which required comprehending the specification complexities of how the two EGL_RENDER_BUFFER states interact. The function sometimes returned _EGLContext::WindowRenderBuffer, sometimes _EGLSurface::RenderBuffer. Why? The function tried to encode the actual logic from the EGL spec. When did the function return which variable? Go study the EGL spec, hope you understand it, then hope Mesa mutated the EGL_RENDER_BUFFER state in all the correct places. Have fun. To simplify eglQuerySurface(EGL_RENDER_BUFFER), and to improve confidence in its correctness, flatten its indirect logic. For pixmap and pbuffer surfaces, simply return a hard-coded literal value, as the spec suggests. For window surfaces, simply return _EGLSurface::RequestedRenderBuffer. Nothing difficult here. * eglQueryContext(EGL_RENDER_BUFFER) The implementation of this suffered from the same issues as eglQuerySurface, and the solution is the same. confidence in its correctness, flatten its indirect logic. For pixmap and pbuffer surfaces, simply return a hard-coded literal value, as the spec suggests. For window surfaces, simply return _EGLSurface::ActiveRenderBuffer. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-07 11:11:05 -07:00
Marek Olšák	d145e33e7c	radeonsi: set GLC=1 for all write-only shader resources	2018-08-07 13:52:34 -04:00
Marek Olšák	2ab8cf6de5	radeonsi: don't load block dimensions into SGPRs if they are not variable	2018-08-07 13:52:34 -04:00
Juan A. Suarez Romero	03cff7ecd8	travis: meson/Vulkan requires LLVM 6.0 RADV now requires LLVM 6.0. Fixes: `fd1121e839` ("amd: remove support for LLVM 5.0") CC: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Andres Gomez <agomez@igalia.com>	2018-08-07 19:29:29 +02:00
Juan A. Suarez Romero	80f937ea4d	travis: add ubuntu-toolchain-r-test LLVM 6.0 requires libstc++4.9, which is not available in main Travis repository. v2: LLVM 6.0 requires libstdc+4.9, rather than GCC 4.9 (Jan Vesely) Fixes: `fd1121e839` ("amd: remove support for LLVM 5.0") CC: Marek Olšák <marek.olsak@amd.com> CC: Emil Velikov <emil.velikov@collabora.com> CC: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-07 19:27:07 +02:00
Emil Velikov	85cad15298	egl: set EGL_BAD_NATIVE_PIXMAP in the copy_buffers fallback As the spec says: EGL_BAD_NATIVE_PIXMAP is generated if the implementation does not support native pixmaps. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-07 17:59:24 +01:00
Emil Velikov	5463064f7a	egl/x11: use the no-op dri2_fallback_copy_buffers for swrast Currently dri2_copy_buffers is used for swrast, which depends on the DRI2_FLUSH extension. Since that's not a thing on software based drivers we crash out. Do the slightly more graceful, thing of returning EGL_FALSE. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-07 17:59:09 +01:00
Emil Velikov	670cd4080b	egl: remove unneeded _eglGetNativePlatform check There's little point in calling _eglGetNativePlatform() in eglCopyBuffers. The platform returned should be identical to the one already stored in our _EGLDisplay. In the following corner case, the check is incorrect. The function _eglGetNativePlatform effectively invokes the old-style eglGetDisplay platform selection. Thus if the EGL_PLATFORM platform does not match with the EGL_EXT_platform_* used to create the display we'll error out. Addresses the egl-copy-buffers piglit test. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-07 17:58:52 +01:00
Emil Velikov	b4b277f770	travis: use https for all the links Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-07 17:27:06 +01:00
Emil Velikov	6b8657aff0	autoconf: stop exporting internal wayland details With version v1.15 the "code" option was deprecated in favour of "private-code" or "public-code". Before the interface symbol generated was exported (which is a bad idea since it's internal implementation detail) and others may misuse it. That was the case with libva approx. 1 year ago. Since then libva was fixed, so we can finally hide it by using "private-code" Inspired by similar xserver patch by Adam Jackson. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-07 17:23:17 +01:00
Emil Velikov	2f1d9e6cb8	meson: stop exporting internal wayland details With version v1.15 the "code" option was deprecated in favour of "private-code" or "public-code". Before the interface symbol generated was exported (which is a bad idea since it's internal implementation detail) and others may misuse it. That was the case with libva approx. 1 year ago. Since then libva was fixed, so we can finally hide it by using "private-code" Inspired by similar xserver patch by Adam Jackson. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-07 17:23:17 +01:00
Emil Velikov	c077b74ee8	meson: use dependency()+find_program() for wayland-scanner Helps when the native wayland-scanner is located outside of PATH. Inspired by the xserver code ;-) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-07 17:23:17 +01:00
Emil Velikov	54d844897f	swr: don't export swr_create_screen_internal With earlier rework the user and provider of the symbol are within the same binary. Thus there's no point in exporting the function. Spotted while reviewing patch from Chuck, that nearly added another unneeded PUBLIC function. Cc: Chuck Atkins <chuck.atkins@kitware.com> Cc: Tim Rowley <timothy.o.rowley@intel.com> Fixes: `f50aa21456` "(swr: build driver proper separate from rasterizer") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Tested-by: Chuck Atkins <chuck.atkins@kitware.com> Reviewed-By: George Kyriazis <george.kyriazis@intel.com<mailto:george.kyriazis@intel.com>> Tested-by: Chuck Atkins <chuck.atkins@kitware.com<mailto:chuck.atkins@kitware.com>>	2018-08-07 17:23:17 +01:00
Eric Engestrom	e02f061b69	meson: install KHR/khrplatform.h when needed Fixes: `f7d42ee7d3` "include: update GL & GLES headers (v2)" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-07 15:57:32 +01:00
Eric Engestrom	ed07e831a8	i965: gen_shader_sha1() doesn't use the brw_context Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-08-07 14:20:50 +01:00
Eric Engestrom	87c156183c	configure: install KHR/khrplatform.h when needed Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107511 Fixes: `f7d42ee7d3` "include: update GL & GLES headers (v2)" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Tested-by: Brad King <brad.king@kitware.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-08-07 14:20:50 +01:00
Lionel Landwerlin	303e7b39b5	intel: don't build tools without -Dtools=intel Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107487 Fixes: 4334196ab325c6w ("intel: tools: simplify meson build") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-07 11:58:47 +01:00
Erik Faye-Lund	c4f183492d	virgl: update virgl_hw.h from virglrenderer This just makes sure we're currently up-to-date with what virglrenderer has. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-07 09:38:41 +02:00
Erik Faye-Lund	0914e1464e	virgl: rename msaa_sample_positions -> sample_locations This matches what this field is called in virglrenderer's copy of this. This reduces the diff between the two different versions of virgl_hw.h, and should make it easier to upgrade the file in the future. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Dave Airlie <airlied@redhat.com>	2018-08-07 09:38:27 +02:00
Eric Anholt	9507e03699	vc4: Fix a leak of the no-vertex-elements workaround BO. Fixes: `bd1925562a` ("vc4: Convert the driver to emitting the shader record using pack macros.")	2018-08-06 19:10:06 -07:00
Eric Anholt	86095e9bb1	vc4: Fix context creation when syncobjs aren't supported. Noticed when trying to run current Mesa on rpi's downstream kernel. Fixes: `b0acc3a562` ("broadcom/vc4: Native fence fd support")	2018-08-06 19:10:06 -07:00
Eric Anholt	1561e4984e	v3d: Emit the VCM_CACHE_SIZE packet. This is needed to ensure that we don't get blocked waiting for VPM space with bin/render overlapping. Cc: "18.2" <mesa-stable@lists.freedesktop.org>	2018-08-06 13:03:23 -07:00
Eric Anholt	5d49076990	v3d: Drop "VC5" from the renderer string. VC5 isn't a useful name any more, just stick to v3d.	2018-08-06 13:03:23 -07:00
Eric Anholt	50a8713d4f	v3d: Avoid spilling that breaks the r5 usage after a ldvary. Fixes bad rendering when forcing 2 spills in glxgears. Cc: "18.2" <mesa-stable@lists.freedesktop.org>	2018-08-06 13:03:23 -07:00
Eric Anholt	f2c0d310d6	v3d: Make sure that QPU instruction-has-a-dest matches VIR. Found when debugging register spilling -- we would try to spill the dest of a STVPMV, inserting spill code after entering the last segment. In fact, we were likely to to choose to do this, given that the STVPMV "dest" temp was never read from, making it cheap to spill. Cc: "18.2" <mesa-stable@lists.freedesktop.org>	2018-08-06 13:03:23 -07:00
Eric Anholt	3f9cb2eb05	v3d: Wait for TMU writes to complete before continuing after a spill. The simulator complained that we had write responses outstanding at shader end. It seems that a TMU read does not guarantee that previous TMU writes by the thread have completed, which surprised me. Cc: "18.2" <mesa-stable@lists.freedesktop.org>	2018-08-06 13:03:23 -07:00
Eric Anholt	ccbe33af5b	v3d: Make sure we don't emit a thrsw before the last one finished. Found while forcing some spilling, which creates a lot of short tmua->thrsw->ldtmu sequences. Cc: "18.2" <mesa-stable@lists.freedesktop.org>	2018-08-06 13:03:23 -07:00
Eric Anholt	f9d54dc3cf	v3d: Add some debug code for forcing register spilling. This is useful for periodically testing out register spilling to see how it goes on simple shaders, rather than only failing on insanely complicated ones.	2018-08-06 13:03:23 -07:00
Chad Versace	aaa41cd297	drisw: Fix build on Android Nougat, which lacks shm (v2) In commit `cf54bd5e8`, dri_sw_winsys.c began using <sys/shm.h> to support the new functions putImageShm, getImageShm in DRI_SWRastLoader. But Android began supporting System V shared memory only in Oreo. Nougat has no shm headers. Fix the build by ifdef'ing out the shm code on Nougat. Fixes: `cf54bd5e8` "drisw: use shared memory when possible" Reviewed-by: Dave Airlie <airlied@redhat.com> Cc: Marc-André Lureau <marcandre.lureau@gmail.com>	2018-08-06 11:09:38 -07:00
Ian Romanick	6229ee87c7	mesa: fix make check for AMD_framebuffer_multisample_advanced Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107483 Fixes: `3d6900d76e` ("glapi: define AMD_framebuffer_multisample_advanced and add its functions") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: Vinson Lee <vlee@freedesktop.org>	2018-08-06 10:31:56 -07:00
Ian Romanick	b7946f6778	glapi: Fix GLES versioning for AMD_framebuffer_multisample_advanced functions The GL_AMD_framebuffer_multisample_advanced spec says: OpenGL ES dependencies: Requires OpenGL ES 3.0. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107483 Fixes: `3d6900d76e` ("glapi: define AMD_framebuffer_multisample_advanced and add its functions") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: Vinson Lee <vlee@freedesktop.org>	2018-08-06 10:30:06 -07:00
Gert Wollny	7a46b2d641	meson, install_megadrivers: Also remove stale symlinks os.path.exists doesn't return True for stale symlinks, but they are in the way later, when a link/file with the same name is to be created. For instance it is conceivable that the pointed to file is replaced by a file with a new name, and then the symlink is dead. To handle this check specifically for all existing symlinks to be removed. (This bugged me for some time with a link libXvMCr600.so always being in the way of installing this file) v2: use only os.lexist and replace all instances of os.exist (Dylan Baker) v3: handle directory check correctly (Eric Engestrom) Fixes: `f7f1b30f81` ("meson: extend install_megadrivers script to handle symmlinking") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>(v2 minus dir check) Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Gert Wollny <gert.wollny@collabora.com>	2018-08-06 18:42:01 +02:00
Tapani Pälli	5eb4b384d9	anv: add more swapchain formats This change helps with some of the dEQP-VK.wsi.android.* tests that try to create swapchain with using such formats. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2018-08-06 09:25:11 +03:00
Karol Herbst	c3325097be	nvc0/ir: return 0 in imageLoad on incomplete textures We already guarded all OP_SULDP against out of bound accesses, but we ended up just reusing whatever value was stored in the dest registers. Fixes CTS test shader_image_load_store.incomplete_textures v2: fix for loads not ending up with predicates (bindless_texture) v3: fix replacing the def Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Karol Herbst <kherbst@redhat.com>	2018-08-04 18:25:20 +02:00
Karol Herbst	0ca046d7e9	gm200/ir: optimize rcp(sqrt) to rsq mitigates hurt shaders after adding sqrt: total instructions in shared programs : 5456166 -> 5454825 (-0.02%) total gprs used in shared programs : 647522 -> 647551 (0.00%) total shared used in shared programs : 389120 -> 389120 (0.00%) total local used in shared programs : 21064 -> 21064 (0.00%) total bytes used in shared programs : 58288696 -> 58274448 (-0.02%) local shared gpr inst bytes helped 0 0 0 516 516 hurt 0 0 27 2 2 Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Karol Herbst <kherbst@redhat.com>	2018-08-04 15:24:08 +02:00
Karol Herbst	6f98a3065b	gm200/ir: add native OP_SQRT support ./GpuTest /test=pixmark_piano 1024x640 30sec: 301 -> 327 points shader-db: total instructions in shared programs : 5472103 -> 5456166 (-0.29%) total gprs used in shared programs : 647530 -> 647522 (-0.00%) total shared used in shared programs : 389120 -> 389120 (0.00%) total local used in shared programs : 21064 -> 21064 (0.00%) total bytes used in shared programs : 58459304 -> 58288696 (-0.29%) local shared gpr inst bytes helped 0 0 27 8281 8281 hurt 0 0 21 431 431 v2: use NVISA_GM200_CHIPSET Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Karol Herbst <kherbst@redhat.com>	2018-08-04 15:24:08 +02:00
Lionel Landwerlin	4334196ab3	intel: tools: simplify meson build Remove the if tools condition and just put it through the install: parameter. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-04 09:45:34 +01:00
Lionel Landwerlin	87a3c97781	intel: aubinator: simplify decoding Since we don't support streaming an aub file, we can drop the decoding status enum. v2: include stdbool (Eric) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-04 09:40:14 +01:00
Lionel Landwerlin	02ebc064ea	intel: common: add missing stdint include Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-04 09:39:01 +01:00
Lionel Landwerlin	db4770ee57	intel: decoder: remove unused variable Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-04 09:38:58 +01:00
Lionel Landwerlin	7471286bb0	intel: tools: aubwrite: reuse canonical address helper Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-04 09:38:44 +01:00
Lionel Landwerlin	35955afa7a	intel: aubinator: fix read the context/ring Up to now we've been lucky that the buffer returned was always exactly at the address we requested. Fixes: `144b40db54` ("intel: aubinator: drop the 1Tb GTT mapping") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-08-04 09:38:34 +01:00
Ian Romanick	3b07d28f81	nir: Transform expressions of b2f(a) and b2f(b) to a == b All Gen7+ platforms had similar results. (Skylake shown) total instructions in shared programs: 14276886 -> 14276838 (<.01%) instructions in affected programs: 312 -> 264 (-15.38%) helped: 2 HURT: 0 total cycles in shared programs: 532578395 -> 532570985 (<.01%) cycles in affected programs: 682562 -> 675152 (-1.09%) helped: 374 HURT: 4 helped stats (abs) min: 2 max: 200 x̄: 20.39 x̃: 18 helped stats (rel) min: 0.07% max: 11.64% x̄: 1.25% x̃: 1.28% HURT stats (abs) min: 2 max: 114 x̄: 53.50 x̃: 49 HURT stats (rel) min: 0.06% max: 11.70% x̄: 5.02% x̃: 4.15% 95% mean confidence interval for cycles value: -21.30 -17.91 95% mean confidence interval for cycles %-change: -1.30% -1.06% Cycles are helped. Sandy Bridge total instructions in shared programs: 10488123 -> 10488075 (<.01%) instructions in affected programs: 336 -> 288 (-14.29%) helped: 2 HURT: 0 total cycles in shared programs: 150260379 -> 150260439 (<.01%) cycles in affected programs: 4726 -> 4786 (1.27%) helped: 0 HURT: 2 No changes on Iron Lake or GM45. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	c658b6c4c8	nir: Transform expressions of b2f(a) and b2f(b) to a ^^ b All Gen platforms had pretty similar results. (Skylake shown) total instructions in shared programs: 14276892 -> 14276886 (<.01%) instructions in affected programs: 484 -> 478 (-1.24%) helped: 2 HURT: 0 total cycles in shared programs: 532578397 -> 532578395 (<.01%) cycles in affected programs: 3522 -> 3520 (-0.06%) helped: 1 HURT: 0 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	3aca80aabc	nir: Transform expressions of b2f(a) and b2f(b) to !(a && b) All Gen platforms had pretty similar results. (Skylake shown) total cycles in shared programs: 532578400 -> 532578397 (<.01%) cycles in affected programs: 2784 -> 2781 (-0.11%) helped: 1 HURT: 1 helped stats (abs) min: 4 max: 4 x̄: 4.00 x̃: 4 helped stats (rel) min: 0.26% max: 0.26% x̄: 0.26% x̃: 0.26% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.08% max: 0.08% x̄: 0.08% x̃: 0.08% v2: s/fmax/fmin/. Noticed by Thomas Helland. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	1713c97181	nir: Transform expressions of b2f(a) and b2f(b) to a && b No changes on any Gen platform. v2: s/fmax/fmin/. Noticed by Thomas Helland. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	4425f4786a	nir: Transform expressions of b2f(a) and b2f(b) to !(a \|\| b) All Gen6+ platforms had similar results. (Skylake shown) total instructions in shared programs: 14276961 -> 14276892 (<.01%) instructions in affected programs: 3215 -> 3146 (-2.15%) helped: 28 HURT: 0 helped stats (abs) min: 1 max: 6 x̄: 2.46 x̃: 2 helped stats (rel) min: 0.47% max: 9.52% x̄: 4.34% x̃: 1.92% 95% mean confidence interval for instructions value: -2.87 -2.06 95% mean confidence interval for instructions %-change: -5.73% -2.95% Instructions are helped. total cycles in shared programs: 532577068 -> 532578400 (<.01%) cycles in affected programs: 121864 -> 123196 (1.09%) helped: 35 HURT: 30 helped stats (abs) min: 2 max: 268 x̄: 42.34 x̃: 22 helped stats (rel) min: 0.12% max: 12.14% x̄: 3.22% x̃: 1.86% HURT stats (abs) min: 2 max: 246 x̄: 93.80 x̃: 36 HURT stats (rel) min: 0.09% max: 13.63% x̄: 4.47% x̃: 2.58% 95% mean confidence interval for cycles value: -5.02 46.01 95% mean confidence interval for cycles %-change: -0.99% 1.65% Inconclusive result (value mean confidence interval includes 0). Iron Lake and GM45 had similar results. (Iron Lake shown) total instructions in shared programs: 7781299 -> 7781342 (<.01%) instructions in affected programs: 22300 -> 22343 (0.19%) helped: 13 HURT: 40 helped stats (abs) min: 2 max: 3 x̄: 2.85 x̃: 3 helped stats (rel) min: 1.15% max: 7.69% x̄: 3.72% x̃: 3.33% HURT stats (abs) min: 2 max: 2 x̄: 2.00 x̃: 2 HURT stats (rel) min: 0.26% max: 1.30% x̄: 0.47% x̃: 0.43% 95% mean confidence interval for instructions value: 0.23 1.39 95% mean confidence interval for instructions %-change: -1.18% 0.07% Inconclusive result (%-change mean confidence interval includes 0). total cycles in shared programs: 177878928 -> 177879332 (<.01%) cycles in affected programs: 383298 -> 383702 (0.11%) helped: 7 HURT: 43 helped stats (abs) min: 2 max: 18 x̄: 10.00 x̃: 10 helped stats (rel) min: 0.17% max: 4.81% x̄: 2.62% x̃: 3.40% HURT stats (abs) min: 2 max: 38 x̄: 11.02 x̃: 12 HURT stats (rel) min: 0.08% max: 1.54% x̄: 0.25% x̃: 0.09% 95% mean confidence interval for cycles value: 5.21 10.95 95% mean confidence interval for cycles %-change: -0.51% 0.21% Inconclusive result (%-change mean confidence interval includes 0). v2: s/fmin/fmax/. Noticed by Thomas Helland. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	6b3670ae80	nir: Transform -fabs(a) >= 0 to a == 0 All Gen platforms had pretty similar results. (Skylake shown) total instructions in shared programs: 14276964 -> 14276961 (<.01%) instructions in affected programs: 411 -> 408 (-0.73%) helped: 3 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.47% max: 1.96% x̄: 1.04% x̃: 0.68% total cycles in shared programs: 532577062 -> 532577068 (<.01%) cycles in affected programs: 1093 -> 1099 (0.55%) helped: 1 HURT: 1 helped stats (abs) min: 16 max: 16 x̄: 16.00 x̃: 16 helped stats (rel) min: 7.77% max: 7.77% x̄: 7.77% x̃: 7.77% HURT stats (abs) min: 22 max: 22 x̄: 22.00 x̃: 22 HURT stats (rel) min: 2.48% max: 2.48% x̄: 2.48% x̃: 2.48% Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	46e7c340d4	nir: Transform expressions of b2f(a) and b2f(b) to a \|\| b All Gen6+ platforms had pretty similar results. (Skylake shown) total instructions in shared programs: 14277184 -> 14276964 (<.01%) instructions in affected programs: 10082 -> 9862 (-2.18%) helped: 37 HURT: 1 helped stats (abs) min: 1 max: 30 x̄: 5.97 x̃: 4 helped stats (rel) min: 0.14% max: 16.00% x̄: 5.23% x̃: 2.04% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.70% max: 0.70% x̄: 0.70% x̃: 0.70% 95% mean confidence interval for instructions value: -7.87 -3.71 95% mean confidence interval for instructions %-change: -6.98% -3.16% Instructions are helped. total cycles in shared programs: 532577990 -> 532577062 (<.01%) cycles in affected programs: 170959 -> 170031 (-0.54%) helped: 33 HURT: 9 helped stats (abs) min: 2 max: 120 x̄: 30.91 x̃: 30 helped stats (rel) min: 0.02% max: 7.65% x̄: 2.66% x̃: 1.13% HURT stats (abs) min: 2 max: 24 x̄: 10.22 x̃: 8 HURT stats (rel) min: 0.09% max: 1.79% x̄: 0.61% x̃: 0.22% 95% mean confidence interval for cycles value: -31.23 -12.96 95% mean confidence interval for cycles %-change: -2.90% -1.02% Cycles are helped. Iron Lake and GM45 had similar results. (Iron Lake shown) total instructions in shared programs: 7781539 -> 7781301 (<.01%) instructions in affected programs: 10169 -> 9931 (-2.34%) helped: 32 HURT: 0 helped stats (abs) min: 2 max: 20 x̄: 7.44 x̃: 6 helped stats (rel) min: 0.47% max: 17.02% x̄: 4.03% x̃: 1.88% 95% mean confidence interval for instructions value: -9.53 -5.34 95% mean confidence interval for instructions %-change: -5.94% -2.12% Instructions are helped. total cycles in shared programs: 177878590 -> 177878932 (<.01%) cycles in affected programs: 78706 -> 79048 (0.43%) helped: 7 HURT: 21 helped stats (abs) min: 6 max: 34 x̄: 24.57 x̃: 28 helped stats (rel) min: 0.15% max: 8.33% x̄: 4.66% x̃: 6.37% HURT stats (abs) min: 2 max: 86 x̄: 24.48 x̃: 22 HURT stats (rel) min: 0.01% max: 4.28% x̄: 1.21% x̃: 0.70% 95% mean confidence interval for cycles value: 0.30 24.13 95% mean confidence interval for cycles %-change: -1.52% 1.01% Inconclusive result (%-change mean confidence interval includes 0). v2: s/fmin/fmax/. Noticed by Thomas Helland. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	be7d3ba34a	nir: Transform -fabs(a) < 0 to a != 0 Unlike the much older -abs(a) >= 0.0 transformation, this is not precise. The behavior changes if a is NaN. All Gen platforms had pretty similar results. (Skylake shown) total instructions in shared programs: 14277216 -> 14277184 (<.01%) instructions in affected programs: 2300 -> 2268 (-1.39%) helped: 8 HURT: 0 helped stats (abs) min: 1 max: 8 x̄: 4.00 x̃: 3 helped stats (rel) min: 0.48% max: 15.15% x̄: 4.41% x̃: 1.01% 95% mean confidence interval for instructions value: -6.45 -1.55 95% mean confidence interval for instructions %-change: -9.96% 1.13% Inconclusive result (%-change mean confidence interval includes 0). total cycles in shared programs: 532577848 -> 532577990 (<.01%) cycles in affected programs: 17486 -> 17628 (0.81%) helped: 2 HURT: 5 helped stats (abs) min: 2 max: 6 x̄: 4.00 x̃: 4 helped stats (rel) min: 0.06% max: 1.81% x̄: 0.93% x̃: 0.93% HURT stats (abs) min: 6 max: 50 x̄: 30.00 x̃: 26 HURT stats (rel) min: 0.55% max: 2.17% x̄: 1.19% x̃: 1.02% 95% mean confidence interval for cycles value: -1.06 41.63 95% mean confidence interval for cycles %-change: -0.58% 1.74% Inconclusive result (value mean confidence interval includes 0). Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	d49eab2757	nir: Rearrange bcsel with two bcsel sources All Gen platforms had pretty similar results. (Skylake shown) total instructions in shared programs: 14277220 -> 14277216 (<.01%) instructions in affected programs: 422 -> 418 (-0.95%) helped: 2 HURT: 0 total cycles in shared programs: 532577908 -> 532577848 (<.01%) cycles in affected programs: 2800 -> 2740 (-2.14%) helped: 2 HURT: 0 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	b92fded6eb	nir: Collapse more repeated bcsels on the same argument All Gen platforms had pretty similar results. (Skylake shown) total instructions in shared programs: 14277230 -> 14277220 (<.01%) instructions in affected programs: 751 -> 741 (-1.33%) helped: 4 HURT: 0 helped stats (abs) min: 2 max: 3 x̄: 2.50 x̃: 2 helped stats (rel) min: 1.23% max: 1.40% x̄: 1.32% x̃: 1.32% 95% mean confidence interval for instructions value: -3.42 -1.58 95% mean confidence interval for instructions %-change: -1.47% -1.17% Instructions are helped. total cycles in shared programs: 532577947 -> 532577908 (<.01%) cycles in affected programs: 10641 -> 10602 (-0.37%) helped: 4 HURT: 3 helped stats (abs) min: 1 max: 40 x̄: 13.75 x̃: 7 helped stats (rel) min: 0.11% max: 3.08% x̄: 1.10% x̃: 0.60% HURT stats (abs) min: 2 max: 8 x̄: 5.33 x̃: 6 HURT stats (rel) min: 0.13% max: 0.55% x̄: 0.30% x̃: 0.23% 95% mean confidence interval for cycles value: -20.69 9.55 95% mean confidence interval for cycles %-change: -1.63% 0.63% Inconclusive result (value mean confidence interval includes 0). Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-04 01:12:03 -07:00
Ian Romanick	408330ed48	nir: Don't compare i2f or u2i with zero Broadwell and Skylake had similar results. (Skylake shown) total instructions in shared programs: 14277620 -> 14277230 (<.01%) instructions in affected programs: 36905 -> 36515 (-1.06%) helped: 101 HURT: 6 helped stats (abs) min: 1 max: 6 x̄: 4.46 x̃: 6 helped stats (rel) min: 0.32% max: 7.69% x̄: 1.80% x̃: 1.51% HURT stats (abs) min: 1 max: 28 x̄: 10.00 x̃: 1 HURT stats (rel) min: 0.33% max: 1.74% x̄: 0.68% x̃: 0.47% 95% mean confidence interval for instructions value: -4.59 -2.70 95% mean confidence interval for instructions %-change: -1.90% -1.41% Instructions are helped. total cycles in shared programs: 532580716 -> 532577947 (<.01%) cycles in affected programs: 940575 -> 937806 (-0.29%) helped: 92 HURT: 12 helped stats (abs) min: 2 max: 158 x̄: 51.04 x̃: 62 helped stats (rel) min: 0.24% max: 3.99% x̄: 2.14% x̃: 2.41% HURT stats (abs) min: 10 max: 1112 x̄: 160.58 x̃: 63 HURT stats (rel) min: 0.06% max: 21.90% x̄: 4.22% x̃: 0.20% 95% mean confidence interval for cycles value: -50.66 -2.59 95% mean confidence interval for cycles %-change: -2.09% -0.73% Cycles are helped. total spills in shared programs: 8116 -> 8124 (0.10%) spills in affected programs: 200 -> 208 (4.00%) helped: 0 HURT: 2 total fills in shared programs: 11086 -> 11094 (0.07%) fills in affected programs: 436 -> 444 (1.83%) helped: 0 HURT: 2 Ivy Bridge and Haswell had similar results. (Haswell shown) total instructions in shared programs: 12979054 -> 12978067 (<.01%) instructions in affected programs: 33633 -> 32646 (-2.93%) helped: 120 HURT: 2 helped stats (abs) min: 1 max: 13 x̄: 8.53 x̃: 13 helped stats (rel) min: 0.30% max: 16.67% x̄: 4.55% x̃: 3.17% HURT stats (abs) min: 18 max: 18 x̄: 18.00 x̃: 18 HURT stats (rel) min: 1.15% max: 2.84% x̄: 2.00% x̃: 2.00% 95% mean confidence interval for instructions value: -9.19 -6.99 95% mean confidence interval for instructions %-change: -5.27% -3.62% Instructions are helped. total cycles in shared programs: 411212880 -> 411199636 (<.01%) cycles in affected programs: 696441 -> 683197 (-1.90%) helped: 107 HURT: 5 helped stats (abs) min: 2 max: 864 x̄: 124.90 x̃: 146 helped stats (rel) min: 0.03% max: 29.20% x̄: 8.58% x̃: 5.88% HURT stats (abs) min: 2 max: 50 x̄: 24.00 x̃: 22 HURT stats (rel) min: 0.01% max: 5.35% x̄: 1.29% x̃: 0.25% 95% mean confidence interval for cycles value: -136.96 -99.54 95% mean confidence interval for cycles %-change: -9.75% -6.53% Cycles are helped. total spills in shared programs: 78623 -> 78631 (0.01%) spills in affected programs: 66 -> 74 (12.12%) helped: 0 HURT: 2 total fills in shared programs: 80104 -> 80108 (<.01%) fills in affected programs: 133 -> 137 (3.01%) helped: 0 HURT: 2 No changes on Sandy Bridge, Iron Lake, or GM45. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	a3845616a2	nir: Remove f2i(i2f(x)) conversions Broadwell and Skylake had similar results. (Skylake shown) total instructions in shared programs: 14277978 -> 14277620 (<.01%) instructions in affected programs: 36957 -> 36599 (-0.97%) helped: 76 HURT: 1 helped stats (abs) min: 2 max: 90 x̄: 4.89 x̃: 4 helped stats (rel) min: 0.44% max: 5.88% x̄: 1.04% x̃: 0.87% HURT stats (abs) min: 14 max: 14 x̄: 14.00 x̃: 14 HURT stats (rel) min: 0.36% max: 0.36% x̄: 0.36% x̃: 0.36% 95% mean confidence interval for instructions value: -7.06 -2.24 95% mean confidence interval for instructions %-change: -1.28% -0.77% Instructions are helped. total cycles in shared programs: 532584581 -> 532580716 (<.01%) cycles in affected programs: 973591 -> 969726 (-0.40%) helped: 76 HURT: 1 helped stats (abs) min: 2 max: 9940 x̄: 159.80 x̃: 32 helped stats (rel) min: <.01% max: 8.70% x̄: 1.15% x̃: 1.19% HURT stats (abs) min: 8280 max: 8280 x̄: 8280.00 x̃: 8280 HURT stats (rel) min: 2.10% max: 2.10% x̄: 2.10% x̃: 2.10% 95% mean confidence interval for cycles value: -386.98 286.59 95% mean confidence interval for cycles %-change: -1.41% -0.81% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 8127 -> 8116 (-0.14%) spills in affected programs: 108 -> 97 (-10.19%) helped: 1 HURT: 0 total fills in shared programs: 11090 -> 11086 (-0.04%) fills in affected programs: 440 -> 436 (-0.91%) helped: 1 HURT: 1 Haswell total instructions in shared programs: 12979174 -> 12979054 (<.01%) instructions in affected programs: 9040 -> 8920 (-1.33%) helped: 14 HURT: 1 helped stats (abs) min: 2 max: 34 x̄: 8.79 x̃: 6 helped stats (rel) min: 0.41% max: 7.04% x̄: 2.66% x̃: 1.14% HURT stats (abs) min: 3 max: 3 x̄: 3.00 x̃: 3 HURT stats (rel) min: 0.19% max: 0.19% x̄: 0.19% x̃: 0.19% 95% mean confidence interval for instructions value: -13.58 -2.42 95% mean confidence interval for instructions %-change: -3.94% -1.01% Instructions are helped. total cycles in shared programs: 411227148 -> 411212880 (<.01%) cycles in affected programs: 630506 -> 616238 (-2.26%) helped: 15 HURT: 0 helped stats (abs) min: 2 max: 11192 x̄: 951.20 x̃: 38 helped stats (rel) min: <.01% max: 16.01% x̄: 3.92% x̃: 0.17% 95% mean confidence interval for cycles value: -2544.28 641.88 95% mean confidence interval for cycles %-change: -6.89% -0.94% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 78626 -> 78623 (<.01%) spills in affected programs: 42 -> 39 (-7.14%) helped: 1 HURT: 0 total fills in shared programs: 80111 -> 80104 (<.01%) fills in affected programs: 140 -> 133 (-5.00%) helped: 1 HURT: 1 Ivy Bridge total instructions in shared programs: 11684101 -> 11684030 (<.01%) instructions in affected programs: 3080 -> 3009 (-2.31%) helped: 4 HURT: 1 helped stats (abs) min: 5 max: 59 x̄: 18.50 x̃: 5 helped stats (rel) min: 6.47% max: 7.04% x̄: 6.87% x̃: 6.99% HURT stats (abs) min: 3 max: 3 x̄: 3.00 x̃: 3 HURT stats (rel) min: 0.15% max: 0.15% x̄: 0.15% x̃: 0.15% 95% mean confidence interval for instructions value: -45.59 17.19 95% mean confidence interval for instructions %-change: -9.38% -1.56% Inconclusive result (value mean confidence interval includes 0). total cycles in shared programs: 258407697 -> 258389653 (<.01%) cycles in affected programs: 328323 -> 310279 (-5.50%) helped: 5 HURT: 0 helped stats (abs) min: 32 max: 14908 x̄: 3608.80 x̃: 32 helped stats (rel) min: 1.26% max: 17.22% x̄: 9.30% x̃: 10.60% 95% mean confidence interval for cycles value: -11616.71 4399.11 95% mean confidence interval for cycles %-change: -16.56% -2.03% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 4537 -> 4528 (-0.20%) spills in affected programs: 64 -> 55 (-14.06%) helped: 1 HURT: 0 total fills in shared programs: 4823 -> 4815 (-0.17%) fills in affected programs: 189 -> 181 (-4.23%) helped: 1 HURT: 1 Sandy Bridge total instructions in shared programs: 10488464 -> 10488449 (<.01%) instructions in affected programs: 272 -> 257 (-5.51%) helped: 3 HURT: 0 helped stats (abs) min: 5 max: 5 x̄: 5.00 x̃: 5 helped stats (rel) min: 5.49% max: 5.56% x̄: 5.51% x̃: 5.49% total cycles in shared programs: 150263359 -> 150263263 (<.01%) cycles in affected programs: 7978 -> 7882 (-1.20%) helped: 3 HURT: 0 helped stats (abs) min: 32 max: 32 x̄: 32.00 x̃: 32 helped stats (rel) min: 1.15% max: 1.23% x̄: 1.20% x̃: 1.23% No changes on Iron Lake or GM45. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Ian Romanick	ea6c276436	nir: Mark the 0.0 < abs(a) transformation as imprecise Unlike the much older -abs(a) >= 0.0 transformation, this is not precise. The behavior changes if the source is NaN. No shader-db changes on any platform. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-08-04 01:12:03 -07:00
Marek Olšák	4bad50ded9	radeonsi: cosmetic changes	2018-08-04 03:10:30 -04:00
Marek Olšák	6508b93d78	st/mesa: expose & set limits for AMD_framebuffer_multisample_advanced Reviewed-by: Brian Paul <brianp@vmware.com>	2018-08-04 02:47:58 -04:00
Marek Olšák	7f587b57f7	st/mesa: add renderbuffer support for AMD_framebuffer_multisample_advanced Reviewed-by: Brian Paul <brianp@vmware.com>	2018-08-04 02:46:55 -04:00
Marek Olšák	8e3d0019e1	st/mesa: pass storage_sample_count parameter into st_choose_format Reviewed-by: Brian Paul <brianp@vmware.com>	2018-08-04 02:46:55 -04:00
Marek Olšák	459f05c7ec	mesa: add functional FBO changes for AMD_framebuffer_multisample_advanced - relax FBO completeness rules - validate sample counts Reviewed-by: Brian Paul <brianp@vmware.com>	2018-08-04 02:46:55 -04:00
Marek Olšák	328c1c8d99	mesa: add gl_renderbuffer::NumStorageSamples Reviewed-by: Brian Paul <brianp@vmware.com>	2018-08-04 02:46:55 -04:00
Marek Olšák	a96e946d25	mesa: implement glGet for AMD_framebuffer_multisample_advanced Reviewed-by: Brian Paul <brianp@vmware.com>	2018-08-04 02:46:55 -04:00
Marek Olšák	3d6900d76e	glapi: define AMD_framebuffer_multisample_advanced and add its functions Reviewed-by: Brian Paul <brianp@vmware.com>	2018-08-04 02:46:55 -04:00
Marek Olšák	2d115056d3	mesa: add storageSamples parameter to renderbuffer functions It's just passed to other functions but otherwise unused. It will be used in following commits. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-08-04 02:46:55 -04:00
Marek Olšák	f7d42ee7d3	include: update GL & GLES headers (v2) v2: use correct files Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-04 02:43:05 -04:00
Marek Olšák	fd1121e839	amd: remove support for LLVM 5.0 Users are encouraged to switch to LLVM 6.0 released in March 2018. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-03 18:36:11 -04:00
Marek Olšák	461a864316	winsys/amdgpu: pass the BO list via the CS ioctl on DRM >= 3.27.0	2018-08-03 18:35:19 -04:00
Marek Olšák	0f79b2015b	gallium/u_vbuf: handle indirect multidraws correctly and efficiently (v3) v2: need to do MAX{start+count} instead of MAX{count} added piglit tests v3: use malloc Cc: 18.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-08-03 18:30:46 -04:00
Mauro Rossi	1c7a2433b2	android: radv: build vulkan.radv conditionally to radeonsi A problem was reported with arm,arm64 targets build due to missing libLLVM shared library dependency with AOSP; to avoid this issue vulkan.radv is built conditionally only when radeonsi is in BOARD_GPU_DRIVERS Fixes: `0ca153f869` ("android: radv: enable build of vulkan.radv HAL module") Reported-by: John Stultz <john.stultz@linaro.org> Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Cc: "18.2" <mesa-stable@lists.freedesktop.org>	2018-08-03 20:09:16 +02:00
Roland Scheidegger	c72f91deba	util: return 0 for NaNs in float_to_ubyte d3d10 requires NaNs to get converted to 0 for float->unorm conversions (and float->int etc.). GL spec probably doesn't care in general, but it would make sense to have reasonable behavior in any case imho - the old code was converting negative NaNs to 0, and positive NaNs to 255. (Note that using float comparison isn't actually all that much more effort in any case, at least with sse2 it's just float comparison (ucommiss) instead of int one - I converted the second comparison to float too simply because it saves the probably somewhat expensive transfer of the float from simd to int domain (with sse2 via stack), so the generated code actually has 2 less instructions, although float comparisons are more expensive than int ones.) Reviewed-by: Brian Paul <brianp@vmware.com>	2018-08-03 17:07:38 +02:00
Jason Ekstrand	1d900e55fd	anv/pipeline: Disable FS dispatch for pointless fragment shaders Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-03 05:52:23 -07:00
Timothy Arceri	d5175d21c7	nir: add fall through comment to nir_gather_info This stops Coverity reporting a defect and helps make the code less error-prone. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-03 09:30:57 +10:00
Dan Willemsen	12e3334f1e	CleanSpec.mk: Remove HOST_OUT_release This is a forward port of a patch from the AOSP/master tree: `bd633f11de`%5E%21/ Which replaces HOST_OUT_release with HOST_OUT As per Dan's explanation, the current code was incorrect to use $(HOST_OUT_release) as $(HOST_OUT) will be set properly for whether the current build that's being cleaned during incrementals is using host debug or release builds. Additionally Dan noted it was incredibly uncommon to use a debug host build, as there was never a shortcut and one had to set an environment variable manually. Thus it was rarely if ever tested. Change-Id: I7972c0a50fa3520dcfa962d6dd7e602bfe22368d Cc: Rob Herring <rob.herring@linaro.org> Cc: Alistair Strachan <astrachan@google.com> Cc: Marissa Wall <marissaw@google.com> Cc: Sumit Semwal <sumit.semwal@linaro.org> Cc: Emil Velikov <emil.l.velikov@gmail.com> Cc: Rob Clark <robdclark@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: John Stultz <john.stultz@linaro.org> Signed-off-by: Rob Herring <robh@kernel.org>	2018-08-02 15:42:40 -06:00
Sumit Semwal	d0b63b6583	Android.common.mk: define HAVE_TIMESPEC_GET This is a forward port of a patch from the AOSP/master tree: `bd30b663f5`%5E%21/ Since https://android-review.googlesource.com/c/718518 added timespec_get() to bionic, mesa3d doesn't build due to redefinition of timespec_get(). Avoid redefinition by defining HAVE_TIMESPEC_GET flag. Test: build and boot tested db820c to UI. Change-Id: I3dcc8034b48785e45cd3fa50e4d9cf2c684694a0 Cc: Rob Herring <rob.herring@linaro.org> Cc: Alistair Strachan <astrachan@google.com> Cc: Marissa Wall <marissaw@google.com> Cc: Sumit Semwal <sumit.semwal@linaro.org> Cc: Emil Velikov <emil.l.velikov@gmail.com> Cc: Rob Clark <robdclark@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Sumit Semwal <sumit.semwal@linaro.org> Signed-off-by: John Stultz <john.stultz@linaro.org> Signed-off-by: Rob Herring <robh@kernel.org>	2018-08-02 15:42:27 -06:00
Dan Willemsen	dc030d1ec9	util: Android.mk: Convert implicit rules to static pattern rules This is a partial cherry-pick from AOSP's mesa3d tree: `a88dcf769e`%5E%21/ "We're deprecating make implicit rules, preferring static pattern rules, or just regular rules." Without this patch, the freedesktop/master branch won't build in the AOSP environment, and this patch corrects that, as tested on the Dragonboard 820c. The i965 portion of the patch this is based on collided badly, and I'm not sure how to best forward port it. However, so far we don't see build issues without that portion. Comments or feedback would be appreciated! Change-Id: Id6dfd0d018cbd665fa19d80c14abd5f75fa10b8a Cc: Rob Herring <rob.herring@linaro.org> Cc: Alistair Strachan <astrachan@google.com> Cc: Marissa Wall <marissaw@google.com> Cc: Sumit Semwal <sumit.semwal@linaro.org> Cc: Emil Velikov <emil.l.velikov@gmail.com> Cc: Rob Clark <robdclark@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: John Stultz <john.stultz@linaro.org> Signed-off-by: Rob Herring <robh@kernel.org>	2018-08-02 15:42:23 -06:00
Darren Powell	726a48c94f	radeonsi: add new R600_DEBUG test "testclearbufperf" Signed-off-by: Darren Powell <darren.powell@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2018-08-02 16:09:22 -04:00
Brian Paul	977638006b	mesa: add switch case for GL 2.0 in _mesa_compute_version() Previously, I added a switch case for GL 2.1 (ed7a0770b881791dd697f3). I don't know of any driver which only supports GL 2.0, but adding this switch case avoids a failure if the app queries GL_SHADING_LANGUAGE_VERSION. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-08-02 13:20:00 -06:00
Andres Gomez	2d4d139877	intel/tools: add error2aub creation into autotools Tarball distribution is done through "make distcheck". We include the meson targets also into autotools so they won't fail when building from the tarball. Fixes: `6a60beba40` ("intel/tools: Add an error state to aub translator") Cc: Jason Ekstrand <jason.ekstrand@intel.com> Cc: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: Dylan Baker <dylan.c.baker@intel.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-02 21:15:57 +03:00
Jason Ekstrand	7ef6cd0ee8	anv/pipeline: Do cross-stage linking optimizations This appears to help the Aztec Ruins benchmark by about 2% on my Kaby Lake gt2 laptop. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jason Ekstrand	a5bffa061d	anv/pipeline: Pull most of the anv_pipeline_compile_* into common code This leaves us with a series of little anv_pipeline_compile_* functions which each take a compiler object, a mem_ctx, the stage to compile, and the previous stage for VUE linking purposes. Some of them do interesting things but most are little more than wrappers around brw_compile_*. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jason Ekstrand	5351339554	anv/pipeline: Add a separate "link" stage This breaks compilation up a bit into "link" and "compile". In the "link" stage, new anv_pipeline_link_* helpers are called which are responsible for setting up the binding table and doing anything needed to properly link with the next stage in the pipeline if one exists. They are called in reverse order starting with the fragment shader so you can assume linking in later stages is already done. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jason Ekstrand	5b196f39bd	anv/pipeline: Compile to NIR in compile_graphics This pulls the SPIR-V to NIR step out into common code. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jason Ekstrand	946fcd02a9	anv/pipeline: Recompile all shaders if any are missing from the cache Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jason Ekstrand	f76d6d8a63	anv/pipeline: Drop anv_pipeline_add_compiled_stage We can set active_stages much more directly and then it's just candy around setting pipeline->stages[stage]. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jason Ekstrand	703a24932a	anv/pipeline: Pull shader compilation out into a helper. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jason Ekstrand	f3c59ca947	anv/pipeline: Call anv_pipeline_compile_* in a loop Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jason Ekstrand	bdc3565c8c	anv/pipeline: Hash the entire pipeline in one go Instead of hashing each stage separately (and TES and TCS together), we hash the entire pipeline. This means we'll get fewer cache hits if they, for instance, re-use the same VS over and over again but it also means we can now safely do cross-stage optimizations. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jason Ekstrand	4a8236ae17	anv/pipeline: Populate keys up-front Instead of having each anv_pipeline_compile_* function populate the shader key, make it part of the anv_pipeline_stage struct and fill it out up-front. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jason Ekstrand	76503b319a	anv/pipline: Add a helper struct for per-stage info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-08-02 10:29:20 -07:00
Jon Turney	a48c0659e1	meson: use correct keyword to fix a meson warning With a sufficently recent meson, the following warning is produced: WARNING: Passed invalid keyword argument "extra_args". WARNING: This will become a hard error in the future. It seems that compiler.links(args:) is meant here. Signed-off-by: Jon Turney <jon.turney@dronecode.org.uk> Reviewed-and-Tested-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-08-02 18:12:49 +01:00
Andres Gomez	3013e22717	docs: add 18.3.0-devel release notes template Signed-off-by: Andres Gomez <agomez@igalia.com>	2018-08-02 18:15:33 +03:00
Andres Gomez	873767cf42	mesa: bump version to 18.3.0-devel Signed-off-by: Andres Gomez <agomez@igalia.com>	2018-08-02 18:00:15 +03:00
Eric Engestrom	44265cc65e	egl/main: fix indentation Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com>	2018-08-02 12:54:05 +01:00
Eric Engestrom	dd007d1c2a	loader: fix indentation Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com>	2018-08-02 12:53:58 +01:00
Vlad Golovkin	9d3a2394e4	swr: Remove unnecessary memset call Zeroing memory after calloc is not necessary. This also allows to avoid possible crash when allocation fails, because memset is called before checking screen for NULL. Fixes: `a29d63ecf7` "swr: refactor swr_create_screen to allow for proper cleanup on error" Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-08-02 11:13:40 +01:00

1844 changed files with 155020 additions and 52161 deletions

									
										1

.editorconfig
									
												View File
												
				@@ -11,6 +11,7 @@ tab_width = 8

				[*.{c,h,cpp,hpp,cc,hh}]

				indent_style = space

				indent_size = 3

				max_line_length = 78

				[{Makefile*,*.mk}]

				indent_style = tab

5

.mailmap

View File

@@ -145,6 +145,11 @@ Edward O'Callaghan <funfunctor@folklore1984.net> <eocallaghan@alterapraxis.com>
 Emeric Grange <emeric.grange@gmail.com> Emeric <emeric.grange@gmail.com>
 Emil Velikov <emil.l.velikov@gmail.com> <emil.velikov@collabora.com>
 Emil Velikov <emil.l.velikov@gmail.com> <emil.veliko@collabora.com>
 Emil Velikov <emil.l.velikov@gmail.com> <emil.velikov@collabora.co.uk>
 Emil Velikov <emil.l.velikov@gmail.com> <emil.veliikov@collabora.com>
 Emil Velikov <emil.l.velikov@gmail.com> <emil.velikov@gmail.com>
 Emil Velikov <emil.l.velikov@gmail.com> <emmil.velikov@collabora.com>
 Eric Anholt <eric@anholt.net> Eric Anholt <anholt@FreeBSD.org>

									
										451

.travis.yml
									
												View File
												
				@@ -1,7 +1,6 @@

				language: c

				sudo: false

				dist: trusty

				dist: xenial

				cache:

				  apt: true

				@@ -9,14 +8,14 @@ cache:

				env:

				  global:

				    - XORG_RELEASES=http://xorg.freedesktop.org/releases/individual

				    - XCB_RELEASES=http://xcb.freedesktop.org/dist

				    - WAYLAND_RELEASES=http://wayland.freedesktop.org/releases

				    - XORG_RELEASES=https://xorg.freedesktop.org/releases/individual

				    - XCB_RELEASES=https://xcb.freedesktop.org/dist

				    - WAYLAND_RELEASES=https://wayland.freedesktop.org/releases

				    - XORGMACROS_VERSION=util-macros-1.19.0

				    - GLPROTO_VERSION=glproto-1.4.17

				    - DRI2PROTO_VERSION=dri2proto-2.8

				    - LIBPCIACCESS_VERSION=libpciaccess-0.13.4

				    - LIBDRM_VERSION=libdrm-2.4.74

				    - LIBDRM_VERSION=libdrm-2.4.97

				    - XCBPROTO_VERSION=xcb-proto-1.13

				    - RANDRPROTO_VERSION=randrproto-1.3.0

				    - LIBXRANDR_VERSION=libXrandr-1.3.0

				@@ -35,38 +34,47 @@ matrix:

				    - env:

				        - LABEL="meson Vulkan"

				        - BUILD=meson

				        - MESON_OPTIONS="-Ddri-drivers=[] -Dgallium-drivers=[]"

				        - LLVM_VERSION=5.0

				        - UNWIND="false"

				        - DRI_LOADERS="-Dglx=disabled -Dgbm=false -Degl=false -Dplatforms=x11,wayland,drm -Dosmesa=none"

				        - GALLIUM_ST="-Ddri3=true -Dgallium-vdpau=false -Dgallium-xvmc=false -Dgallium-omx=disabled -Dgallium-va=false -Dgallium-xa=false -Dgallium-nine=false -Dgallium-opencl=disabled"

				        - VULKAN_DRIVERS="intel,amd"

				        - LLVM_VERSION=7

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				      addons:

				        apt:

				          sources:

				            - llvm-toolchain-trusty-5.0

				            - sourceline: 'deb http://apt.llvm.org/xenial/ llvm-toolchain-xenial-7 main'

				              key_url: https://apt.llvm.org/llvm-snapshot.gpg.key

				          packages:

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            # From sources above

				            - llvm-5.0-dev

				            - llvm-7-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - python3.5

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="meson loaders/classic DRI"

				        - BUILD=meson

				        - MESON_OPTIONS="-Dvulkan-drivers=[] -Dgallium-drivers=[]"

				        - UNWIND="false"

				        - DRI_LOADERS="-Dglx=dri -Dgbm=true -Degl=true -Dplatforms=x11,wayland,drm,surfaceless -Dosmesa=classic"

				        - DRI_DRIVERS="i915,i965,r100,r200,swrast,nouveau"

				        - GALLIUM_ST="-Ddri3=true -Dgallium-vdpau=false -Dgallium-xvmc=false -Dgallium-omx=disabled -Dgallium-va=false -Dgallium-xa=false -Dgallium-nine=false -Dgallium-opencl=disabled"

				      addons:

				        apt:

				          packages:

				            - xz-utils

				            - x11proto-xf86vidmode-dev

				            - libxxf86vm-dev

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libxdamage-dev

				            - libxfixes-dev

				            - python3.5

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="make loaders/classic DRI"

				        - BUILD=make

				@@ -83,10 +91,200 @@ matrix:

				          packages:

				            - xz-utils

				            - x11proto-xf86vidmode-dev

				            - libxxf86vm-dev

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libxdamage-dev

				            - libxfixes-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        # NOTE: Building SWR is 2x (yes two) times slower than all the other

				        # gallium drivers combined.

				        # Start this early so that it doesn't hunder the run time.

				        - LABEL="meson Gallium Drivers SWR"

				        - BUILD=meson

				        - UNWIND="true"

				        - DRI_LOADERS="-Dglx=disabled -Degl=false -Dgbm=false"

				        - GALLIUM_ST="-Ddri3=false -Dgallium-vdpau=false -Dgallium-xvmc=false -Dgallium-omx=disabled -Dgallium-va=false -Dgallium-xa=false -Dgallium-nine=false -Dgallium-opencl=disabled"

				        - GALLIUM_DRIVERS="swr"

				        - LLVM_VERSION=6.0

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				      addons:

				        apt:

				          packages:

				            - llvm-6.0-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3.5

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="meson Gallium Drivers RadeonSI"

				        - BUILD=meson

				        - UNWIND="true"

				        - DRI_LOADERS="-Dglx=disabled -Degl=false -Dgbm=false"

				        - GALLIUM_ST="-Ddri3=false -Dgallium-vdpau=false -Dgallium-xvmc=false -Dgallium-omx=disabled -Dgallium-va=false -Dgallium-xa=false -Dgallium-nine=false -Dgallium-opencl=disabled"

				        - GALLIUM_DRIVERS="radeonsi"

				        - LLVM_VERSION=7

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				      addons:

				        apt:

				          sources:

				            - sourceline: 'deb http://apt.llvm.org/xenial/ llvm-toolchain-xenial-7 main'

				              key_url: https://apt.llvm.org/llvm-snapshot.gpg.key

				          packages:

				            # From sources above

				            - llvm-7-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3.5

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="meson Gallium Drivers Other"

				        - BUILD=meson

				        - UNWIND="true"

				        - DRI_LOADERS="-Dglx=disabled -Degl=false -Dgbm=false"

				        - GALLIUM_ST="-Ddri3=false -Dgallium-vdpau=false -Dgallium-xvmc=false -Dgallium-omx=disabled -Dgallium-va=false -Dgallium-xa=false -Dgallium-nine=false -Dgallium-opencl=disabled"

				        - GALLIUM_DRIVERS="i915,nouveau,kmsro,r300,r600,freedreno,svga,swrast,v3d,vc4,virgl,etnaviv"

				        - LLVM_VERSION=5.0

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				      addons:

				        apt:

				          packages:

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            - llvm-5.0-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3.5

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="meson Gallium ST Clover LLVM-5.0"

				        - BUILD=meson

				        - UNWIND="true"

				        - DRI_LOADERS="-Dglx=disabled -Degl=false -Dgbm=false"

				        - GALLIUM_ST="-Ddri3=false -Dgallium-vdpau=false -Dgallium-xvmc=false -Dgallium-omx=disabled -Dgallium-va=false -Dgallium-xa=false -Dgallium-nine=false -Dgallium-opencl=icd"

				        - GALLIUM_DRIVERS="r600"

				        - LLVM_VERSION=5.0

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				      addons:

				        apt:

				          packages:

				            - libclc-dev

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            - llvm-5.0-dev

				            - clang-5.0

				            - libclang-5.0-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="meson Gallium ST Clover LLVM-6.0"

				        - BUILD=meson

				        - UNWIND="true"

				        - DRI_LOADERS="-Dglx=disabled -Degl=false -Dgbm=false"

				        - GALLIUM_ST="-Ddri3=false -Dgallium-vdpau=false -Dgallium-xvmc=false -Dgallium-omx=disabled -Dgallium-va=false -Dgallium-xa=false -Dgallium-nine=false -Dgallium-opencl=icd"

				        - GALLIUM_DRIVERS="r600"

				        - LLVM_VERSION=6.0

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				      addons:

				        apt:

				          packages:

				            - libclc-dev

				            - llvm-6.0-dev

				            - clang-6.0

				            - libclang-6.0-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3.5

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="meson Gallium ST Clover LLVM-7"

				        - BUILD=meson

				        - UNWIND="true"

				        - DRI_LOADERS="-Dglx=disabled -Degl=false -Dgbm=false"

				        - GALLIUM_ST="-Ddri3=false -Dgallium-vdpau=false -Dgallium-xvmc=false -Dgallium-omx=disabled -Dgallium-va=false -Dgallium-xa=false -Dgallium-nine=false -Dgallium-opencl=icd"

				        - GALLIUM_DRIVERS="r600,radeonsi"

				        - LLVM_VERSION=7

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				      addons:

				        apt:

				          sources:

				            - sourceline: 'deb http://apt.llvm.org/xenial/ llvm-toolchain-xenial-7 main'

				              key_url: https://apt.llvm.org/llvm-snapshot.gpg.key

				          packages:

				            - libclc-dev

				            # From sources above

				            - llvm-7-dev

				            - clang-7

				            - libclang-7-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3.5

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="meson Gallium ST Other"

				        - BUILD=meson

				        - UNWIND="true"

				        - DRI_LOADERS="-Dglx=disabled -Degl=false -Dgbm=false"

				        - GALLIUM_ST="-Ddri3=true -Dgallium-vdpau=true -Dgallium-xvmc=true -Dgallium-omx=bellagio -Dgallium-va=true -Dgallium-xa=true -Dgallium-nine=true -Dgallium-opencl=disabled -Dosmesa=gallium"

				        # We need swrast for osmesa and nine.

				        # Nouveau supports, or builds at least against all ST.

				        - GALLIUM_DRIVERS="nouveau,swrast"

				        - LLVM_VERSION=5.0

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				      addons:

				        apt:

				          packages:

				            - llvm-5.0-dev

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            # Nine requires gcc 4.6... which is the one we have right ?

				            - libxvmc-dev

				            # Build locally, for now.

				            #- libvdpau-dev

				            #- libva-dev

				            - libomxil-bellagio-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3.5

				            - python3-pip

				            - python3-setuptools

				    - env:

				        # NOTE: Building SWR is 2x (yes two) times slower than all the other

				        # gallium drivers combined.

				@@ -95,10 +293,8 @@ matrix:

				        - BUILD=make

				        - MAKEFLAGS="-j4"

				        - MAKE_CHECK_COMMAND="true"

				        - LLVM_VERSION=5.0

				        - LLVM_VERSION=6.0

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				        - OVERRIDE_CC="gcc-4.8"

				        - OVERRIDE_CXX="g++-4.8"

				        - DRI_LOADERS="--disable-glx --disable-gbm --disable-egl"

				        - DRI_DRIVERS=""

				        - GALLIUM_ST="--enable-dri --disable-opencl --disable-xa --disable-nine --disable-xvmc --disable-vdpau --disable-va --disable-omx-bellagio --disable-gallium-osmesa"

				@@ -107,25 +303,22 @@ matrix:

				        - LIBUNWIND_FLAGS="--enable-libunwind"

				      addons:

				        apt:

				          sources:

				            - llvm-toolchain-trusty-5.0

				          packages:

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            # From sources above

				            - llvm-5.0-dev

				            - llvm-6.0-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="make Gallium Drivers RadeonSI"

				        - BUILD=make

				        - MAKEFLAGS="-j4"

				        - MAKE_CHECK_COMMAND="true"

				        - LLVM_VERSION=5.0

				        - LLVM_VERSION=7

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				        - DRI_LOADERS="--disable-glx --disable-gbm --disable-egl"

				        - DRI_DRIVERS=""

				@@ -136,18 +329,19 @@ matrix:

				      addons:

				        apt:

				          sources:

				            - llvm-toolchain-trusty-5.0

				            - sourceline: 'deb http://apt.llvm.org/xenial/ llvm-toolchain-xenial-7 main'

				              key_url: https://apt.llvm.org/llvm-snapshot.gpg.key

				          packages:

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            # From sources above

				            - llvm-5.0-dev

				            - llvm-7-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="make Gallium Drivers Other"

				        - BUILD=make

				@@ -155,23 +349,17 @@ matrix:

				        - MAKE_CHECK_COMMAND="true"

				        - LLVM_VERSION=3.9

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				        # New binutils linker is required for llvm-3.9

				        - OVERRIDE_PATH=/usr/lib/binutils-2.26/bin

				        - DRI_LOADERS="--disable-glx --disable-gbm --disable-egl"

				        - DRI_DRIVERS=""

				        - GALLIUM_ST="--enable-dri --disable-opencl --disable-xa --disable-nine --disable-xvmc --disable-vdpau --disable-va --disable-omx-bellagio --disable-gallium-osmesa"

				        - GALLIUM_DRIVERS="i915,nouveau,pl111,r300,r600,freedreno,svga,swrast,v3d,vc4,virgl,etnaviv,imx"

				        - GALLIUM_DRIVERS="i915,nouveau,kmsro,r300,r600,freedreno,svga,swrast,v3d,vc4,virgl,etnaviv"

				        - VULKAN_DRIVERS=""

				        - LIBUNWIND_FLAGS="--enable-libunwind"

				      addons:

				        apt:

				          sources:

				            - llvm-toolchain-trusty-3.9

				          packages:

				            - binutils-2.26

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            # From sources above

				            - llvm-3.9-dev

				            # Common

				            - xz-utils

				@@ -179,18 +367,15 @@ matrix:

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        # NOTE: Analogous to SWR above, building Clover is quite slow.

				        - LABEL="make Gallium ST Clover LLVM-3.9"

				        - BUILD=make

				        - MAKEFLAGS="-j4"

				        - MAKE_CHECK_COMMAND="true"

				        - LLVM_VERSION=3.9

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				        - OVERRIDE_CC=gcc-4.7

				        - OVERRIDE_CXX=g++-4.7

				        # New binutils linker is required for llvm-3.9

				        - OVERRIDE_PATH=/usr/lib/binutils-2.26/bin

				        - DRI_LOADERS="--disable-glx --disable-gbm --disable-egl"

				        - DRI_DRIVERS=""

				        - GALLIUM_ST="--disable-dri --enable-opencl --enable-opencl-icd --enable-llvm --disable-xa --disable-nine --disable-xvmc --disable-vdpau --disable-va --disable-omx-bellagio --disable-gallium-osmesa"

				@@ -199,15 +384,10 @@ matrix:

				        - LIBUNWIND_FLAGS="--enable-libunwind"

				      addons:

				        apt:

				          sources:

				            - llvm-toolchain-trusty-3.9

				          packages:

				            - binutils-2.26

				            - libclc-dev

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            - g++-4.7

				            # From sources above

				            - llvm-3.9-dev

				            - clang-3.9

				            - libclang-3.9-dev

				@@ -217,16 +397,15 @@ matrix:

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        # NOTE: Analogous to SWR above, building Clover is quite slow.

				        - LABEL="make Gallium ST Clover LLVM-4.0"

				        - BUILD=make

				        - MAKEFLAGS="-j4"

				        - MAKE_CHECK_COMMAND="true"

				        - LLVM_VERSION=4.0

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				        - OVERRIDE_CC=gcc-4.8

				        - OVERRIDE_CXX=g++-4.8

				        - DRI_LOADERS="--disable-glx --disable-gbm --disable-egl"

				        - DRI_DRIVERS=""

				        - GALLIUM_ST="--disable-dri --enable-opencl --enable-opencl-icd --enable-llvm --disable-xa --disable-nine --disable-xvmc --disable-vdpau --disable-va --disable-omx-bellagio --disable-gallium-osmesa"

				@@ -235,14 +414,10 @@ matrix:

				        - LIBUNWIND_FLAGS="--enable-libunwind"

				      addons:

				        apt:

				          sources:

				            - llvm-toolchain-trusty-4.0

				          packages:

				            - libclc-dev

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            - g++-4.8

				            # From sources above

				            - llvm-4.0-dev

				            - clang-4.0

				            - libclang-4.0-dev

				@@ -252,32 +427,27 @@ matrix:

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        # NOTE: Analogous to SWR above, building Clover is quite slow.

				        - LABEL="make Gallium ST Clover LLVM-5.0"

				        - BUILD=make

				        - MAKEFLAGS="-j4"

				        - MAKE_CHECK_COMMAND="true"

				        - LLVM_VERSION=5.0

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				        - OVERRIDE_CC=gcc-4.8

				        - OVERRIDE_CXX=g++-4.8

				        - DRI_LOADERS="--disable-glx --disable-gbm --disable-egl"

				        - DRI_DRIVERS=""

				        - GALLIUM_ST="--disable-dri --enable-opencl --enable-opencl-icd --enable-llvm --disable-xa --disable-nine --disable-xvmc --disable-vdpau --disable-va --disable-omx-bellagio --disable-gallium-osmesa"

				        - GALLIUM_DRIVERS="r600,radeonsi"

				        - GALLIUM_DRIVERS="r600"

				        - VULKAN_DRIVERS=""

				        - LIBUNWIND_FLAGS="--enable-libunwind"

				      addons:

				        apt:

				          sources:

				            - llvm-toolchain-trusty-5.0

				          packages:

				            - libclc-dev

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            - g++-4.8

				            # From sources above

				            - llvm-5.0-dev

				            - clang-5.0

				            - libclang-5.0-dev

				@@ -287,8 +457,9 @@ matrix:

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        # NOTE: Analogous to SWR above, building Clover is quite slow.

				        - LABEL="make Gallium ST Clover LLVM-6.0"

				        - BUILD=make

				        - MAKEFLAGS="-j4"

				@@ -298,21 +469,48 @@ matrix:

				        - DRI_LOADERS="--disable-glx --disable-gbm --disable-egl"

				        - DRI_DRIVERS=""

				        - GALLIUM_ST="--disable-dri --enable-opencl --enable-opencl-icd --enable-llvm --disable-xa --disable-nine --disable-xvmc --disable-vdpau --disable-va --disable-omx-bellagio --disable-gallium-osmesa"

				        - GALLIUM_DRIVERS="r600"

				        - VULKAN_DRIVERS=""

				        - LIBUNWIND_FLAGS="--enable-libunwind"

				      addons:

				        apt:

				          packages:

				            - libclc-dev

				            - llvm-6.0-dev

				            - clang-6.0

				            - libclang-6.0-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="make Gallium ST Clover LLVM-7"

				        - BUILD=make

				        - MAKEFLAGS="-j4"

				        - MAKE_CHECK_COMMAND="true"

				        - LLVM_VERSION=7

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				        - DRI_LOADERS="--disable-glx --disable-gbm --disable-egl"

				        - DRI_DRIVERS=""

				        - GALLIUM_ST="--disable-dri --enable-opencl --enable-opencl-icd --enable-llvm --disable-xa --disable-nine --disable-xvmc --disable-vdpau --disable-va --disable-omx-bellagio --disable-gallium-osmesa"

				        - GALLIUM_DRIVERS="r600,radeonsi"

				        - VULKAN_DRIVERS=""

				        - LIBUNWIND_FLAGS="--enable-libunwind"

				      addons:

				        apt:

				          sources:

				            - llvm-toolchain-trusty-6.0

				            # llvm-6 depends on gcc-4.9 which is not in main repo

				            - ubuntu-toolchain-r-test

				            - sourceline: 'deb http://apt.llvm.org/xenial/ llvm-toolchain-xenial-7 main'

				              key_url: https://apt.llvm.org/llvm-snapshot.gpg.key

				          packages:

				            - libclc-dev

				            # From sources above

				            - llvm-6.0-dev

				            - clang-6.0

				            - libclang-6.0-dev

				            - llvm-7-dev

				            - clang-7

				            - libclang-7-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				@@ -324,7 +522,7 @@ matrix:

				        - BUILD=make

				        - MAKEFLAGS="-j4"

				        - MAKE_CHECK_COMMAND="true"

				        - LLVM_VERSION=3.3

				        - LLVM_VERSION=3.5

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				        - DRI_LOADERS="--disable-glx --disable-gbm --disable-egl"

				        - DRI_DRIVERS=""

				@@ -338,8 +536,8 @@ matrix:

				      addons:

				        apt:

				          packages:

				            # We actually want to test against llvm-3.3

				            - llvm-3.3-dev

				            # We actually want to test against llvm-3.3, yet 3.5 is available

				            - llvm-3.5-dev

				            # Nine requires gcc 4.6... which is the one we have right ?

				            - libxvmc-dev

				            # Build locally, for now.

				@@ -354,12 +552,14 @@ matrix:

				            - libx11-xcb-dev

				            - libelf-dev

				            - libunwind8-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="make Vulkan"

				        - BUILD=make

				        - MAKEFLAGS="-j4"

				        - MAKE_CHECK_COMMAND="make -C src/gtest check && make -C src/intel check"

				        - LLVM_VERSION=5.0

				        - LLVM_VERSION=7

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				        - DRI_LOADERS="--disable-glx --disable-gbm --disable-egl --with-platforms=x11,wayland"

				        - DRI_DRIVERS=""

				@@ -370,17 +570,18 @@ matrix:

				      addons:

				        apt:

				          sources:

				            - llvm-toolchain-trusty-5.0

				            - sourceline: 'deb http://apt.llvm.org/xenial/ llvm-toolchain-xenial-7 main'

				              key_url: https://apt.llvm.org/llvm-snapshot.gpg.key

				          packages:

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            # From sources above

				            - llvm-5.0-dev

				            - llvm-7-dev

				            # Common

				            - xz-utils

				            - libexpat1-dev

				            - libx11-xcb-dev

				            - libelf-dev

				            - python3-pip

				            - python3-setuptools

				    - env:

				        - LABEL="scons"

				        - BUILD=scons

				@@ -405,14 +606,15 @@ matrix:

				        - SCONS_TARGET="llvm=1"

				        # Keep it symmetrical to the make build.

				        - SCONS_CHECK_COMMAND="scons llvm=1 check"

				        - LLVM_VERSION=3.3

				        - LLVM_VERSION=3.5

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				      addons:

				        apt:

				          packages:

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            - llvm-3.3-dev

				            # We actually want to test against llvm-3.3, yet 3.5 is available

				            - llvm-3.5-dev

				            # Common

				            - xz-utils

				            - x11proto-xf86vidmode-dev

				@@ -424,21 +626,14 @@ matrix:

				        - BUILD=scons

				        - SCONSFLAGS="-j4"

				        - SCONS_TARGET="swr=1"

				        - LLVM_VERSION=5.0

				        - LLVM_VERSION=6.0

				        - LLVM_CONFIG="llvm-config-${LLVM_VERSION}"

				        # Keep it symmetrical to the make build. There's no actual SWR, yet.

				        - SCONS_CHECK_COMMAND="true"

				        - OVERRIDE_CC="gcc-4.8"

				        - OVERRIDE_CXX="g++-4.8"

				      addons:

				        apt:

				          sources:

				            - llvm-toolchain-trusty-5.0

				          packages:

				            # LLVM packaging is broken and misses these dependencies

				            - libedit-dev

				            # From sources above

				            - llvm-5.0-dev

				            - llvm-6.0-dev

				            # Common

				            - xz-utils

				            - x11proto-xf86vidmode-dev

				@@ -455,7 +650,9 @@ matrix:

				    - env:

				        - LABEL="macOS meson"

				        - BUILD=meson

				        - MESON_OPTIONS="-Degl=false"

				        - UNWIND="false"

				        - DRI_LOADERS="-Dglx=dri -Dgbm=false -Degl=false -Dplatforms=x11 -Dosmesa=none"

				        - GALLIUM_ST="-Ddri3=true -Dgallium-vdpau=false -Dgallium-xvmc=false -Dgallium-omx=disabled -Dgallium-va=false -Dgallium-xa=false -Dgallium-nine=false -Dgallium-opencl=disabled"

				      os: osx

				before_install:

				@@ -482,30 +679,24 @@ before_install:

				    fi

				install:

				  - pip2 install --user mako

				  # Install a more modern meson from pip, since the version in the

				  # ubuntu repos is often quite old. Avoid >=0.45.0 as it needs python

				  # 3.5+

				  # ubuntu repos is often quite old.

				  - if test "x$BUILD" = xmeson; then

				      pip3 install --user "meson<0.45.0";

				      pip3 install --user meson;

				      pip3 install --user mako;

				    fi

				  # Install autotools build dependencies

				  - if test "x$BUILD" = xmake; then

				      pip2 install --user mako;

				    fi

				  # Install a more modern scons from pip.

				  - if test "x$BUILD" = xscons; then

				      pip2 install --user "scons>=2.4";

				      pip2 install --user mako;

				    fi

				  # Since libdrm gets updated in configure.ac regularly, try to pick up the

				  # latest version from there.

				  - for line in `grep "^LIBDRM.*_REQUIRED=" configure.ac`; do

				      old_ver=`echo $LIBDRM_VERSION | sed 's/libdrm-//'`;

				      new_ver=`echo $line | sed 's/.*REQUIRED=//'`;

				      if `echo "$old_ver,$new_ver" | tr ',' '\n' | sort -Vc 2> /dev/null`; then

				        export LIBDRM_VERSION="libdrm-$new_ver";

				      fi;

				    done

				  # Install dependencies where we require specific versions (or where

				  # disallowed by Travis CI's package whitelisting).

				@@ -535,7 +726,7 @@ install:

				      tar -jxvf $LIBPCIACCESS_VERSION.tar.bz2

				      (cd $LIBPCIACCESS_VERSION && ./configure --prefix=$HOME/prefix && make install)

				      wget http://dri.freedesktop.org/libdrm/$LIBDRM_VERSION.tar.bz2

				      wget https://dri.freedesktop.org/libdrm/$LIBDRM_VERSION.tar.bz2

				      tar -jxvf $LIBDRM_VERSION.tar.bz2

				      (cd $LIBDRM_VERSION && ./configure --prefix=$HOME/prefix --enable-vc4 --enable-freedreno --enable-etnaviv-experimental-api && make install)

				@@ -551,11 +742,11 @@ install:

				      tar -jxvf $LIBXSHMFENCE_VERSION.tar.bz2

				      (cd $LIBXSHMFENCE_VERSION && ./configure --prefix=$HOME/prefix && make install)

				      wget http://people.freedesktop.org/~aplattner/vdpau/$LIBVDPAU_VERSION.tar.bz2

				      wget https://people.freedesktop.org/~aplattner/vdpau/$LIBVDPAU_VERSION.tar.bz2

				      tar -jxvf $LIBVDPAU_VERSION.tar.bz2

				      (cd $LIBVDPAU_VERSION && ./configure --prefix=$HOME/prefix && make install)

				      wget http://www.freedesktop.org/software/vaapi/releases/libva/$LIBVA_VERSION.tar.bz2

				      wget https://www.freedesktop.org/software/vaapi/releases/libva/$LIBVA_VERSION.tar.bz2

				      tar -jxvf $LIBVA_VERSION.tar.bz2

				      (cd $LIBVA_VERSION && ./configure --prefix=$HOME/prefix --disable-wayland --disable-dummy-driver && make install)

				@@ -567,7 +758,7 @@ install:

				      tar -axvf $WAYLAND_PROTOCOLS_VERSION.tar.xz

				      (cd $WAYLAND_PROTOCOLS_VERSION && ./configure --prefix=$HOME/prefix && make install)

				      # Meson requires ninja >= 1.6, but trusty has 1.3.x

				      # Meson requires ninja >= 1.6, but xenial has 1.3.x

				      wget https://github.com/ninja-build/ninja/releases/download/v1.6.0/ninja-linux.zip

				      unzip ninja-linux.zip

				      mv ninja $HOME/prefix/bin/

				@@ -610,15 +801,13 @@ install:

				script:

				  - if test "x$BUILD" = xmake; then

				      test -n "$OVERRIDE_CC" && export CC="$OVERRIDE_CC";

				      test -n "$OVERRIDE_CXX" && export CXX="$OVERRIDE_CXX";

				      test -n "$OVERRIDE_PATH" && export PATH="$OVERRIDE_PATH:$PATH";

				      export CFLAGS="$CFLAGS -isystem`pwd`";

				      mkdir build &&

				      cd build &&

				      ../autogen.sh --enable-debug

				      ../autogen.sh

				        --enable-autotools

				        --enable-debug

				        $LIBUNWIND_FLAGS

				        $DRI_LOADERS

				        --with-dri-drivers=$DRI_DRIVERS

				@@ -631,33 +820,33 @@ script:

				    fi

				  - if test "x$BUILD" = xscons; then

				      test -n "$OVERRIDE_CC" && export CC="$OVERRIDE_CC";

				      test -n "$OVERRIDE_CXX" && export CXX="$OVERRIDE_CXX";

				      scons $SCONS_TARGET && eval $SCONS_CHECK_COMMAND;

				    fi

				  - |

				    if test "x$BUILD" = xmeson; then

				      if test -n "$LLVM_CONFIG"; then

				        # We need to control the version of llvm-config we're using, so we'll

				        # generate a native file to do so. This requires meson >=0.49

				        #

				        echo -e "[binaries]\nllvm-config = '`which $LLVM_CONFIG`'" > native.file

				      # Travis CI has moved to LLVM 5.0, and meson is detecting

				      # automatically the available version in /usr/local/bin based on

				      # the PATH env variable order preference.

				      #

				      # As for 0.44.x, Meson cannot receive the path to the

				      # llvm-config binary as a configuration parameter. See

				      # https://github.com/mesonbuild/meson/issues/2887 and

				      # https://github.com/dcbaker/meson/commit/7c8b6ee3fa42f43c9ac7dcacc61a77eca3f1bcef

				      #

				      # We want to use the custom (APT) installed version. Therefore,

				      # let's make Meson find our wanted version sooner than the one

				      # at /usr/local/bin

				      #

				      # Once this is corrected, we would still need a patch similar

				      # to:

				      # https://lists.freedesktop.org/archives/mesa-dev/2017-December/180217.html

				      test -f /usr/bin/$LLVM_CONFIG && ln -s /usr/bin/$LLVM_CONFIG $HOME/prefix/bin/llvm-config

				        $LLVM_CONFIG --version

				      else

				        : > native.file

				      fi

				      export CFLAGS="$CFLAGS -isystem`pwd`"

				      meson _build $MESON_OPTIONS

				      meson _build \

				                   --native-file=native.file \

				                   -Dbuild-tests=true \

				                   -Dlibunwind=${UNWIND} \

				                   ${DRI_LOADERS} \

				                   -Ddri-drivers=${DRI_DRIVERS:-[]} \

				                   ${GALLIUM_ST} \

				                   -Dgallium-drivers=${GALLIUM_DRIVERS:-[]} \

				                   -Dvulkan-drivers=${VULKAN_DRIVERS:-[]}

				      meson configure _build

				      ninja -C _build

				      ninja -C _build test

				    fi

									
										10

Android.common.mk
									
												View File
												
				@@ -37,7 +37,6 @@ LOCAL_CFLAGS += \

					-Wno-missing-field-initializers \

					-Wno-initializer-overrides \

					-Wno-mismatched-tags \

					-DVERSION=\"$(MESA_VERSION)\" \

					-DPACKAGE_VERSION=\"$(MESA_VERSION)\" \

					-DPACKAGE_BUGREPORT=\"https://bugs.freedesktop.org/enter_bug.cgi?product=Mesa\"

				@@ -76,6 +75,8 @@ LOCAL_CFLAGS += \

					-DMAJOR_IN_SYSMACROS \

					-DVK_USE_PLATFORM_ANDROID_KHR \

					-fvisibility=hidden \

					-fno-math-errno \

					-fno-trapping-math \

					-Wno-sign-compare

				LOCAL_CPPFLAGS += \

				@@ -89,6 +90,13 @@ LOCAL_CPPFLAGS += \

				LOCAL_CONLYFLAGS += \

					-std=c99

				# c11 timespec_get is part of bionic as well

				# https://android-review.googlesource.com/c/718518

				# This means releases from P and earlier won't need this

				ifeq ($(filter 5 6 7 8 9, $(MESA_ANDROID_MAJOR_VERSION)),)

				LOCAL_CFLAGS += -DHAVE_TIMESPEC_GET

				endif

				ifeq ($(strip $(MESA_ENABLE_ASM)),true)

				ifeq ($(TARGET_ARCH),x86)

				LOCAL_CFLAGS += \

									
										7

Android.mk
									
												View File
												
				@@ -24,7 +24,7 @@

				# BOARD_GPU_DRIVERS should be defined.  The valid values are

				#

				#   classic drivers: i915 i965

				#   gallium drivers: swrast freedreno i915g nouveau pl111 r300g r600g radeonsi vc4 virgl vmwgfx etnaviv imx

				#   gallium drivers: swrast freedreno i915g nouveau kmsro r300g r600g radeonsi vc4 virgl vmwgfx etnaviv

				#

				# The main target is libGLES_mesa.  For each classic driver enabled, a DRI

				# module will also be built.  DRI modules will be loaded by libGLES_mesa.

				@@ -52,15 +52,14 @@ gallium_drivers := \

					freedreno.HAVE_GALLIUM_FREEDRENO \

					i915g.HAVE_GALLIUM_I915 \

					nouveau.HAVE_GALLIUM_NOUVEAU \

					pl111.HAVE_GALLIUM_PL111 \

					kmsro.HAVE_GALLIUM_KMSRO \

					r300g.HAVE_GALLIUM_R300 \

					r600g.HAVE_GALLIUM_R600 \

					radeonsi.HAVE_GALLIUM_RADEONSI \

					vmwgfx.HAVE_GALLIUM_VMWGFX \

					vc4.HAVE_GALLIUM_VC4 \

					virgl.HAVE_GALLIUM_VIRGL \

					etnaviv.HAVE_GALLIUM_ETNAVIV \

					imx.HAVE_GALLIUM_IMX

					etnaviv.HAVE_GALLIUM_ETNAVIV

				ifeq ($(BOARD_GPU_DRIVERS),all)

				MESA_BUILD_CLASSIC := $(filter HAVE_%, $(subst ., , $(classic_drivers)))

									
										6

CleanSpec.mk
									
												View File
												
				@@ -10,7 +10,7 @@ $(call add-clean-step, rm -rf $(PRODUCT_OUT)/*/STATIC_LIBRARIES/libmesa_*_interm

				$(call add-clean-step, rm -rf $(PRODUCT_OUT)/*/SHARED_LIBRARIES/i9?5_dri_intermediates)

				$(call add-clean-step, rm -rf $(PRODUCT_OUT)/*/SHARED_LIBRARIES/libglapi_intermediates)

				$(call add-clean-step, rm -rf $(PRODUCT_OUT)/*/SHARED_LIBRARIES/libGLES_mesa_intermediates)

				$(call add-clean-step, rm -rf $(HOST_OUT_release)/*/EXECUTABLES/mesa_*_intermediates)

				$(call add-clean-step, rm -rf $(HOST_OUT_release)/*/EXECUTABLES/glsl_compiler_intermediates)

				$(call add-clean-step, rm -rf $(HOST_OUT_release)/*/STATIC_LIBRARIES/libmesa_*_intermediates)

				$(call add-clean-step, rm -rf $(HOST_OUT)/*/EXECUTABLES/mesa_*_intermediates)

				$(call add-clean-step, rm -rf $(HOST_OUT)/*/EXECUTABLES/glsl_compiler_intermediates)

				$(call add-clean-step, rm -rf $(HOST_OUT)/*/STATIC_LIBRARIES/libmesa_*_intermediates)

				$(call add-clean-step, rm -rf $(PRODUCT_OUT)/*/SHARED_LIBRARIES/*_dri_intermediates)

									
										3

Makefile.am
									
												View File
												
				@@ -22,6 +22,7 @@

				SUBDIRS = src

				AM_DISTCHECK_CONFIGURE_FLAGS = \

					--enable-autotools \

					--enable-dri \

					--enable-dri3 \

					--enable-egl \

				@@ -45,7 +46,7 @@ AM_DISTCHECK_CONFIGURE_FLAGS = \

					--enable-libunwind \

					--with-platforms=x11,wayland,drm,surfaceless \

					--with-dri-drivers=i915,i965,nouveau,radeon,r200,swrast \

					--with-gallium-drivers=i915,nouveau,r300,pl111,r600,radeonsi,freedreno,svga,swrast,vc4,tegra,virgl,swr,etnaviv,imx \

					--with-gallium-drivers=i915,nouveau,r300,kmsro,r600,radeonsi,freedreno,svga,swrast,vc4,tegra,virgl,swr,etnaviv \

					--with-vulkan-drivers=intel,radeon

				ACLOCAL_AMFLAGS = -I m4

7

REVIEWERS

View File

@@ -72,7 +72,9 @@ F: src/loader/
 EGL
 R: Eric Engestrom <eric@engestrom.ch>
 R: Emil Velikov <emil.l.velikov@gmail.com>
 F: src/egl/
 F: include/EGL/
 HAIKU
 R: Alexander von Gluck IV <kallisti5@unixzen.com>
@@ -136,3 +138,8 @@ F:	src/gallium/drivers/freedreno/
 GLX
 R: Adam Jackson <ajax@redhat.com>
 F: src/glx/
 VULKAN
 R: Eric Engestrom <eric@engestrom.ch>
 F: src/vulkan/
 F: include/vulkan/

									
										1

SConstruct
									
												View File
												
				@@ -31,6 +31,7 @@ import common

				# Minimal scons version

				EnsureSConsVersion(2, 4)

				EnsurePythonVersion(2, 7)

				#######################################################################

2

VERSION

View File

@@ -1 +1 @@
 .2.8
 .0.8

									
										30

appveyor.yml
									
												View File
												
				@@ -33,31 +33,41 @@ branches:

				# - https://www.appveyor.com/blog/2014/06/04/shallow-clone-for-git-repositories

				clone_depth: 100

				# https://www.appveyor.com/docs/build-cache/

				cache:

				- win_flex_bison-2.5.9.zip

				- llvm-5.0.1-msvc2015-mtd.7z

				- '%LOCALAPPDATA%\pip\Cache -> appveyor.yml'

				- win_flex_bison-2.5.15.zip

				- llvm-5.0.1-msvc2017-mtd.7z

				os: Visual Studio 2015

				os: Visual Studio 2017

				init:

				# Appveyor defaults core.autocrlf to input instead of the default (true), but

				# that can hide problems processing CRLF text on Windows

				- git config --global core.autocrlf true

				environment:

				  WINFLEXBISON_ARCHIVE: win_flex_bison-2.5.9.zip

				  LLVM_ARCHIVE: llvm-5.0.1-msvc2015-mtd.7z

				  WINFLEXBISON_VERSION: 2.5.15

				  LLVM_ARCHIVE: llvm-5.0.1-msvc2017-mtd.7z

				install:

				# Check git config

				- git config core.autocrlf

				# Check pip

				- python --version

				- python -m pip --version

				# Install Mako

				- python -m pip install Mako==1.0.6

				- python -m pip install Mako==1.0.7

				# Install pywin32 extensions, needed by SCons

				- python -m pip install pypiwin32

				# Install python wheels, necessary to install SCons via pip

				- python -m pip install wheel

				# Install SCons

				- python -m pip install scons==2.5.1

				- python -m pip install scons==3.0.1

				- scons --version

				# Install flex/bison

				- if not exist "%WINFLEXBISON_ARCHIVE%" appveyor DownloadFile "https://downloads.sourceforge.net/project/winflexbison/old_versions/%WINFLEXBISON_ARCHIVE%"

				- set WINFLEXBISON_ARCHIVE=win_flex_bison-%WINFLEXBISON_VERSION%.zip

				- if not exist "%WINFLEXBISON_ARCHIVE%" appveyor DownloadFile "https://github.com/lexxmark/winflexbison/releases/download/v%WINFLEXBISON_VERSION%/%WINFLEXBISON_ARCHIVE%"

				- 7z x -y -owinflexbison\ "%WINFLEXBISON_ARCHIVE%" > nul

				- set Path=%CD%\winflexbison;%Path%

				- win_flex --version

				@@ -69,10 +79,10 @@ install:

				- set LLVM=%CD%\llvm

				build_script:

				- scons -j%NUMBER_OF_PROCESSORS% MSVC_VERSION=14.0 llvm=1

				- scons -j%NUMBER_OF_PROCESSORS% MSVC_VERSION=14.1 llvm=1

				after_build:

				- scons -j%NUMBER_OF_PROCESSORS% MSVC_VERSION=14.0 llvm=1 check

				- scons -j%NUMBER_OF_PROCESSORS% MSVC_VERSION=14.1 llvm=1 check

				# It's possible to setup notification here, as described in

103

bin/.cherry-ignore

View File

@@ -1,56 +1,47 @@
 # fixes:  This commit has more than one Fixes tag but the commit it
 #         addresses didn't land in branch.
 ff1c479968819b93c46d24bd898e89ce14ac401 autotools: don't ship the git_sha1.h generated in git in the tarballs
 # pick:   This commit addresses a regression introduced by previous
 #         commit fa9e6c235da, which didn't make it for 18.2.
 a72dbc461bdb7714656e62cd8f4b00a404c2e6e0 mesa: allow GL_UNSIGNED_BYTE type for SNORM reads
 # fixes:  This commit has more than one Fixes tag but the commit it
 #         addresses didn't land in branch.
 c9f54486959716762e6818dabb0a73a8cd46df67 radeonsi: fix regression in indirect input swizzles.
 # extra:  Just some comments update.
 ad9917e187c1e9dbb053d3c98aa0e39fa374059 anv/blorp: Fix a comment as per Nanley's review feedback
 # fixes:  This commit was immediately reverted by commit 2dce1175c1c.
 aec44c0d9c4c0649c362199fac97efe0a3b38a4 i965/tools: 32bit compilation with meson
 # pick:   This commit was reverted by commit 95bb7d82ca8.
 abb56f6b1a0cd4946b13b6caf24fb46e500 radv: fix descriptor pool allocation size
 # pick:   There is a specific patch for stable branch for this commit.
 d495bec25bd7584de4e988c2b4528c1996bc1d0 radeonsi: NaN should pass kill_if
 # pick:   This commit reverts 0fa9e6d7b30 which did not land in branch.
 aa02d7e8781c25ee18b6da97606300808c84973a Revert "anv/skylake: disable ForceThreadDispatchEnable"
 # pick:   Explicit 18.3 only nominations.
 b1b2dd06a7b777e862b525302b15bcaf407d3648 radv: add missing TFB queries support to CmdCopyQueryPoolsResults()
 e0c7114eb3c19d4c2653f661698a6baa3bc9bedf st/mesa: disable L3 thread pinning
 b5f213bb1dcde22949dffe9d3a431fecd5d0f33b radv: binding streamout buffers doesn't change context regs
 9367514524f70faad99c721bac92339c8ff8bad9 radeonsi: fix video APIs on Raven2
 ea9f95e2a67eca90bb84eea24e7b4b804b3b1345 radeonsi: go back to using bottom-of-pipe for beginning of TIME_ELAPSED
 f401b0ce6e6650e1a85e9bb2be23d5ff08812b8 anv,radv: Disable VK_EXT_pci_bus_info
 c77f4c76ddfe0b692b430b012b65f6981a53336 meson: Add support for gnu hurd
 a90886921eb1d5d73b40aadd6fd3f340041bd26 meson: Add toggle for glx-direct
 # fixes:  This commit was reverted by commit 5f312e95f87.
 a9031bf9b55602d93cccef6c926e2179c23205b4 i965/batch: avoid reverting batch buffer if saved state is an empty
 # extra:  intel/aub_viewer is not present in branch
 ac324a6809c09c54d3b0bfdb00e5e62987ec4ad8 intel/aub_viewer: fix dynamic state printing
 db898cef2f5a455138e5845689c075aadba1c1f intel/aub_viewer: Print blend states properly
 # fixes: This commit requires commits 854202f70e6 and 84bc5738401 which did not
 #        land in branch.
 c120dbfe4d18240315ecec9b43a61aeb9ab239ac mesa/main: fix incorrect depth-error
 # fixes: This commit fixes commits b4476138d5ad and aa0fed10d357 which did not
 #        land in branch.
 d0c7b079d07f751eb37ecaa45a2a6db920d71d7a freedreno: Fix autotools build.
 # pick:  While this commit does not include the proper CC tag, it was intended
 #        to be applied only in 18.3 branch.
 d2d2e4c57015bc60edfcc656062c3a7472 mesa: Revert INTEL_fragment_shader_ordering support
 # fixes: The changes this commit provides are already included in the branch.
 ff6f1dd0d3c6b4c15ca51b478b2884d14f6a1e06 meson: libfreedreno depends upon libdrm (for fence support)
 # fixes: This commit requires commits aeaf8dbd097 and 7484bc894b9 which did not
 #        land in branch.
 f67dea5e19ef14187be0e8d0f61b1f764c7ccb4f radv: Fix multiview depth clears
 # fixes: There is a specific patch for stable franch for this commit.
 bde9f482de69528db5ccf5dd6bbfd8359adfbb19 ac: split 16-bit ssbo loads that may not be dword aligned
 # pick:  This commit is nominated to stable, but fixes commit b3c61469255 which
 #        is in 18.3 stable. Hence, this commit is considered as nominated to
 #        18.3 stable.
 f7b452a550c66cfb9a8c9518e35635eb25947 nir: properly find the entry to keep in copy_prop_vars
 # pick:  This commit is nominated to stable, but fixes commit 11dc1307794 which
 #        is not in the current stable branch.
 d6110d4d547ad98dce7a89d0e020ab5be5aaaad6 intel/compiler: move nir_lower_bool_to_int32 before nir_lower_locals_to_regs
 # Both of these were already merged with different shas
 da48cba61ef6fefb799bf96e6364b70dbf4ec712
 c812c740e60c14060eb89db66039111881a0f42f
 # The commit these fix was reverted from 19.0, but fixed for 19.1 due
 # to the number of fixes required to make that commit work
 d8f80af3a17354508f2ec9d6559c915d5be351d
 c0c69729b6d72a5297122856c8fe48510e90764
 e90c09965818b02e359474a6f7446b41d647
 b031c643491a92a5574c7a4bd659df33f2d89bb6
 # These were manually rebased by Jason, thanks!
 ab95b849e66f3221d80a67eef2ec6e3730901a8
 c30fffeec1732c21d600c036f95f8cdb1bb5487
 # This doesn't actually appliy to 19.0
 f58c6ba8099859ea25900214dbbd3814a92
 # This was superceeded by a manual backport from ken
 6981069fc805da1afc867ca3c905075d146d7ff9
 # This was manually backported
 bc1942c9ddce4e796322a7561f06af5dec0decd
 # This doesn't need to be applied, it already seems to exist in stable.
 dc78407d0d1e03ceddf8889b217e8fd113568d
 # This was backported manually
 f18c43d1df64135e8968a7d4fbfd2c9918b76ae
 # These were de-nominated since they don't apply nicley
 88105375c978f9de82af8c654051e5aa16d61614
 c9358621276ae49162e58d4a16fe37abda6a347f
 # These are only for 19.1
 c3538ab5702ceeead284c2b5f9e700f3082c8135
 d2aa65eb1892f7b300ac24560f9dbda6b600b5a7
 e35df52aa2f7d770f929a0866a0faa89c261a9
 f1b070bad34c46c4bcc6c679fa533bf6b4b79e5
 ad2b4aa37806779bdfc15d704940136c3db21eb4
 dc57eebd578b976b94c54d083377ba0920d43a8
 ac6756898a1bd30bde04555437a55c378726
 ffd2f948fee271cbbce93708fc508dab7cb5d14c
 # This was manually rebased and the script doesn't understand that for some
 # reason
 cb7c9b2a9352cc73a2d3becc0427c53c8baf153a

									
										8

bin/get-pick-list.sh
									
												View File
												
				@@ -13,12 +13,12 @@

				is_stable_nomination()

				{

					git show --summary "$1" | grep -q -i -o "CC:.*mesa-stable"

					git show --pretty=medium --summary "$1" | grep -q -i -o "CC:.*mesa-stable"

				}

				is_typod_nomination()

				{

					git show --summary "$1" | grep -q -i -o "CC:.*mesa-dev"

					git show --pretty=medium --summary "$1" | grep -q -i -o "CC:.*mesa-dev"

				}

				fixes=

				@@ -44,7 +44,7 @@ is_sha_nomination()

						# Treat only the current line

						id=`echo "$fixes" | tail -n $fixes_count | head -n 1 | cut -d : -f 2`

						fixes_count=$(($fixes_count-1))

						if ! git show $id &>/dev/null; then

						if ! git show $id >/dev/null 2>&1; then

							echo WARNING: Commit $1 lists invalid sha $id

						fi

					done

				@@ -143,7 +143,7 @@ do

					esac

					printf "[ %8s ] " "$tag"

					git --no-pager show --summary --oneline $sha

					git --no-pager show --no-patch --oneline $sha

				done

				rm -f already_picked

									
										29

bin/git_sha1_gen.py
									
										Executable file → Normal file
									
												View File
												
				@@ -1,5 +1,3 @@

				#!/usr/bin/env python

				"""

				Generate the contents of the git_sha1.h file.

				The output of this script goes to stdout.

				@@ -28,22 +26,25 @@ def get_git_sha1():

				        git_sha1 = ''

				    return git_sha1

				def write_if_different(contents):

				    """

				    Avoid touching the output file if it doesn't need modifications

				    Useful to avoid triggering rebuilds when nothing has changed.

				    """

				    if os.path.isfile(args.output):

				        with open(args.output, 'r') as file:

				            if file.read() == contents:

				                return

				    with open(args.output, 'w') as file:

				        file.write(contents)

				parser = argparse.ArgumentParser()

				parser.add_argument('--output', help='File to write the #define in',

				        required=True)

				                    required=True)

				args = parser.parse_args()

				git_sha1 = os.environ.get('MESA_GIT_SHA1_OVERRIDE', get_git_sha1())[:10]

				if git_sha1:

				    git_sha1_h_in_path = os.path.join(os.path.dirname(sys.argv[0]),

				            '..', 'src', 'git_sha1.h.in')

				    with open(git_sha1_h_in_path , 'r') as git_sha1_h_in:

				        new_sha1 = git_sha1_h_in.read().replace('@VCS_TAG@', git_sha1)

				        if os.path.isfile(args.output):

				            with open(args.output, 'r') as git_sha1_h:

				                if git_sha1_h.read() == new_sha1:

				                    quit()

				        with open(args.output, 'w') as git_sha1_h:

				            git_sha1_h.write(new_sha1)

				    write_if_different('#define MESA_GIT_SHA1 " (git-' + git_sha1 + ')"')

				else:

				    open(args.output, 'w').close()

				    write_if_different('#define MESA_GIT_SHA1 ""')

									
										15

bin/install_megadrivers.py
									
										Executable file → Normal file
									
												View File
												
				@@ -1,4 +1,3 @@

				#!/usr/bin/env python

				# encoding=utf-8

				# Copyright © 2017-2018 Intel Corporation

				@@ -36,7 +35,11 @@ def main():

				    args = parser.parse_args()

				    if os.path.isabs(args.libdir):

				        to = os.path.join(os.environ.get('DESTDIR', '/'), args.libdir[1:])

				        destdir = os.environ.get('DESTDIR')

				        if destdir:

				            to = os.path.join(destdir, args.libdir[1:])

				        else:

				            to = args.libdir

				    else:

				        to = os.path.join(os.environ['MESON_INSTALL_DESTDIR_PREFIX'], args.libdir)

				@@ -46,7 +49,6 @@ def main():

				        if os.path.lexists(to):

				            os.unlink(to)

				        os.makedirs(to)

				    shutil.copy(args.megadriver, master)

				    for driver in args.drivers:

				        abs_driver = os.path.join(to, driver)

				@@ -68,7 +70,14 @@ def main():

				                name, ext = os.path.splitext(name)

				        finally:

				            os.chdir(ret)

				    # Remove meson-created master .so and symlinks

				    os.unlink(master)

				    name, ext = os.path.splitext(master)

				    while ext != '.so':

				        if os.path.lexists(name):

				            os.unlink(name)

				        name, ext = os.path.splitext(name)

				if __name__ == '__main__':

									
										88

bin/meson-cmd-extract.py
									
										Executable file
									
												View File
												
				@@ -0,0 +1,88 @@

				#!/usr/bin/env python3

				# Copyright © 2019 Intel Corporation

				# Permission is hereby granted, free of charge, to any person obtaining a copy

				# of this software and associated documentation files (the "Software"), to deal

				# in the Software without restriction, including without limitation the rights

				# to use, copy, modify, merge, publish, distribute, sublicense, and/or sell

				# copies of the Software, and to permit persons to whom the Software is

				# furnished to do so, subject to the following conditions:

				# The above copyright notice and this permission notice shall be included in

				# all copies or substantial portions of the Software.

				# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR

				# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,

				# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE

				# AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER

				# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,

				# OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE

				# SOFTWARE.

				"""This script reads a meson build directory and gives back the command line it

				was configured with.

				This only works for meson 0.49.0 and newer.

				"""

				import argparse

				import ast

				import configparser

				import pathlib

				import sys

				def parse_args() -> argparse.Namespace:

				    """Parse arguments."""

				    parser = argparse.ArgumentParser()

				    parser.add_argument(

				        'build_dir',

				        help='Path the meson build directory')

				    args = parser.parse_args()

				    return args

				def load_config(path: pathlib.Path) -> configparser.ConfigParser:

				    """Load config file."""

				    conf = configparser.ConfigParser()

				    with path.open() as f:

				        conf.read_file(f)

				    return conf

				def build_cmd(conf: configparser.ConfigParser) -> str:

				    """Rebuild the command line."""

				    args = []

				    for k, v in conf['options'].items():

				        if ' ' in v:

				            args.append(f'-D{k}="{v}"')

				        else:

				            args.append(f'-D{k}={v}')

				    cf = conf['properties'].get('cross_file')

				    if cf:

				        args.append('--cross-file={}'.format(cf))

				    nf = conf['properties'].get('native_file')

				    if nf:

				        # this will be in the form "['str', 'str']", so use ast.literal_eval to

				        # convert it to a list of strings.

				        nf = ast.literal_eval(nf)

				        args.extend(['--native-file={}'.format(f) for f in nf])

				    return ' '.join(args)

				def main():

				    args = parse_args()

				    path = pathlib.Path(args.build_dir, 'meson-private', 'cmd_line.txt')

				    if not path.exists():

				        print('Cannot find the necessary file to rebuild command line. '

				              'Is your meson version >= 0.49.0?', file=sys.stderr)

				        sys.exit(1)

				    conf = load_config(path)

				    cmd = build_cmd(conf)

				    print(cmd)

				if __name__ == '__main__':

				    main()

									
										3

common.py
									
												View File
												
				@@ -99,8 +99,6 @@ def AddOptions(opts):

				                        'enable static code analysis where available', 'no'))

				    opts.Add(BoolOption('asan', 'enable Address Sanitizer', 'no'))

				    opts.Add('toolchain', 'compiler toolchain', default_toolchain)

				    opts.Add(BoolOption('gles', 'EXPERIMENTAL: enable OpenGL ES support',

				                        'no'))

				    opts.Add(BoolOption('llvm', 'use LLVM', default_llvm))

				    opts.Add(BoolOption('openmp', 'EXPERIMENTAL: compile with openmp (swrast)',

				                        'no'))

				@@ -110,3 +108,4 @@ def AddOptions(opts):

				    opts.Add(BoolOption('swr', 'Build OpenSWR', 'no'))

				    if host_platform == 'windows':

				        opts.Add('MSVC_VERSION', 'Microsoft Visual C/C++ version')

				        opts.Add('MSVC_USE_SCRIPT', 'Microsoft Visual C/C++ vcvarsall script', True)

98

configure.ac

View File

@@ -52,6 +52,19 @@ mingw*)
     ;;
 esac
 AC_ARG_ENABLE(autotools,
    [AS_HELP_STRING([--enable-autotools],
                    [Enable the use of this autotools based build configuration])],
    [enable_autotools=$enableval], [enable_autotools=no])
 if test "x$enable_autotools" != "xyes" ; then
     AC_MSG_ERROR([the autotools build system has been deprecated in favour of
     meson and will be removed eventually. For instructions on how to use meson
     see https://www.mesa3d.org/meson.html.
     If you still want to use the autotools build, then add --enable-autotools
     to the configure command line.])
 fi
 # Support silent build rules, requires at least automake-1.11. Disable
 # by either passing --disable-silent-rules to configure or passing V=1
 # to make
@@ -74,11 +87,10 @@ AC_SUBST([OPENCL_VERSION])
 # in the first entry.
 LIBDRM_REQUIRED=2.4.75
 LIBDRM_RADEON_REQUIRED=2.4.71
 LIBDRM_AMDGPU_REQUIRED=2.4.91
 LIBDRM_AMDGPU_REQUIRED=2.4.97
 LIBDRM_INTEL_REQUIRED=2.4.75
 LIBDRM_NVVIEUX_REQUIRED=2.4.66
 LIBDRM_NOUVEAU_REQUIRED=2.4.66
 LIBDRM_FREEDRENO_REQUIRED=2.4.92
 LIBDRM_ETNAVIV_REQUIRED=2.4.89
 LIBDRM_VC4_REQUIRED=2.4.89
@@ -108,9 +120,9 @@ dnl LLVM versions
 LLVM_REQUIRED_GALLIUM=3.3.0
 LLVM_REQUIRED_OPENCL=3.9.0
 LLVM_REQUIRED_R600=3.9.0
 LLVM_REQUIRED_RADEONSI=5.0.0
 LLVM_REQUIRED_RADV=5.0.0
 LLVM_REQUIRED_SWR=5.0.0
 LLVM_REQUIRED_RADEONSI=7.0.0
 LLVM_REQUIRED_RADV=7.0.0
 LLVM_REQUIRED_SWR=7.0.0
 dnl Check for progs
 AC_PROG_CPP
@@ -125,7 +137,8 @@ AC_PROG_GREP
 AC_PROG_NM
 AM_PROG_AS
 AX_CHECK_GNU_MAKE
 AC_CHECK_PROGS([PYTHON2], [python2.7 python2 python])
 AM_PATH_PYTHON([2.7],, [AM_PATH_PYTHON([3.4],, [:])])
 AC_PROG_SED
 AC_PROG_MKDIR_P
@@ -157,7 +170,7 @@ fi
 AX_CHECK_PYTHON_MAKO_MODULE($PYTHON_MAKO_REQUIRED)
 if test -z "$PYTHON2"; then
 if test "$PYTHON" = ":"; then
     if test ! -f "$srcdir/src/util/format_srgb.c"; then
         AC_MSG_ERROR([Python not found - unable to generate sources])
     fi
@@ -310,6 +323,7 @@ AX_CHECK_COMPILE_FLAG([-Werror=missing-prototypes],            [CFLAGS="$CFLAGS
 AX_CHECK_COMPILE_FLAG([-Wmissing-prototypes],                  [CFLAGS="$CFLAGS -Wmissing-prototypes"])
 dnl Dylan Baker: gcc and clang always accepr -Wno-*, hence check for the original warning, then set the no-* flag
 AX_CHECK_COMPILE_FLAG([-Wmissing-field-initializers],          [CFLAGS="$CFLAGS -Wno-missing-field-initializers"])
 AX_CHECK_COMPILE_FLAG([-Wformat-truncation],                   [CFLAGS="$CFLAGS -Wno-format-truncation"])
 AX_CHECK_COMPILE_FLAG([-fno-math-errno],                       [CFLAGS="$CFLAGS -fno-math-errno"])
 AX_CHECK_COMPILE_FLAG([-fno-trapping-math],                    [CFLAGS="$CFLAGS -fno-trapping-math"])
@@ -329,6 +343,7 @@ AX_CHECK_COMPILE_FLAG([-fno-math-errno],                       [CXXFLAGS="$CXXFL
 AX_CHECK_COMPILE_FLAG([-fno-trapping-math],                    [CXXFLAGS="$CXXFLAGS -fno-trapping-math"])
 AX_CHECK_COMPILE_FLAG([-fvisibility=hidden],                   [VISIBILITY_CXXFLAGS="-fvisibility=hidden"])
 AX_CHECK_COMPILE_FLAG([-Wmissing-field-initializers],          [CXXFLAGS="$CXXFLAGS -Wno-missing-field-initializers"])
 AX_CHECK_COMPILE_FLAG([-Wformat-truncation],                   [CXXFLAGS="$CXXFLAGS -Wno-format-truncation"])
 AC_LANG_POP([C++])
 # Flags to help ensure that certain portions of the code -- and only those
@@ -976,6 +991,22 @@ if test "x$pthread_stubs_possible" = xyes; then
     PKG_CHECK_MODULES(PTHREADSTUBS, pthread-stubs >= 0.4)
 fi
 save_LIBS="$LIBS"
 LIBS="$PTHREAD_LIBS"
 AC_MSG_CHECKING(whether pthread_setaffinity_np is supported)
 AC_LINK_IFELSE([AC_LANG_SOURCE([[
     #define _GNU_SOURCE
     #include <pthread.h>
     int main() {
        void *a = (void*) &pthread_setaffinity_np;
        long b = (long) a;
        return (int) b;
     }]])],
   [DEFINES="$DEFINES -DHAVE_PTHREAD_SETAFFINITY"];
    AC_MSG_RESULT([yes]),
    AC_MSG_RESULT([no]))
 LIBS="$save_LIBS"
 dnl Check for futex for fast inline simple_mtx_t.
 AC_CHECK_HEADER([linux/futex.h], [DEFINES="$DEFINES -DHAVE_LINUX_FUTEX_H"])
@@ -1377,7 +1408,7 @@ GALLIUM_DRIVERS_DEFAULT="r300,r600,svga,swrast"
 AC_ARG_WITH([gallium-drivers],
     [AS_HELP_STRING([--with-gallium-drivers@<:@=DIRS...@:>@],
         [comma delimited Gallium drivers list, e.g.
         "i915,nouveau,r300,r600,radeonsi,freedreno,pl111,svga,swrast,swr,tegra,v3d,vc4,virgl,etnaviv,imx"
         "i915,nouveau,r300,r600,radeonsi,freedreno,kmsro,svga,swrast,swr,tegra,v3d,vc4,virgl,etnaviv"
         @<:@default=r300,r600,svga,swrast@:>@])],
     [with_gallium_drivers="$withval"],
     [with_gallium_drivers="$GALLIUM_DRIVERS_DEFAULT"])
@@ -1826,6 +1857,10 @@ for plat in $platforms; do
         PKG_CHECK_MODULES([WAYLAND_SCANNER], [wayland-scanner],
                           WAYLAND_SCANNER=`$PKG_CONFIG --variable=wayland_scanner wayland-scanner`,
                           WAYLAND_SCANNER='')
         PKG_CHECK_EXISTS([wayland-scanner >= 1.15],
                           AC_SUBST(SCANNER_ARG, 'private-code'),
                           AC_SUBST(SCANNER_ARG, 'code'))
         if test "x$WAYLAND_SCANNER" = x; then
             AC_PATH_PROG([WAYLAND_SCANNER], [wayland-scanner], [:])
         fi
@@ -1842,6 +1877,7 @@ for plat in $platforms; do
         ;;
     drm)
         test "x$enable_egl" = "xyes" &&
         test "x$enable_gbm" = "xno" &&
                 AC_MSG_ERROR([EGL platform drm needs gbm])
         DEFINES="$DEFINES -DHAVE_DRM_PLATFORM"
@@ -1886,7 +1922,7 @@ if test x"$enable_dri3" = xyes; then
     dri3_modifier_modules="xcb-dri3 >= $XCBDRI3_MODIFIERS_REQUIRED xcb-present >= $XCBPRESENT_MODIFIERS_REQUIRED"
     PKG_CHECK_MODULES([XCB_DRI3_MODIFIERS], [$dri3_modifier_modules], [have_dri3_modifiers=yes], [have_dri3_modifiers=no])
     if test "x$have_dri3_modifiers" == xyes; then
     if test "x$have_dri3_modifiers" = xyes; then
         DEFINES="$DEFINES -DHAVE_DRI3_MODIFIERS"
     fi
 fi
@@ -2321,7 +2357,7 @@ if test "x$enable_xvmc" = xyes -o \
         "x$enable_omx_tizonia" = xyes -o \
         "x$enable_va" = xyes; then
     if echo $platforms | grep -q "x11"; then
         PKG_CHECK_MODULES([VL], [x11-xcb xcb xcb-dri2 >= $XCBDRI2_REQUIRED])
         PKG_CHECK_MODULES([VL], [x11-xcb xcb xcb-dri2 >= $XCBDRI2_REQUIRED libdrm >= $LIBDRM_REQUIRED])
     fi
     need_gallium_vl_winsys=yes
 fi
@@ -2698,7 +2734,6 @@ if test -n "$with_gallium_drivers"; then
             ;;
         xfreedreno)
             HAVE_GALLIUM_FREEDRENO=yes
             PKG_CHECK_MODULES([FREEDRENO], [libdrm >= $LIBDRM_FREEDRENO_REQUIRED libdrm_freedreno >= $LIBDRM_FREEDRENO_REQUIRED])
             require_libdrm "freedreno"
             ;;
         xetnaviv)
@@ -2706,9 +2741,6 @@ if test -n "$with_gallium_drivers"; then
             PKG_CHECK_MODULES([ETNAVIV], [libdrm >= $LIBDRM_ETNAVIV_REQUIRED libdrm_etnaviv >= $LIBDRM_ETNAVIV_REQUIRED])
             require_libdrm "etnaviv"
             ;;
        ximx)
             HAVE_GALLIUM_IMX=yes
             ;;
         xtegra)
             HAVE_GALLIUM_TEGRA=yes
             require_libdrm "tegra"
@@ -2795,8 +2827,8 @@ if test -n "$with_gallium_drivers"; then
                                DEFINES="$DEFINES -DUSE_V3D_SIMULATOR"],
                               [USE_V3D_SIMULATOR=no])
             ;;
         xpl111)
             HAVE_GALLIUM_PL111=yes
         xkmsro)
             HAVE_GALLIUM_KMSRO=yes
             ;;
         xvirgl)
             HAVE_GALLIUM_VIRGL=yes
@@ -2813,9 +2845,8 @@ if test -n "$with_gallium_drivers"; then
 fi
 # XXX: Keep in sync with LLVM_REQUIRED_SWR
 AM_CONDITIONAL(SWR_INVALID_LLVM_VERSION, test "x$LLVM_VERSION" != x5.0.0 -a \
                                               "x$LLVM_VERSION" != x5.0.1 -a \
                                               "x$LLVM_VERSION" != x5.0.2)
 AM_CONDITIONAL(SWR_INVALID_LLVM_VERSION, test "x$LLVM_VERSION" != x7.0.0 -a \
                                               "x$LLVM_VERSION" != x7.0.1)
 if test "x$enable_llvm" = "xyes" -a "$with_gallium_drivers"; then
     llvm_require_version $LLVM_REQUIRED_GALLIUM "gallium"
@@ -2830,12 +2861,8 @@ AM_CONDITIONAL(HAVE_SWR_BUILTIN, test "x$HAVE_SWR_BUILTIN" = xyes)
 dnl We need to validate some needed dependencies for renderonly drivers.
 if test "x$HAVE_GALLIUM_ETNAVIV" != xyes -a "x$HAVE_GALLIUM_IMX" = xyes  ; then
     AC_MSG_ERROR([Building with imx requires etnaviv])
 fi
 if test "x$HAVE_GALLIUM_VC4" != xyes -a "x$HAVE_GALLIUM_PL111" = xyes  ; then
     AC_MSG_ERROR([Building with pl111 requires vc4])
 if test "x$HAVE_GALLIUM_VC4" != xyes -a "x$HAVE_GALLIUM_KMSRO" = xyes  ; then
     AC_MSG_ERROR([Building with kmsro requires vc4])
 fi
 if test "x$HAVE_GALLIUM_NOUVEAU" != xyes -a "x$HAVE_GALLIUM_TEGRA" = xyes; then
@@ -2883,6 +2910,7 @@ if test "x$enable_llvm" = xyes; then
     LLVM_LDFLAGS=`$LLVM_CONFIG --ldflags`
     LLVM_CFLAGS=$LLVM_CPPFLAGS   # CPPFLAGS seem to be sufficient
     LLVM_CXXFLAGS=`strip_unwanted_llvm_flags "$LLVM_CONFIG --cxxflags"`
     LLVM_CXXFLAGS="$CXX11_CXXFLAGS $LLVM_CXXFLAGS"
     dnl Set LLVM_LIBS - This is done after the driver configuration so
     dnl that drivers can add additional components to LLVM_COMPONENTS.
@@ -2917,11 +2945,11 @@ if test "x$enable_llvm" = xyes; then
     fi
     dnl The gallium-xlib GLX and gallium OSMesa targets directly embed the
     dnl swr/llvmpipe driver into the final binary.  Adding LLVM_LIBS results in
     dnl swr/llvmpipe driver into the final binary.  Adding LLVM_LIBS results in
     dnl the LLVM library propagated in the Libs.private of the respective .pc
     dnl file which ensures complete dependency information when statically
     dnl linking.
     if test "x$enable_glx" == xgallium-xlib; then
     if test "x$enable_glx" = xgallium-xlib; then
         GL_PC_LIB_PRIV="$GL_PC_LIB_PRIV $LLVM_LIBS"
     fi
     if test "x$enable_gallium_osmesa" = xyes; then
@@ -2931,14 +2959,13 @@ fi
 AM_CONDITIONAL(HAVE_GALLIUM_SVGA, test "x$HAVE_GALLIUM_SVGA" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_I915, test "x$HAVE_GALLIUM_I915" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_PL111, test "x$HAVE_GALLIUM_PL111" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_KMSRO, test "x$HAVE_GALLIUM_KMSRO" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_R300, test "x$HAVE_GALLIUM_R300" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_R600, test "x$HAVE_GALLIUM_R600" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_RADEONSI, test "x$HAVE_GALLIUM_RADEONSI" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_NOUVEAU, test "x$HAVE_GALLIUM_NOUVEAU" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_FREEDRENO, test "x$HAVE_GALLIUM_FREEDRENO" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_ETNAVIV, test "x$HAVE_GALLIUM_ETNAVIV" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_IMX, test "x$HAVE_GALLIUM_IMX" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_TEGRA, test "x$HAVE_GALLIUM_TEGRA" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_SOFTPIPE, test "x$HAVE_GALLIUM_SOFTPIPE" = xyes)
 AM_CONDITIONAL(HAVE_GALLIUM_LLVMPIPE, test "x$HAVE_GALLIUM_LLVMPIPE" = xyes)
@@ -2977,6 +3004,7 @@ AM_CONDITIONAL(HAVE_AMD_DRIVERS, test "x$HAVE_GALLIUM_RADEONSI" = xyes -o \
 AM_CONDITIONAL(HAVE_BROADCOM_DRIVERS, test "x$HAVE_GALLIUM_VC4" = xyes -o \
                                       "x$HAVE_GALLIUM_V3D" = xyes)
 AM_CONDITIONAL(HAVE_FREEDRENO_DRIVERS, test "x$HAVE_GALLIUM_FREEDRENO" = xyes)
 AM_CONDITIONAL(HAVE_INTEL_DRIVERS, test "x$HAVE_INTEL_VULKAN" = xyes -o \
                                         "x$HAVE_I965_DRI" = xyes)
@@ -3023,7 +3051,7 @@ AC_SUBST([XVMC_MAJOR], 1)
 AC_SUBST([XVMC_MINOR], 0)
 AC_SUBST([XA_MAJOR], 2)
 AC_SUBST([XA_MINOR], 4)
 AC_SUBST([XA_MINOR], 5)
 AC_SUBST([XA_PATCH], 0)
 AC_SUBST([XA_VERSION], "$XA_MAJOR.$XA_MINOR.$XA_PATCH")
@@ -3069,6 +3097,7 @@ AC_CONFIG_FILES([Makefile
                  src/amd/vulkan/Makefile
                  src/broadcom/Makefile
                  src/compiler/Makefile
                  src/freedreno/Makefile
                  src/egl/Makefile
                  src/egl/main/egl.pc
                  src/egl/wayland/wayland-drm/Makefile
@@ -3079,7 +3108,7 @@ AC_CONFIG_FILES([Makefile
                  src/gallium/drivers/i915/Makefile
                  src/gallium/drivers/llvmpipe/Makefile
                  src/gallium/drivers/nouveau/Makefile
                  src/gallium/drivers/pl111/Makefile
                  src/gallium/drivers/kmsro/Makefile
                  src/gallium/drivers/r300/Makefile
                  src/gallium/drivers/r600/Makefile
                  src/gallium/drivers/radeonsi/Makefile
@@ -3088,7 +3117,6 @@ AC_CONFIG_FILES([Makefile
                  src/gallium/drivers/swr/Makefile
                  src/gallium/drivers/tegra/Makefile
                  src/gallium/drivers/etnaviv/Makefile
                  src/gallium/drivers/imx/Makefile
                  src/gallium/drivers/v3d/Makefile
                  src/gallium/drivers/vc4/Makefile
                  src/gallium/drivers/virgl/Makefile
@@ -3123,11 +3151,10 @@ AC_CONFIG_FILES([Makefile
                  src/gallium/tests/trivial/Makefile
                  src/gallium/tests/unit/Makefile
                  src/gallium/winsys/etnaviv/drm/Makefile
                  src/gallium/winsys/imx/drm/Makefile
                  src/gallium/winsys/freedreno/drm/Makefile
                  src/gallium/winsys/i915/drm/Makefile
                  src/gallium/winsys/nouveau/drm/Makefile
                  src/gallium/winsys/pl111/drm/Makefile
                  src/gallium/winsys/kmsro/drm/Makefile
                  src/gallium/winsys/radeon/drm/Makefile
                  src/gallium/winsys/amdgpu/drm/Makefile
                  src/gallium/winsys/svga/drm/Makefile
@@ -3172,6 +3199,7 @@ AC_CONFIG_FILES([Makefile
                  src/mesa/main/tests/Makefile
                  src/mesa/state_tracker/tests/Makefile
                  src/util/Makefile
                  src/util/tests/fast_idiv_by_const/Makefile
                  src/util/tests/hash_table/Makefile
                  src/util/tests/set/Makefile
                  src/util/tests/string_buffer/Makefile
@@ -3343,7 +3371,7 @@ if test "x$enable_llvm" = xyes; then
     echo "        LLVM_LDFLAGS:    $LLVM_LDFLAGS"
     echo ""
 fi
 echo "        PYTHON2:         $PYTHON2"
 echo "        PYTHON:         $PYTHON"
 echo ""
 echo "        Run '${MAKE-make}' to build Mesa"

									
										13

docs/autoconf.html
									
												View File
												
				@@ -26,6 +26,12 @@

				  </ul>

				</ol>

				<h2>ATTENTION:</h2>

				<p>

				    The autotools build is being replaced by the <a href="meson.html">meson</a>

				    build system. If you haven't yet now is a good time to try using meson and

				    report any issues you run into.

				</p>

				<h2 id="basic">1. Basic Usage</h2>

				@@ -94,6 +100,13 @@ Currently there's only one config file provided when dri drivers are

				enabled - it's <code>drirc</code>.</p>

				</dd>

				<dt><code>--datadir=DIR</code></dt>

				<dd><p>This option specifies the directory where the data files will

				be installed. The default is <code>${prefix}/share</code>.

				Currently when dri drivers are enabled, <code>drirc.d/</code> is at

				this place.</p>

				</dd>

				<dt><code>--enable-static, --disable-shared</code></dt>

				<dd><p>By default, Mesa

				will build shared libraries. Either of these options will force static

									
										6

docs/download.html
									
												View File
												
				@@ -102,9 +102,9 @@ In the past, GLUT, GLU and the Mesa demos were released in conjunction with

				Mesa releases.  But since GLUT, GLU and the demos change infrequently, they

				were split off into their own git repositories:

				<a href="https://cgit.freedesktop.org/mesa/glut/">GLUT</a>,

				<a href="https://cgit.freedesktop.org/mesa/glu/">GLU</a> and

				<a href="https://cgit.freedesktop.org/mesa/demos/">Demos</a>,

				<a href="https://gitlab.freedesktop.org/mesa/glut">GLUT</a>,

				<a href="https://gitlab.freedesktop.org/mesa/glu">GLU</a> and

				<a href="https://gitlab.freedesktop.org/mesa/demos">Demos</a>,

				</p>

				</div>

									
										7

docs/envvars.html
									
												View File
												
				@@ -146,8 +146,8 @@ your system. For example under the default settings you may end up with a 1GB

				cache for x86_64 and another 1GB cache for i386.

				<li>MESA_GLSL_CACHE_DIR - if set, determines the directory to be used

				for the on-disk cache of compiled GLSL programs. If this variable is

				not set, then the cache will be stored in $XDG_CACHE_HOME/mesa (if

				that variable is set), or else within .cache/mesa within the user's

				not set, then the cache will be stored in $XDG_CACHE_HOME/mesa_shader_cache (if

				that variable is set), or else within .cache/mesa_shader_cache within the user's

				home directory.

				<li>MESA_GLSL - <a href="shading.html#envvars">shading language compiler options</a>

				<li>MESA_NO_MINMAX_CACHE - when set, the minmax index cache is globally disabled.

				@@ -338,6 +338,9 @@ See src/mesa/state_tracker/st_debug.c for other options.

				for details.

				<li>SVGA_EXTRA_LOGGING - if set, enables extra logging to the vmware.log file,

				such as the OpenGL program's name and command line arguments.

				<li>SVGA_NO_LOGGING - if set, disables logging to the vmware.log file.

				This is useful when using Valgrind because it otherwise crashes when

				initializing the host log feature.

				<li>See the driver code for other, lesser-used variables.

				</ul>

68

docs/features.txt

View File

@@ -196,7 +196,7 @@ GL 4.4, GLSL 4.40 -- all DONE: i965/gen8+, nvc0, r600, radeonsi
   GL_MAX_VERTEX_ATTRIB_STRIDE                           DONE (all drivers)
   GL_ARB_buffer_storage                                 DONE (freedreno, i965, nv50, llvmpipe, swr)
   GL_ARB_clear_texture                                  DONE (i965, nv50, llvmpipe, softpipe, swr)
   GL_ARB_enhanced_layouts                               DONE (i965, nv50, llvmpipe, softpipe)
   GL_ARB_enhanced_layouts                               DONE (i965, nv50, llvmpipe, softpipe, virgl)
   - compile-time constant expressions                   DONE
   - explicit byte offsets for blocks                    DONE
   - forced alignment within blocks                      DONE
@@ -219,7 +219,7 @@ GL 4.5, GLSL 4.50 -- all DONE: nvc0, radeonsi
   GL_ARB_direct_state_access                            DONE (all drivers)
   GL_ARB_get_texture_sub_image                          DONE (all drivers)
   GL_ARB_shader_texture_image_samples                   DONE (i965, nv50, r600, virgl)
   GL_ARB_texture_barrier                                DONE (freedreno, i965, nv50, r600)
   GL_ARB_texture_barrier                                DONE (freedreno, i965, nv50, r600, virgl)
   GL_KHR_context_flush_control                          DONE (all - but needs GLX/EGL extension to be useful)
   GL_KHR_robustness                                     DONE (i965)
   GL_EXT_shader_integer_mix                             DONE (all drivers that support GLSL)
@@ -308,7 +308,7 @@ Khronos, ARB, and OES extensions that are not part of any OpenGL or OpenGL ES ve
   GL_ARB_sample_locations                               DONE (nvc0)
   GL_ARB_seamless_cubemap_per_texture                   DONE (freedreno, i965, nvc0, radeonsi, r600, softpipe, swr, virgl)
   GL_ARB_shader_ballot                                  DONE (i965/gen8+, nvc0, radeonsi)
   GL_ARB_shader_clock                                   DONE (i965/gen7+, nv50, nvc0, r600, radeonsi)
   GL_ARB_shader_clock                                   DONE (i965/gen7+, nv50, nvc0, r600, radeonsi, virgl)
   GL_ARB_shader_stencil_export                          DONE (i965/gen9+, r600, radeonsi, softpipe, llvmpipe, swr, virgl)
   GL_ARB_shader_viewport_layer_array                    DONE (i965/gen6+, nvc0, radeonsi)
   GL_ARB_sparse_buffer                                  DONE (radeonsi/CIK+)
@@ -319,13 +319,14 @@ Khronos, ARB, and OES extensions that are not part of any OpenGL or OpenGL ES ve
   GL_EXT_memory_object                                  DONE (radeonsi)
   GL_EXT_memory_object_fd                               DONE (radeonsi)
   GL_EXT_memory_object_win32                            not started
   GL_EXT_render_snorm                                   DONE (i965, radeonsi)
   GL_EXT_semaphore                                      DONE (radeonsi)
   GL_EXT_semaphore_fd                                   DONE (radeonsi)
   GL_EXT_semaphore_win32                                not started
   GL_EXT_texture_norm16                                 DONE (i965, r600, radeonsi, nvc0)
   GL_KHR_blend_equation_advanced_coherent               DONE (i965/gen9+)
   GL_KHR_texture_compression_astc_hdr                   DONE (i965/bxt)
   GL_KHR_texture_compression_astc_sliced_3d             DONE (i965/gen9+)
   GL_KHR_texture_compression_astc_sliced_3d             DONE (i965/gen9+, radeonsi)
   GL_OES_depth_texture_cube_map                         DONE (all drivers that support GLSL 1.30+)
   GL_OES_EGL_image                                      DONE (all drivers)
   GL_OES_EGL_image_external                             DONE (all drivers)
@@ -337,12 +338,69 @@ Khronos, ARB, and OES extensions that are not part of any OpenGL or OpenGL ES ve
   GL_OES_texture_float_linear                           DONE (freedreno, i965, r300, r600, radeonsi, nv30, nv50, nvc0, softpipe, llvmpipe)
   GL_OES_texture_half_float                             DONE (freedreno, i965, r300, r600, radeonsi, nv30, nv50, nvc0, softpipe, llvmpipe)
   GL_OES_texture_half_float_linear                      DONE (freedreno, i965, r300, r600, radeonsi, nv30, nv50, nvc0, softpipe, llvmpipe)
   GL_OES_texture_view                                   DONE (i965/gen8+)
   GL_OES_texture_view                                   DONE (freedreno, i965/gen8+, r600, radeonsi, nv50, nvc0, softpipe, llvmpipe, swr)
   GL_OES_viewport_array                                 DONE (i965, nvc0, radeonsi)
   GLX_ARB_context_flush_control                         not started
   GLX_ARB_robustness_application_isolation              not started
   GLX_ARB_robustness_share_group_isolation              not started
 GL_EXT_direct_state_access subfeatures (in the spec order):
   GL 1.1: Client commands                               not started
   GL 1.0-1.3: Matrix and transpose matrix commands      not started
   GL 1.1-1.2: Texture commands                          not started
   GL 1.2: 3D texture commands                           not started
   GL 1.2.1: Multitexture commands                       not started
   GL 1.2.1-3.0: Indexed texture commands                not started
   GL 1.2.1-3.0: Indexed generic queries                 not started
   GL 1.2.1: EnableIndexed.. Get*Indexed                 not started
   GL_ARB_vertex_program                                 not started
   GL 1.3: Compressed texture and multitexture commands  not started
   GL 1.5: Buffer commands                               not started
   GL 2.0-2.1: Uniform and uniform matrix commands       not started
   GL_EXT_texture_buffer_object                          not started
   GL_EXT_texture_integer                                not started
   GL_EXT_gpu_shader4                                    not started
   GL_EXT_gpu_program_parameters                         not started
   GL_NV_gpu_program4                                    n/a
   GL_NV_framebuffer_multisample_coverage                n/a
   GL 3.0: Renderbuffer/framebuffer commands, Gen*Mipmap not started
   GL 3.0: CopyBuffer command                            not started
   GL_EXT_geometry_shader4 commands (expose in GL 3.2)   not started
   GL_NV_explicit_multisample                            n/a
   GL 3.0: Vertex array/attrib/query/map commands        not started
   Matrix GL tokens                                      not started
 GL_EXT_direct_state_access additions from other extensions (complete list):
   GL_AMD_framebuffer_sample_positions                   n/a
   GL_AMD_gpu_shader_int64                               not started
   GL_ARB_bindless_texture                               not started
   GL_ARB_buffer_storage                                 not started
   GL_ARB_clear_buffer_object                            not started
   GL_ARB_framebuffer_no_attachments                     not started
   GL_ARB_gpu_shader_fp64                                not started
   GL_ARB_instanced_arrays                               not started
   GL_ARB_internalformat_query2                          not started
   GL_ARB_sparse_texture                                 n/a
   GL_ARB_sparse_buffer                                  not started
   GL_ARB_texture_buffer_range                           not started
   GL_ARB_texture_storage                                not started
   GL_ARB_texture_storage_multisample                    not started
   GL_ARB_vertex_attrib_64bit                            not started
   GL_ARB_vertex_attrib_binding                          not started
   GL_EXT_buffer_storage                                 not started
   GL_EXT_external_buffer                                not started
   GL_EXT_separate_shader_objects                        n/a
   GL_EXT_sparse_texture                                 n/a
   GL_EXT_texture_storage                                n/a
   GL_EXT_vertex_attrib_64bit                            not started
   GL_EXT_EGL_image_storage                              n/a
   GL_NV_bindless_texture                                n/a
   GL_NV_gpu_shader5                                     n/a
   GL_NV_texture_multisample                             n/a
   GL_NV_vertex_buffer_unified_memory                    n/a
   GL_NVX_linked_gpu_multicast                           n/a
   GLX_NV_copy_buffer                                    n/a
 The following extensions are not part of any OpenGL or OpenGL ES version, and
 we DO NOT WANT implementations of these extensions for Mesa.

									
										2

docs/helpwanted.html
									
												View File
												
				@@ -47,7 +47,7 @@ You can find some further To-do lists here:

				<b>Common To-Do lists:</b>

				</p>

				<ul>

				  <li><a href="https://cgit.freedesktop.org/mesa/mesa/tree/docs/features.txt">

				  <li><a href="https://gitlab.freedesktop.org/mesa/mesa/blob/master/docs/features.txt">

				    <b>features.txt</b></a> - Status of OpenGL 3.x / 4.x features in Mesa.</li>

				</ul>

									
										102

docs/index.html
									
												View File
												
				@@ -15,6 +15,108 @@

				<div class="content">

				<h1>News</h1>

				<h2>January 17, 2019</h2>

				<p>

				<a href="relnotes/18.3.2.html">Mesa 18.3.2</a> is released.

				This is a bug-fix release.

				</p>

				<h2>December 27, 2018</h2>

				<p>

				<a href="relnotes/18.2.8.html">Mesa 18.2.8</a> is released.

				This is a bug-fix release.

				<br>

				NOTE: It is anticipated that 18.2.8 will be the final release in the

				18.2 series. Users of 18.2 are encouraged to migrate to the 18.3

				series in order to obtain future fixes.

				</p>

				<h2>December 13, 2018</h2>

				<p>

				<a href="relnotes/18.2.7.html">Mesa 18.2.7</a> is released.

				This is a bug-fix release.

				</p>

				<h2>December 11, 2018</h2>

				<p>

				<a href="relnotes/18.3.1.html">Mesa 18.3.1</a> is released.

				This is a bug-fix release.

				</p>

				<h2>December 7, 2018</h2>

				<p>

				<a href="relnotes/18.3.0.html">Mesa 18.3.0</a> is released.  This is a

				new development release.  See the release notes for more information

				about the release.

				</p>

				<h2>November 28, 2018</h2>

				<p>

				<a href="relnotes/18.2.6.html">Mesa 18.2.6</a> is released.

				This is a bug-fix release.

				</p>

				<h2>November 15, 2018</h2>

				<p>

				<a href="relnotes/18.2.5.html">Mesa 18.2.5</a> is released.

				This is a bug-fix release.

				</p>

				<h2>October 31, 2018</h2>

				<p>

				<a href="relnotes/18.2.4.html">Mesa 18.2.4</a> is released.

				This is a bug-fix release.

				</p>

				<h2>October 19, 2018</h2>

				<p>

				<a href="relnotes/18.2.3.html">Mesa 18.2.3</a> is released.

				This is a bug-fix release.

				</p>

				<h2>October 5, 2018</h2>

				<p>

				<a href="relnotes/18.2.2.html">Mesa 18.2.2</a> is released.

				This is a bug-fix release.

				</p>

				<h2>September 24, 2018</h2>

				<p>

				<a href="relnotes/18.1.9.html">Mesa 18.1.9</a> is released.

				This is a bug-fix release.

				<br>

				NOTE: It is anticipated that 18.1.9 will be the final release in the

				18.1 series. Users of 18.1 are encouraged to migrate to the 18.2

				series in order to obtain future fixes.

				</p>

				<h2>September 21, 2018</h2>

				<p>

				<a href="relnotes/18.2.1.html">Mesa 18.2.1</a> is released.

				This is a bug-fix release.

				</p>

				<h2>September 7, 2018</h2>

				<p>

				<a href="relnotes/18.1.8.html">Mesa 18.1.8</a> and

				<a href="relnotes/18.2.0.html">Mesa 18.2.0</a> are released.

				These are, respectively, a bug-fix release from the 18.1 branch and a

				new development release.  See the release notes for more information

				about the releases.

				</p>

				<h2>August 24, 2018</h2>

				<p>

				<a href="relnotes/18.1.7.html">Mesa 18.1.7</a> is released.

				This is a bug-fix release.

				</p>

				<h2>August 13, 2018</h2>

				<p>

				<a href="relnotes/18.1.6.html">Mesa 18.1.6</a> is released.

				This is a bug-fix release.

				</p>

				<h2>July 27, 2018</h2>

				<p>

									
										46

docs/install.html
									
												View File
												
				@@ -22,6 +22,7 @@

				  <li><a href="#prereq-general">General prerequisites</a>

				  <li><a href="#prereq-dri">For DRI and hardware acceleration</a>

				  </ul>

				<li><a href="#meson">Building with meson</a>

				<li><a href="#autoconf">Building with autoconf (Linux/Unix/X11)</a>

				<li><a href="#scons">Building with SCons (Windows/Linux)</a>

				<li><a href="#android">Building with AOSP (Android)</a>

				@@ -39,9 +40,10 @@ Build system.

				</p>

				<ul>

				<li>Autoconf is required when building on *nix platforms.

				<li><a href="https://mesonbuild.com">meson</a> is recommended when building on *nix platforms.

				<li>Autoconf is another option when building on *nix platforms.

				<li><a href="http://www.scons.org/">SCons</a> is required for building on

				Windows and optional for Linux (it's an alternative to autoconf/automake.)

				Windows and optional for Linux (it's an alternative to autoconf/automake or meson.)

				</li>

				<li>Android Build system when building as native Android component. Autoconf

				is used when when building ARC.

				@@ -57,7 +59,7 @@ willing to maintain support for other compiler get in touch.

				<ul>

				<li>GCC 4.2.0 or later (some parts of Mesa may require later versions)

				<li>clang - exact minimum requirement is currently unknown.

				<li>Microsoft Visual Studio 2013 Update 4 or later is required, for building on Windows.

				<li>Microsoft Visual Studio 2015 or later is required, for building on Windows.

				</ul>

				@@ -72,7 +74,9 @@ you think you've spotted a bug let developers know by filing a

				<ul>

				<li><a href="https://www.python.org/">Python</a> - Python is required.

				Version 2.6.4 or later should work.

				When building with scons 2.7 is required.

				When building with meson 3.5 or newer is required.

				When building with autotools 2.7, or 3.5 or later are required.

				</li>

				<li><a href="http://www.makotemplates.org/">Python Mako module</a> -

				Python Mako module is required. Version 0.8.0 or later should work.

				@@ -111,11 +115,31 @@ the packaging tool used by your distro.

				  ... # others

				</pre>

				<h1 id="autoconf">2. Building with autoconf (Linux/Unix/X11)</h1>

				<h1 id="meson">2. Building with meson</h1>

				<p>

				The primary method to build Mesa on Unix systems is with autoconf.

				Meson is the latest build system in mesa, it is currently able to build for

				*nix systems like Linux and BSD, and will be able to build for windows as well.

				</p>

				<p>

				The general approach is:

				</p>

				<pre>

				  meson builddir/

				  ninja -C builddir/

				  sudo ninja -C builddir/ install

				</pre>

				<p>

				Please read the <a href="meson.html">detailed meson instructions</a>

				for more information

				</p>

				<h1 id="autoconf">3. Building with autoconf (Linux/Unix/X11)</h1>

				<p>

				Although meson is recommended, another supported way to build on *nix systems

				is with autoconf.

				</p>

				<p>

				@@ -133,7 +157,7 @@ for more details.

				<h1 id="scons">3. Building with SCons (Windows/Linux)</h1>

				<h1 id="scons">4. Building with SCons (Windows/Linux)</h1>

				<p>

				To build Mesa with SCons on Linux or Windows do

				@@ -169,7 +193,7 @@ Additional information is available in <a href="README.WIN32">README.WIN32</a>.

				<h1 id="android">4. Building with AOSP (Android)</h1>

				<h1 id="android">5. Building with AOSP (Android)</h1>

				<p>

				Currently one can build Mesa for Android as part of the AOSP project, yet

				@@ -188,7 +212,7 @@ Android-x86 and/or other resources.

				</p>

				<h1 id="libs">5. Library Information</h1>

				<h1 id="libs">6. Library Information</h1>

				<p>

				When compilation has finished, look in the top-level <code>lib/</code>

				@@ -226,7 +250,7 @@ versions of libGL and device drivers.

				</p>

				<h1 id="pkg-config">6. Building OpenGL programs with pkg-config</h1>

				<h1 id="pkg-config">7. Building OpenGL programs with pkg-config</h1>

				<p>

				Running <code>make install</code> will install package configuration files

									
										3

docs/mesa.css
									
												View File
												
				@@ -29,6 +29,9 @@ pre {

					/*font-family: monospace;*/

					font-size: 10pt;

					/*color: black;*/

					background-color: #eee;

					margin-left: 2em;

					padding: .5em;

				}

				iframe {

									
										193

docs/meson.html
									
												View File
												
				@@ -16,15 +16,20 @@

				<h1>Compilation and Installation using Meson</h1>

				<ul>

				  <li><a href="#basic">Basic Usage</a></li>

				  <li><a href="#cross-compilation">Cross-compilation and 32-bit builds</a></li>

				</ul>

				<h2 id="basic">1. Basic Usage</h2>

				<p><strong>The Meson build system is generally considered stable and ready

				for production</strong></p>

				<p>The meson build is tested on on Linux, macOS, Cygwin and Haiku, it should

				work on FreeBSD, DragonflyBSD, NetBSD, and OpenBSD.</p>

				<p>The meson build is tested on Linux, macOS, Cygwin and Haiku, FreeBSD,

				DragonflyBSD, NetBSD, and should work on OpenBSD.</p>

				<p><strong>Mesa requires Meson >= 0.44.1 to build.</strong>

				<p><strong>Mesa requires Meson >= 0.45.0 to build.</strong>

				Some older versions of meson do not check that they are too old and will error

				out in odd ways.

				@@ -48,9 +53,13 @@ To see a description of your options you can run <code>meson configure</code>

				along with a build directory to view the selected options for. This will show

				your meson global arguments and project arguments, along with their defaults

				and your local settings.

				</p>

				<p>

				Meson does not currently support listing options before configure a build

				directory, but this feature is being discussed upstream.

				For now, the only way to see what options exist is to look at the

				<code>meson_options.txt</code> file at the root of the project.

				</p>

				<pre>

				@@ -95,43 +104,96 @@ each configuration you want to build

				recommended in the documentation</a>

				</p>

				<p>

				Autotools automatically updates translation files as part of the build process,

				meson does not do this. Instead if you want translated drirc files you will need 

				to invoke non-default targets for ninja to update them:

				<code>ninja -C build/ xmlpool-pot xmlpool-update-po xmlpool-gmo</code>

				</p>

				<dl>

				<dt><code>Environment Variables</code></dt>

				<dd><p>Meson supports the standard CC and CXX environment variables for

				changing the default compiler, and CFLAGS, CXXFLAGS, and LDFLAGS for setting

				options to the compiler and linker.

				changing the default compiler. Meson does support CFLAGS, CXXFLAGS, etc. But

				their use is discouraged because of the many caveats in using them. Instead it

				is recomended to use <code>-D${lang}_args</code> and

				<code>-D${lang}_link_args</code> instead. Among the benefits of these options

				is that they are guaranteed to persist across rebuilds and reconfigurations.

				The default compilers depends on your operating system. Meson supports most of

				the popular compilers, a complete list is available

				<a href="http://mesonbuild.com/Reference-tables.html#compiler-ids">here</a>.

				These arguments are consumed and stored by meson when it is initialized or

				re-initialized. Therefore passing them to meson configure will not do anything,

				and passing them to ninja will only do something if ninja decides to

				re-initialize meson, for example, if a meson.build file has been changed.

				Changing these variables will not cause all targets to be rebuilt, so running

				ninja clean is recommended when changing CFLAGS or CXXFLAGS. Meson will never

				change compiler in a configured build directory.

				Meson does not allow changing compiler in a configured builddir, you will need

				to create a new build dir for a different compiler.

				</p>

				<pre>

				    CC=clang CXX=clang++ meson build-clang

				    ninja -C build-clang

				    ninja -C build-clang clean

				    touch meson.build

				    CFLAGS=-Wno-typedef-redefinition ninja -C build-clang

				    meson configure build -Dc_args="-Wno-typedef-redefinition"

				    ninja -C build-clang

				</pre>

				<p>

				The default compilers depends on your operating system. Meson supports most of

				the popular compilers, a complete list is available

				<a href="http://mesonbuild.com/Reference-tables.html#compiler-ids">here</a>.

				</p>

				<p>Meson also honors <code>DESTDIR</code> for installs</p>

				</dd>

				<dt><code>LLVM</code></dt>

				<dd><p>Meson includes upstream logic to wrap llvm-config using it's standard

				dependency interface. It will search <code>$PATH</code> (or <code>%PATH%</code> on windows) for

				llvm-config, so using an LLVM from a non-standard path is as easy as

				<code>PATH=/path/with/llvm-config:$PATH meson build</code>.

				<dd><p>Meson includes upstream logic to wrap llvm-config using its standard

				dependency interface.

				</p></dd>

				<dd><p>

				As of meson 0.49.0 meson also has the concept of a

				<a href="https://mesonbuild.com/Native-environments.html">"native file"</a>,

				these files provide information about the native build environment (as opposed

				to a cross build environment). They are ini formatted and can override where to

				find llvm-config:

				custom-llvm.ini

				<pre>

				    [binaries]

				    llvm-config = '/usr/local/bin/llvm/llvm-config'

				</pre>

				Then configure meson:

				<pre>

				    meson builddir/ --native-file custom-llvm.ini

				</pre>

				</p></dd>

				<dd><p>

				For selecting llvm-config for cross compiling a

				<a href="https://mesonbuild.com/Cross-compilation.html#defining-the-environment">"cross file"</a>

				should be used. It uses the same format as the native file above:

				cross-llvm.ini

				<pre>

				    [binaries]

				    ...

				    llvm-config = '/usr/lib/llvm-config-32'

				</pre>

				Then configure meson:

				<pre>

				    meson builddir/ --cross-file cross-llvm.ini

				</pre>

				See the <a href="#cross-compilation">Cross Compilation</a> section for more information.

				</dd></p>

				<dd><p>

				For older versions of meson <code>$PATH</code> (or <code>%PATH%</code> on

				windows) will be searched for llvm-config (and llvm-config$version and

				llvm-config-$version), you can override this environment variable to control

				the search: <code>PATH=/path/with/llvm-config:$PATH meson build</code>.

				</dd></p>

				</dl>

				<dl>

				@@ -182,6 +244,93 @@ is unrelated to the <code>buildtype</code>; setting the latter to

				</dd>

				</dl>

				<h2 id="cross-compilation">2. Cross-compilation and 32-bit builds</h2>

				<p><a href="https://mesonbuild.com/Cross-compilation.html">Meson supports

				cross-compilation</a> by specifying a number of binary paths and

				settings in a file and passing this file to <code>meson</code> or

				<code>meson configure</code> with the <code>--cross-file</code>

				parameter.</p>

				<p>This file can live at any location, but you can use the bare filename

				(without the folder path) if you put it in $XDG_DATA_HOME/meson/cross or

				~/.local/share/meson/cross</p>

				<p>Below are a few example of cross files, but keep in mind that you

				will likely have to alter them for your system.</p>

				<p>

				Those running on ArchLinux can use the AUR-maintained packages for some

				of those, as they'll have the right values for your system:

				<ul>

				  <li><a href="https://aur.archlinux.org/packages/meson-cross-x86-linux-gnu">meson-cross-x86-linux-gnu</a></li>

				  <li><a href="https://aur.archlinux.org/packages/meson-cross-aarch64-linux-gnu">meson-cross-aarch64-linux-gnu</a></li>

				</ul>

				</p>

				<p>

				32-bit build on x86 linux:

				<pre>

				[binaries]

				c = '/usr/bin/gcc'

				cpp = '/usr/bin/g++'

				ar = '/usr/bin/gcc-ar'

				strip = '/usr/bin/strip'

				pkgconfig = '/usr/bin/pkg-config-32'

				llvm-config = '/usr/bin/llvm-config32'

				[properties]

				c_args = ['-m32']

				c_link_args = ['-m32']

				cpp_args = ['-m32']

				cpp_link_args = ['-m32']

				[host_machine]

				system = 'linux'

				cpu_family = 'x86'

				cpu = 'i686'

				endian = 'little'

				</pre>

				</p>

				<p>

				64-bit build on ARM linux:

				<pre>

				[binaries]

				c = '/usr/bin/aarch64-linux-gnu-gcc'

				cpp = '/usr/bin/aarch64-linux-gnu-g++'

				ar = '/usr/bin/aarch64-linux-gnu-gcc-ar'

				strip = '/usr/bin/aarch64-linux-gnu-strip'

				pkgconfig = '/usr/bin/aarch64-linux-gnu-pkg-config'

				exe_wrapper = '/usr/bin/qemu-aarch64-static'

				[host_machine]

				system = 'linux'

				cpu_family = 'aarch64'

				cpu = 'aarch64'

				endian = 'little'

				</pre>

				</p>

				<p>

				64-bit build on x86 windows:

				<pre>

				[binaries]

				c = '/usr/bin/x86_64-w64-mingw32-gcc'

				cpp = '/usr/bin/x86_64-w64-mingw32-g++'

				ar = '/usr/bin/x86_64-w64-mingw32-ar'

				strip = '/usr/bin/x86_64-w64-mingw32-strip'

				pkgconfig = '/usr/bin/x86_64-w64-mingw32-pkg-config'

				exe_wrapper = 'wine'

				[host_machine]

				system = 'windows'

				cpu_family = 'x86_64'

				cpu = 'i686'

				endian = 'little'

				</pre>

				</p>

				</div>

				</body>

				</html>

									
										137

docs/release-calendar.html
									
												View File
												
				@@ -23,6 +23,16 @@ Mesa provides feature/development and stable releases.

				The table below lists the date and release manager that is expected to do the

				specific release.

				<br>

				Regular updates will ensure that the schedule for the current and the

				next two feature releases are shown in the table.

				<br>

				In order to keep the whole releasing team up to date with the tools

				used, best practices and other details, the member in charge of the

				next feature release will be in constant rotation.

				<br>

				The way the release schedule works is

				explained <a href="releasing.html#schedule" target="_parent">here</a>.

				<br>

				Take a look <a href="submittingpatches.html#criteria" target="_parent">here</a>

				if you'd like to nominate a patch in the next stable release.

				</p>

				@@ -39,47 +49,128 @@ if you'd like to nominate a patch in the next stable release.

				<th>Notes</th>

				</tr>

				<tr>

				<td rowspan="3">18.1</td>

				<td>2018-08-10</td>

				<td>18.1.6</td>

				<td rowspan="4">18.3</td>

				<td>2019-01-30</td>

				<td>18.3.3</td>

				<td>Emil Velikov</td>

				<td>

				</tr>

				<tr>

				<td>2019-02-13</td>

				<td>18.3.4</td>

				<td>Emil Velikov</td>

				<td>

				</tr>

				<tr>

				<td>2019-02-27</td>

				<td>18.3.5</td>

				<td>Emil Velikov</td>

				<td>

				</tr>

				<tr>

				<td>2019-03-13</td>

				<td>18.3.6</td>

				<td>Emil Velikov</td>

				<td>Last planned 18.3.x release</td>

				</tr>

				<tr>

				<td rowspan="4">19.0</td>

				<td>2019-01-29</td>

				<td>19.0.0-rc1</td>

				<td>Dylan Baker</td>

				<td></td>

				<td>

				</tr>

				<tr>

				<td>2018-08-24</td>

				<td>18.1.7</td>

				<td>2019-02-05</td>

				<td>19.0.0-rc2</td>

				<td>Dylan Baker</td>

				<td></td>

				<td>

				</tr>

				<tr>

				<td>2018-09-07</td>

				<td>18.1.8</td>

				<td>2019-02-12</td>

				<td>19.0.0-rc3</td>

				<td>Dylan Baker</td>

				<td>Last planned 18.1.x release</td>

				<td>

				</tr>

				<tr>

				<td rowspan="4">18.2</td>

				<td>2018-08-01</td>

				<td>18.2.0rc1</td>

				<td>Andres Gomez</td>

				<td></td>

				<td>2019-02-19</td>

				<td>19.0.0-rc4</td>

				<td>Dylan Baker</td>

				<td>Last planned RC/Final release</td>

				</tr>

				<tr>

				<td>2018-08-08</td>

				<td>18.2.0rc2</td>

				<td rowspan="4">19.1</td>

				<td>2019-04-30</td>

				<td>19.1.0-rc1</td>

				<td>Andres Gomez</td>

				<td></td>

				<td>

				</tr>

				<tr>

				<td>2018-08-15</td>

				<td>18.2.0rc3</td>

				<td>2019-05-07</td>

				<td>19.1.0-rc2</td>

				<td>Andres Gomez</td>

				<td></td>

				<td>

				</tr>

				<tr>

				<td>2018-08-22</td>

				<td>18.2.0rc4</td>

				<td>2019-05-14</td>

				<td>19.1.0-rc3</td>

				<td>Andres Gomez</td>

				<td>

				</tr>

				<tr>

				<td>2019-05-21</td>

				<td>19.1.0-rc4</td>

				<td>Andres Gomez</td>

				<td>Last planned RC/Final release</td>

				</tr>

				<tr>

				<td rowspan="4">19.2</td>

				<td>2019-08-06</td>

				<td>19.2.0-rc1</td>

				<td>Emil Velikov</td>

				<td>

				</tr>

				<tr>

				<td>2019-08-13</td>

				<td>19.2.0-rc2</td>

				<td>Emil Velikov</td>

				<td>

				</tr>

				<tr>

				<td>2019-08-20</td>

				<td>19.2.0-rc3</td>

				<td>Emil Velikov</td>

				<td>

				</tr>

				<tr>

				<td>2019-08-27</td>

				<td>19.2.0-rc4</td>

				<td>Emil Velikov</td>

				<td>Last planned RC/Final release</td>

				</tr>

				<tr>

				<td rowspan="4">19.3</td>

				<td>2019-10-15</td>

				<td>19.3.0-rc1</td>

				<td>Juan A. Suarez</td>

				<td>

				</tr>

				<tr>

				<td>2019-10-22</td>

				<td>19.3.0-rc2</td>

				<td>Juan A. Suarez</td>

				<td>

				</tr>

				<tr>

				<td>2019-10-29</td>

				<td>19.3.0-rc3</td>

				<td>Juan A. Suarez</td>

				<td>

				</tr>

				<tr>

				<td>2019-11-05</td>

				<td>19.3.0-rc4</td>

				<td>Juan A. Suarez</td>

				<td>Last planned RC/Final release</td>

				</tr>

				</table>

									
										71

docs/releasing.html
									
												View File
												
				@@ -21,6 +21,7 @@

				<li><a href="#overview">Overview</a>

				<li><a href="#schedule">Release schedule</a>

				<li><a href="#pickntest">Cherry-pick and test</a>

				<li><a href="#stagingbranch">Staging branch</a>

				<li><a href="#branch">Making a branchpoint</a>

				<li><a href="#prerelease">Pre-release announcement</a>

				<li><a href="#release">Making a new release</a>

				@@ -55,9 +56,10 @@ For example:

				<p>

				Releases should happen on Wednesdays. Delays can occur although those

				should be keep to a minimum.

				should be kept to a minimum.

				<br>

				See our <a href="release-calendar.html" target="_parent">calendar</a> for the

				See our <a href="release-calendar.html" target="_parent">calendar</a>

				for information about how the release schedule is planned, and the

				date and other details for individual releases.

				</p>

				@@ -66,6 +68,9 @@ date and other details for individual releases.

				<li>Available approximately every three months.

				<li>Initial timeplan available 2-4 weeks before the planned branchpoint (rc1)

				on the mesa-announce@ mailing list.

				<li>Typically, the final release will happen after 4

				candidates. Additional ones may be needed in order to resolve blocking

				regressions, though.

				<li>A <a href="#prerelease">pre-release</a> announcement should be available

				approximately 24 hours before the final (non-rc) release.

				</ul>

				@@ -83,6 +88,12 @@ Note: There is one or two releases overlap when changing branches. For example:

				<br>

				The final release from the 12.0 series Mesa 12.0.5 will be out around the same

				time (or shortly after) 13.0.1 is out.

				<br>

				This also involves that, as a final release may be delayed due to the

				need of additional candidates to solve some blocking regression(s),

				the release manager might have to update

				the <a href="release-calendar.html" target="_parent">calendar</a> with

				additional bug fix releases of the current stable branch.

				</p>

				@@ -111,18 +122,21 @@ the autoconf and scons build.

				<p>Done continuously up-to the <a href="#prerelease">pre-release</a> announcement.</p>

				<p>

				As an exception, patches can be applied up-to the last ~1h before the actual

				release. This is made <strong>only</strong> with explicit permission/request,

				and the patch <strong>must</strong> be very well contained. Thus it cannot

				affect more than one driver/subsystem.

				</p>

				<p>

				Currently Ilia Mirkin and AMD devs have requested "permanent" exception.

				Developers can request, <em>as an exception</em>, patches to be applied up-to

				the last one hour before the actual release. This is made <strong>only</strong>

				with explicit permission/request, and the patch <strong>must</strong> be very

				well contained. Thus it cannot affect more than one driver/subsystem.

				</p>

				<p>Following developers have requested permanent exception</p>

				<ul>

				<li>make distcheck, scons and scons check must pass

				<li><em>Ilia Mirkin</em>

				<li><em>AMD team</em>

				</ul>

				<p>The following must pass:</p>

				<ul>

				<li>make distcheck, scons and scons check

				<li>Testing with different version of system components - LLVM and others is also

				performed where possible.

				<li>As a general rule, testing with various combinations of configure

				@@ -130,9 +144,9 @@ switches, depending on the specific patchset.

				</ul>

				<p>

				Achieved by combination of local ad-hoc scripts, mingw-w64 cross

				compilation and AppVeyor plus Travis-CI, the latter as part of their

				Github integration.

				These are achieved by combination of <a href="basictesting">local testing</a>,

				which includes mingw-w64 cross compilation and AppVeyor plus Travis-CI, the

				latter two as part of their Github integration.

				</p>

				<p>

				@@ -209,6 +223,25 @@ system and making some every day's use until the release may be a good

				idea too.

				</p>

				<h1 id="stagingbranch">Staging branch</h1>

				<p>

				A live branch, which contains the currently merge/rejected patches is available

				in the main repository under <code>staging/X.Y</code>. For example:

				</p>

				<pre>

					staging/18.1 - WIP branch for the 18.1 series

					staging/18.2 - WIP branch for the 18.2 series

				</pre>

				<p>

				Notes:

				</p>

				<ul>

				<li>People are encouraged to test the staging branch and report regressions.</li>

				<li>The branch history is not stable and it <strong>will</strong> be rebased,</li>

				</ul>

				<h1 id="branch">Making a branchpoint</h1>

				@@ -425,7 +458,7 @@ Ensure the latest code is available - both in your local master and the

				relevant branch.

				</p>

				<h3>Perform basic testing</h3>

				<h3 id="basictesting">Perform basic testing</h3>

				<p>

				Most of the testing should already be done during the

				@@ -492,10 +525,10 @@ Here is one solution that I've been using.

					# Drop LLVM_CONFIG, if applicable:

					# unset LLVM_CONFIG

					__glxinfo_cmd='glxinfo 2>&amp;1 | egrep -o "Mesa.*|Gallium.*|.*dri\.so"'

					__glxgears_cmd='glxgears 2>&amp;1 | grep -v "configuration file"'

					__es2info_cmd='es2_info 2>&amp;1 | egrep "GL_VERSION|GL_RENDERER|.*dri\.so"'

					__es2gears_cmd='es2gears_x11 2>&amp;1 | grep -v "configuration file"'

					__glxinfo_cmd='glxinfo 2&gt;&amp;1 | egrep -o "Mesa.*|Gallium.*|.*dri\.so"'

					__glxgears_cmd='glxgears 2&gt;&amp;1 | grep -v "configuration file"'

					__es2info_cmd='es2_info 2&gt;&amp;1 | egrep "GL_VERSION|GL_RENDERER|.*dri\.so"'

					__es2gears_cmd='es2gears_x11 2&gt;&amp;1 | grep -v "configuration file"'

					test "x$LD_LIBRARY_PATH" != 'x' &amp;&amp; __old_ld="$LD_LIBRARY_PATH"

					export LD_LIBRARY_PATH=`pwd`/test/usr/local/lib/:"${__old_ld}"

					export LIBGL_DRIVERS_PATH=`pwd`/test/usr/local/lib/dri/

									
										16

docs/relnotes.html
									
												View File
												
				@@ -21,6 +21,22 @@ The release notes summarize what's new or changed in each Mesa release.

				</p>

				<ul>

				<li><a href="relnotes/18.3.2.html">18.3.2 release notes</a>

				<li><a href="relnotes/18.2.8.html">18.2.8 release notes</a>

				<li><a href="relnotes/18.2.7.html">18.2.7 release notes</a>

				<li><a href="relnotes/18.3.1.html">18.3.1 release notes</a>

				<li><a href="relnotes/18.3.0.html">18.3.0 release notes</a>

				<li><a href="relnotes/18.2.6.html">18.2.6 release notes</a>

				<li><a href="relnotes/18.2.5.html">18.2.5 release notes</a>

				<li><a href="relnotes/18.2.4.html">18.2.4 release notes</a>

				<li><a href="relnotes/18.2.3.html">18.2.3 release notes</a>

				<li><a href="relnotes/18.2.2.html">18.2.2 release notes</a>

				<li><a href="relnotes/18.1.9.html">18.1.9 release notes</a>

				<li><a href="relnotes/18.2.1.html">18.2.1 release notes</a>

				<li><a href="relnotes/18.2.0.html">18.2.0 release notes</a>

				<li><a href="relnotes/18.1.8.html">18.1.8 release notes</a>

				<li><a href="relnotes/18.1.7.html">18.1.7 release notes</a>

				<li><a href="relnotes/18.1.6.html">18.1.6 release notes</a>

				<li><a href="relnotes/18.1.5.html">18.1.5 release notes</a>

				<li><a href="relnotes/18.1.4.html">18.1.4 release notes</a>

				<li><a href="relnotes/18.1.3.html">18.1.3 release notes</a>

									
										188

docs/relnotes/18.1.6.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,188 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 18.1.6 Release Notes / August 13 2018</h1>

				<p>

				Mesa 18.1.6 is a bug fix release which fixes bugs found since the 18.1.5 release.

				</p>

				<p>

				Mesa 18.1.6 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				580e03328ffefe1fd43b19ab7669f20d931601a1c0a4c0f8b9c65d6e81a06df3  mesa-18.1.6.tar.gz

				bb7ce759069801804fcfb8152da3457f76cd7b4e0096e4870ff5adcb5c894289  mesa-18.1.6.tar.xz

				</pre>

				<h2>New features</h2>

				<p>None</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=13728">Bug 13728</a> - [G965] Some objects in Neverwinter Nights Linux version not displayed correctly</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=98699">Bug 98699</a> - &quot;float[a+++4 ? 1:1] f;&quot; crashes glsl_compiler</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=99730">Bug 99730</a> - Metro Redux game(s) needs override for midshader extension declaration</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106382">Bug 106382</a> - Shader cache breaks INTEL_DEBUG=shader_time</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107117">Bug 107117</a> - mesa-18.1: regression with TFP on intel with modesettings and glamor acceleration</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107212">Bug 107212</a> - Dual-Core CPU E5500 / G45: RetroArch with reicast core results in corrupted graphics</li>

				</ul>

				<h2>Changes</h2>

				<p>Adam Jackson (1):</p>

				<ul>

				  <li>glx: GLX_MESA_multithread_makecurrent is direct-only</li>

				</ul>

				<p>Andres Gomez (3):</p>

				<ul>

				  <li>ddebug: use util_snprintf() in dd_get_debug_filename_and_mkdir</li>

				  <li>gallium/aux/util: use util_snprintf() in test_texture_barrier</li>

				  <li>glsl: use util_snprintf()</li>

				</ul>

				<p>Christian Gmeiner (1):</p>

				<ul>

				  <li>etnaviv: fix typo in query names</li>

				</ul>

				<p>Dave Airlie (1):</p>

				<ul>

				  <li>r600: reduce num compute threads to 1024.</li>

				</ul>

				<p>Dylan Baker (6):</p>

				<ul>

				  <li>docs: Add sha-256 sums for 18.1.5</li>

				  <li>nir/meson: fix c vs cpp args for nir test</li>

				  <li>gallium: fix ddebug on windows</li>

				  <li>cherry-ignore: add patches that get-pick-list is finding in error</li>

				  <li>cherry-ignore: Add some additional patches that are for 18.2</li>

				  <li>bump version to 18.1.6</li>

				</ul>

				<p>Emil Velikov (5):</p>

				<ul>

				  <li>swr: don't export swr_create_screen_internal</li>

				  <li>automake: require shared glapi when using DRI based libGL</li>

				  <li>autotools: error out when using the broken --with-{gl, osmesa}-lib-name</li>

				  <li>autotools: error out when building with mangling and glvnd</li>

				  <li>autotools: use correct gl.pc LIBS when using glvnd</li>

				</ul>

				<p>Eric Anholt (4):</p>

				<ul>

				  <li>vc4: Fix a leak of the no-vertex-elements workaround BO.</li>

				  <li>vc4: Respect a sampler view's first_layer field.</li>

				  <li>vc4: Ignore samplers for finding uniform offsets.</li>

				  <li>egl: Fix leak of X11 pixmaps backing pbuffers in DRI3.</li>

				</ul>

				<p>Gert Wollny (1):</p>

				<ul>

				  <li>meson, install_megadrivers: Also remove stale symlinks</li>

				</ul>

				<p>Jan Vesely (2):</p>

				<ul>

				  <li>clover: Reduce wait_count in abort path.</li>

				  <li>clover: Don't extend illegal integer types.</li>

				</ul>

				<p>Jason Ekstrand (2):</p>

				<ul>

				  <li>nir: Take if uses into account in ssa_def_components_read</li>

				  <li>i965/fs: Flag all slots of a flat input as flat</li>

				</ul>

				<p>Jon Turney (1):</p>

				<ul>

				  <li>meson: use correct keyword to fix a meson warning</li>

				</ul>

				<p>Jordan Justen (2):</p>

				<ul>

				  <li>i965, anv: Use INTEL_DEBUG for disk_cache driver flags</li>

				  <li>i965: Disable shader cache with INTEL_DEBUG=shader_time</li>

				</ul>

				<p>Juan A. Suarez Romero (2):</p>

				<ul>

				  <li>wayland/egl: update surface size on window resize</li>

				  <li>wayland/egl: initialize window surface size to window size</li>

				</ul>

				<p>Karol Herbst (2):</p>

				<ul>

				  <li>nir/lower_int64: mark all metadata as dirty</li>

				  <li>nvc0/ir: return 0 in imageLoad on incomplete textures</li>

				</ul>

				<p>Kenneth Graunke (1):</p>

				<ul>

				  <li>intel: Fix SIMD16 unaligned payload GRF reads on Gen4-5.</li>

				</ul>

				<p>Marek Olšák (1):</p>

				<ul>

				  <li>ac/surface: fix MSAA corruption on Vega due to FMASK tile swizzle</li>

				</ul>

				<p>Mauro Rossi (2):</p>

				<ul>

				  <li>radv: generate entrypoints for VK_ANDROID_native_buffer</li>

				  <li>radv: move vk_format_table.c to generated sources</li>

				</ul>

				<p>Olivier Fourdan (1):</p>

				<ul>

				  <li>dri3: For 1.2, use root window instead of pixmap drawable</li>

				</ul>

				<p>Tapani Pälli (1):</p>

				<ul>

				  <li>glsl: handle error case with ast_post_inc, ast_post_dec</li>

				</ul>

				<p>Vlad Golovkin (1):</p>

				<ul>

				  <li>swr: Remove unnecessary memset call</li>

				</ul>

				<p>vadym.shovkoplias (1):</p>

				<ul>

				  <li>drirc: Allow extension midshader for Metro Redux</li>

				</ul>

				</div>

				</body>

				</html>

									
										104

docs/relnotes/18.1.7.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,104 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 18.1.7 Release Notes / August 24 2018</h1>

				<p>

				Mesa 18.1.7 is a bug fix release which fixes bugs found since the 18.1.6 release.

				</p>

				<p>

				Mesa 18.1.7 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				0c3c240bcd1352d179e65993214f9d55a399beac852c3ab4433e8df9b6c51c83  mesa-18.1.7.tar.gz

				655e3b32ce3bdddd5e6e8768596e5d4bdef82d0dd37067c324cc4b2daa207306  mesa-18.1.7.tar.xz

				</pre>

				<h2>New features</h2>

				<p>None</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=105975">Bug 105975</a> - i965 always reports 0 viewport subpixel bits</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107098">Bug 107098</a> - Segfault after munmap(kms_sw_dt-&gt;ro_mapped)</li>

				</ul>

				<h2>Changes</h2>

				<p>Alexander Tsoy (1):</p>

				<ul>

				  <li>meson: fix build for egl platform_x11 without dri3 and gbm</li>

				</ul>

				<p>Bas Nieuwenhuizen (1):</p>

				<ul>

				  <li>radv: Fix missing Android platform define.</li>

				</ul>

				<p>Danylo Piliaiev (1):</p>

				<ul>

				  <li>i965: Advertise 8 bits subpixel precision for viewport bounds on gen6+</li>

				</ul>

				<p>Dave Airlie (1):</p>

				<ul>

				  <li>r600/eg: rework atomic counter emission with flushes</li>

				</ul>

				<p>Dylan Baker (7):</p>

				<ul>

				  <li>docs: Add sha256 sums for 18.1.6</li>

				  <li>cherry-ignore: Add additional 18.2 only patches</li>

				  <li>cherry-ignore: Add more 18.2 patches</li>

				  <li>cherry-ignore: Add more 18.2 patches</li>

				  <li>cherry-ignore: Add a couple of patches with &gt; 1 fixes tags</li>

				  <li>cherry-ignore: more 18.2 patches</li>

				  <li>bump version for 18.1.7 release</li>

				</ul>

				<p>Jason Ekstrand (2):</p>

				<ul>

				  <li>intel: Switch the order of the 2x MSAA sample positions</li>

				  <li>anv/lower_ycbcr: Use the binding array size for bounds checks</li>

				</ul>

				<p>Ray Strode (1):</p>

				<ul>

				  <li>gallium/winsys/kms: don't unmap what wasn't mapped</li>

				</ul>

				<p>Samuel Pitoiset (1):</p>

				<ul>

				  <li>radv/winsys: fix creating the BO list for virtual buffers</li>

				</ul>

				<p>Timothy Arceri (1):</p>

				<ul>

				  <li>radv: add Doom workaround</li>

				</ul>

				</div>

				</body>

				</html>

									
										180

docs/relnotes/18.1.8.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,180 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 18.1.8 Release Notes / September 7 2018</h1>

				<p>

				Mesa 18.1.8 is a bug fix release which fixes bugs found since the 18.1.7 release.

				</p>

				<p>

				Mesa 18.1.8 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				8ec62f215dd1bb3910987f9941c6fc31632a0874e618815cf1e8e29445c86e0a  mesa-18.1.8.tar.gz

				bd1be67fe9c73b517765264ac28911c84144682d28dbff140e1c2deb2f44c21b  mesa-18.1.8.tar.xz

				</pre>

				<h2>New features</h2>

				<p>None</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=93355">Bug 93355</a> - [BXT,SKLGT4e] intermittent ext_framebuffer_multisample.accuracy fails</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=101247">Bug 101247</a> - Mesa fails to link GLSL programs with unused output blocks</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=104809">Bug 104809</a> - anv: DOOM 2016 and Wolfenstein II:The New Colossus crash due to not having depthBoundsTest</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=105904">Bug 105904</a> - Needed to delete mesa shader cache after driver upgrade for 32 bit wine vulkan programs to work.</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106738">Bug 106738</a> - No test for miptrees with DRI modifiers</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106865">Bug 106865</a> - [GLK] piglit.spec.ext_framebuffer_multisample.accuracy stencil tests fail</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107359">Bug 107359</a> - [Regression] [bisected] [OpenGL CTS] [SKL,BDW] KHR-GL46.texture_barrier*-texels, GTF-GL46.gtf21.GL2FixedTests.buffer_corners.buffer_corners, and GTF-GL46.gtf21.GL2FixedTests.stencil_plane_corners.stencil_plane_corners fail with some configuration</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107477">Bug 107477</a> - [DXVK] Setting high shader quality in GTA V results in LLVM error</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107579">Bug 107579</a> - [SNB] The graphic corruption when we reuse the GS compiled and used for TFB when statebuffer contain magic trash in the unused space</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107601">Bug 107601</a> - Rise of the Tomb Raider Segmentation Fault when the game starts</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107760">Bug 107760</a> - GPU Hang when Playing DiRT 3 Complete Edition using Steam Play with DXVK</li>

				</ul>

				<h2>Changes</h2>

				<p>Andrii Simiklit (1):</p>

				<ul>

				  <li>i965/gen6/xfb: handle case where transform feedback is not active</li>

				</ul>

				<p>Bas Nieuwenhuizen (3):</p>

				<ul>

				  <li>radv: Add missing checks in radv_get_image_format_properties.</li>

				  <li>radv: Fix CMASK dimensions.</li>

				  <li>radv: Use a lower max offchip buffer count.</li>

				</ul>

				<p>Christian Gmeiner (1):</p>

				<ul>

				  <li>tegra: fix memory leak</li>

				</ul>

				<p>Daniel Stone (1):</p>

				<ul>

				  <li>st/dri: Don't expose sRGB formats to clients</li>

				</ul>

				<p>Dave Airlie (1):</p>

				<ul>

				  <li>ac/radeonsi: fix CIK copy max size</li>

				</ul>

				<p>Dylan Baker (10):</p>

				<ul>

				  <li>docs: Add mesa 18.1.7 notes</li>

				  <li>cherry-ignore: add a patch</li>

				  <li>cherry-ignore: Add more 18.2 only patches</li>

				  <li>meson: Actually load translation files</li>

				  <li>cherry-ignore: Add more 18.2 patches</li>

				  <li>cherry-ignore: Add additional patch</li>

				  <li>cherry-ignore: Add patch that doesn't apply to 18.1</li>

				  <li>cherry-ignore: Add a couple of two fixes warning patches</li>

				  <li>cherry-ignore: Add patch that needs more significant patches to function</li>

				  <li>Bump version to 18.1.8</li>

				</ul>

				<p>Emil Velikov (1):</p>

				<ul>

				  <li>docs: update required mako version</li>

				</ul>

				<p>Grazvydas Ignotas (1):</p>

				<ul>

				  <li>radv: place pointer length into cache uuid</li>

				</ul>

				<p>Gurchetan Singh (2):</p>

				<ul>

				  <li>meson: fix egl build for surfaceless</li>

				  <li>meson: fix egl build for android</li>

				</ul>

				<p>Ian Romanick (2):</p>

				<ul>

				  <li>i965/vec4: Clamp indirect tes input array reads with 0x0fffffff</li>

				  <li>i965/vec4: Correctly handle uniform sources in generate_tes_add_indirect_urb_offset</li>

				</ul>

				<p>Jason Ekstrand (5):</p>

				<ul>

				  <li>anv: Fill holes in the VF VUE to zero</li>

				  <li>nir/algebraic: Be more careful converting ushr to extract_u8/16</li>

				  <li>egl/dri2: Add a helper for the number of planes for a FOURCC format</li>

				  <li>egl/dri2: Guard against invalid fourcc formats</li>

				  <li>anv/blorp: Do more flushing around HiZ clears</li>

				</ul>

				<p>Juan A. Suarez Romero (1):</p>

				<ul>

				  <li>egl/wayland: do not leak wl_buffer when it is locked</li>

				</ul>

				<p>Lionel Landwerlin (1):</p>

				<ul>

				  <li>anv: blorp: support multiple aspect blits</li>

				</ul>

				<p>Marek Olšák (1):</p>

				<ul>

				  <li>glapi: actually implement GL_EXT_robustness for GLES</li>

				</ul>

				<p>Nanley Chery (7):</p>

				<ul>

				  <li>intel/isl: Avoid tiling some 16K-wide render targets</li>

				  <li>i965: Make blt_pitch public</li>

				  <li>i965/miptree: Drop an if case from retile_as_linear</li>

				  <li>i965/miptree: Use the correct BLT pitch</li>

				  <li>i965/miptree: Use miptree_map in map_blit functions</li>

				  <li>i965/miptree: Fix can_blit_slice()</li>

				  <li>i965/gen7_urb: Re-emit PUSH_CONSTANT_ALLOC on some gen9</li>

				</ul>

				<p>Samuel Pitoiset (1):</p>

				<ul>

				  <li>radv: fix passing clip/cull distances from VS to PS</li>

				</ul>

				<p>vadym.shovkoplias (1):</p>

				<ul>

				  <li>glsl/linker: Allow unused in blocks which are not declated on previous stage</li>

				</ul>

				</div>

				</body>

				</html>

									
										178

docs/relnotes/18.1.9.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,178 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 18.1.8 Release Notes / September 24 2018</h1>

				<p>

				Mesa 18.1.9 is a bug fix release which fixes bugs found since the 18.1.8 release.

				</p>

				<p>

				Mesa 18.1.9 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				0f825dc834b1b3e3d9a6c3ce58b42977f0d9a248a7627a36dd3b313ffe41a499  mesa-18.1.9.tar.gz

				55f5778d58a710a63d6635f000535768faf7db9e8144dc0f4fd1989f936c1a83  mesa-18.1.9.tar.xz

				</pre>

				<h2>New features</h2>

				<p>None</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=103241">Bug 103241</a> - Anv crashes when using 64-bit vertex inputs</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=104926">Bug 104926</a> - swrast: Mesa 17.3.3 produces:  HW cursor for format 875713089 not supported</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107280">Bug 107280</a> - [DXVK] Batman: Arkham City with tessellation enabled hangs on SKL GT4</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107772">Bug 107772</a> - Mesa preprocessor matches if(def)s &amp; endifs incorrectly</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107779">Bug 107779</a> - Access violation with some games</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107810">Bug 107810</a> - The 'va_end' call is missed after 'va_copy' in 'util_vsnprintf' function under windows</li>

				</ul>

				<h2>Changes</h2>

				<p>Andrii Simiklit (4):</p>

				<ul>

				  <li>apple/glx/log: added missing va_end() after va_copy()</li>

				  <li>mesa/util: don't use the same 'va_list' instance twice</li>

				  <li>mesa/util: don't ignore NULL returned from 'malloc'</li>

				  <li>mesa/util: add missing va_end() after va_copy()</li>

				</ul>

				<p>Bas Nieuwenhuizen (4):</p>

				<ul>

				  <li>radv: Use build ID if available for cache UUID.</li>

				  <li>radv: Only allow 16 user SGPRs for compute on GFX9+.</li>

				  <li>radv: Set the user SGPR MSB for Vega.</li>

				  <li>radv: Fix driver UUID SHA1 init.</li>

				</ul>

				<p>Christopher Egert (1):</p>

				<ul>

				  <li>radeon: fix ColorMask</li>

				</ul>

				<p>Dave Airlie (1):</p>

				<ul>

				  <li>virgl: don't send a shader create with no data. (v2)</li>

				</ul>

				<p>Dylan Baker (10):</p>

				<ul>

				  <li>docs/relnotes: Add sha256 sums for mesa 18.1.8</li>

				  <li>cherry-ignore: Add additional 18.2 patch</li>

				  <li>meson: Print a message about why a libdrm version was selected</li>

				  <li>cherry-ignore: add another 18.2 patch</li>

				  <li>cherry-ignore: Add patches that don't apply cleanly and are for developer tools</li>

				  <li>cherry-ignore: Add more 18.2 patches</li>

				  <li>cherry-ignore: add 18.2 patchs</li>

				  <li>cherry-ignore: add a patch that was reverted on master</li>

				  <li>cherry-ignore: one final update</li>

				  <li>Bump version to 18.1.9</li>

				</ul>

				<p>Erik Faye-Lund (2):</p>

				<ul>

				  <li>winsys/virgl: avoid unintended behavior</li>

				  <li>virgl: adjust strides when mapping temp-resources</li>

				</ul>

				<p>Gert Wollny (1):</p>

				<ul>

				  <li>winsys/virgl: correct resource and handle allocation (v2)</li>

				</ul>

				<p>Jason Ekstrand (6):</p>

				<ul>

				  <li>anv/pipeline: Only consider double elements which actually exist</li>

				  <li>i965: Workaround the gen9 hw astc5x5 sampler bug</li>

				  <li>anv: Re-emit vertex buffers when the pipeline changes</li>

				  <li>anv: Disable the vertex cache when tessellating on SKL GT4</li>

				  <li>anv: Clamp scissors to the framebuffer boundary</li>

				  <li>anv/query: Write both dwords in emit_zero_queries</li>

				</ul>

				<p>Josh Pieper (1):</p>

				<ul>

				  <li>st/mesa: Validate the result of pipe_transfer_map in make_texture (v2)</li>

				</ul>

				<p>Kenneth Feng (1):</p>

				<ul>

				  <li>amd: Add Picasso device id</li>

				</ul>

				<p>Marek Olšák (4):</p>

				<ul>

				  <li>st/mesa: help fix stencil border color for GL_DEPTH_STENCIL textures</li>

				  <li>radeonsi: fix HTILE for NPOT textures with mipmapping on SI/CI</li>

				  <li>r600: fix HTILE for NPOT textures with mipmapping</li>

				  <li>radeonsi: fix printing a BO list into ddebug reports</li>

				</ul>

				<p>Mathias Fröhlich (1):</p>

				<ul>

				  <li>tnl: Fix green gun regression in xonotic.</li>

				</ul>

				<p>Mauro Rossi (3):</p>

				<ul>

				  <li>android: broadcom/genxml: fix collision with intel/genxml header-gen macro</li>

				  <li>android: broadcom/cle: add gallium include path</li>

				  <li>android: broadcom/cle: export the broadcom top level path headers</li>

				</ul>

				<p>Michal Srb (1):</p>

				<ul>

				  <li>st/dri: don't set queryDmaBufFormats/queryDmaBufModifiers if the driver does not implement it</li>

				</ul>

				<p>Michel Dänzer (1):</p>

				<ul>

				  <li>loader/dri3: Only wait for back buffer fences in dri3_get_buffer</li>

				</ul>

				<p>Pierre Moreau (1):</p>

				<ul>

				  <li>nvir: Always split 64-bit IMAD/IMUL operations</li>

				</ul>

				<p>Sergii Romantsov (1):</p>

				<ul>

				  <li>intel: compiler option msse2 and mstackrealign</li>

				</ul>

				<p>Timothy Arceri (1):</p>

				<ul>

				  <li>glsl: fixer lexer for unreachable defines</li>

				</ul>

				</div>

				</body>

				</html>

									
										3

docs/relnotes/18.2.8.html
									
												View File
												
				@@ -31,7 +31,8 @@ Compatibility contexts may report a lower version depending on each driver.

				<h2>SHA256 checksums</h2>

				<pre>

				TBD

				77512edc0a84e19c7131a0e2e5ebf1beaf1494dc4b71508fcc92d06d65f9f4f5  mesa-18.2.8.tar.gz

				1d2ed9fd435d86d95b7215b287258d3e6b1180293a36f688e5a2efc18298d863  mesa-18.2.8.tar.xz

				</pre>

									
										283

docs/relnotes/18.3.0.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,283 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 18.3.0 Release Notes / December 7, 2018</h1>

				<p>

				Mesa 18.3.0 is a new development release. People who are concerned

				with stability and reliability should stick with a previous release or

				wait for Mesa 18.3.1.

				</p>

				<p>

				Mesa 18.3.0 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<p>

				libwayland-egl is now distributed by Wayland (since 1.15,

				<a href="https://lists.freedesktop.org/archives/wayland-devel/2018-April/037767.html">see announcement</a>),

				and has been removed from Mesa in this release. Make sure you're using

				an up-to-date version of Wayland to keep the functionality.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				17a124d4dbc712505d22a7815c9b0cee22214c96c8abb91539a2b1351e38a000  mesa-18.3.0.tar.gz

				b63f947e735d6ef3dfaa30c789a9adfbae18aea671191eaacde95a18c17fc38a  mesa-18.3.0.tar.xz

				</pre>

				<h2>New features</h2>

				<p>

				Note: some of the new features are only available with certain drivers.

				</p>

				<ul>

				<li>GL_AMD_depth_clamp_separate on r600, radeonsi.</li>

				<li>GL_AMD_framebuffer_multisample_advanced on radeonsi.</li>

				<li>GL_AMD_gpu_shader_int64 on i965, nvc0, radeonsi.</li>

				<li>GL_AMD_multi_draw_indirect on all GL 4.x drivers.</li>

				<li>GL_AMD_query_buffer_object on i965, nvc0, r600, radeonsi.</li>

				<li>GL_EXT_disjoint_timer_query on radeonsi and most other Gallium drivers (ES extension)</li>

				<li>GL_EXT_texture_compression_s3tc on all drivers (ES extension)<li>

				<li>GL_EXT_vertex_attrib_64bit on i965, nvc0, radeonsi.</li>

				<li>GL_EXT_window_rectangles on radeonsi.</li>

				<li>GL_KHR_texture_compression_astc_sliced_3d on radeonsi.</li>

				<li>GL_NV_fragment_shader_interlock on i965.</li>

				<li>EGL_EXT_device_base for all drivers.</li>

				<li>EGL_EXT_device_drm for all drivers.</li>

				<li>EGL_MESA_device_software for all drivers.</li>

				</ul>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=13728">Bug 13728</a> - [G965] Some objects in Neverwinter Nights Linux version not displayed correctly</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=91433">Bug 91433</a> - piglit.spec.arb_depth_buffer_float.fbo-depth-gl_depth_component32f-copypixels fails</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=93355">Bug 93355</a> - [BXT,SKLGT4e] intermittent ext_framebuffer_multisample.accuracy fails</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=94957">Bug 94957</a> - dEQP failures on llvmpipe</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=98699">Bug 98699</a> - &quot;float[a+++4 ? 1:1] f;&quot; crashes glsl_compiler</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=99507">Bug 99507</a> - Corrupted frame contents with Vulkan version of DOTA2, Talos Principle and Sascha Willems' demos when they're run Vsynched in fullscreen</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=99730">Bug 99730</a> - Metro Redux game(s) needs override for midshader extension declaration</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=100200">Bug 100200</a> - Default Unreal Engine 4 frag shader fails to compile</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=101247">Bug 101247</a> - Mesa fails to link GLSL programs with unused output blocks</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=102597">Bug 102597</a> - [Regression] mpv, high rendering times (two to three times higher)</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=103241">Bug 103241</a> - Anv crashes when using 64-bit vertex inputs</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=104602">Bug 104602</a> - [apitrace] Graphical artifacts in Civilization VI on RX Vega</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=104809">Bug 104809</a> - anv: DOOM 2016 and Wolfenstein II:The New Colossus crash due to not having depthBoundsTest</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=104926">Bug 104926</a> - swrast: Mesa 17.3.3 produces:  HW cursor for format 875713089 not supported</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=105333">Bug 105333</a> - [gallium-nine] missing geometry after commit ac: replace ac_build_kill with ac_build_kill_if_false</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=105371">Bug 105371</a> - r600_shader_from_tgsi - GPR limit exceeded - shader requires 360 registers</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=105731">Bug 105731</a> - linker error &quot;fragment shader input ... has no matching output in the previous stage&quot; when previous stage's output declaration in a separate shader object</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=105904">Bug 105904</a> - Needed to delete mesa shader cache after driver upgrade for 32 bit wine vulkan programs to work.</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=105975">Bug 105975</a> - i965 always reports 0 viewport subpixel bits</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106231">Bug 106231</a> - llvmpipe blends produce bad code after llvm patch https://reviews.llvm.org/D44785</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106283">Bug 106283</a> - Shader replacements works only for limited use cases</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106577">Bug 106577</a> - broken rendering with nine and nouveau (GM107)</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106833">Bug 106833</a> - glLinkProgram is expected to fail when vertex attribute aliasing happens on ES3.0 context or later</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106865">Bug 106865</a> - [GLK] piglit.spec.ext_framebuffer_multisample.accuracy stencil tests fail</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106980">Bug 106980</a> - Basemark GPU vulkan benchmark hangs on GFX9</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106997">Bug 106997</a> - [Regression]. Dying light game is crashing on latest mesa</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107088">Bug 107088</a> - [GEN8+] Hang when discarding a fragment if dual source blending is enabled but shader doesn't support it</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107098">Bug 107098</a> - Segfault after munmap(kms_sw_dt-&gt;ro_mapped)</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107212">Bug 107212</a> - Dual-Core CPU E5500 / G45: RetroArch with reicast core results in corrupted graphics</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107223">Bug 107223</a> - [GEN9+] 50% perf drop in SynMark Fill* tests (E2E RBC gets disabled?)</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107276">Bug 107276</a> - radv: OpBitfieldUExtract returns incorrect result when count is zero</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107280">Bug 107280</a> - [DXVK] Batman: Arkham City with tessellation enabled hangs on SKL GT4</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107313">Bug 107313</a> - Meson instructions on web site are non-optimal</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107359">Bug 107359</a> - [Regression] [bisected] [OpenGL CTS] [SKL,BDW] KHR-GL46.texture_barrier*-texels, GTF-GL46.gtf21.GL2FixedTests.buffer_corners.buffer_corners, and GTF-GL46.gtf21.GL2FixedTests.stencil_plane_corners.stencil_plane_corners fail with some configuration</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107460">Bug 107460</a> - radv: OpControlBarrier does not always work correctly (bisected)</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107477">Bug 107477</a> - [DXVK] Setting high shader quality in GTA V results in LLVM error</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107483">Bug 107483</a> - DispatchSanity_test.GL31_CORE regression</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107487">Bug 107487</a> - [intel] [tools] intel gpu tools don't honor -D tools=[]</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107488">Bug 107488</a> - gl.h:2090: error: redefinition of typedef ‘GLeglImageOES’</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107510">Bug 107510</a> - [GEN8+] up to 10% perf drop on several 3D benchmarks</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107511">Bug 107511</a> - KHR/khrplatform.h not always installed when needed</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107524">Bug 107524</a> - Broken packDouble2x32 at llvmpipe</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107544">Bug 107544</a> - intel/decoder: out of bounds group_iter</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107547">Bug 107547</a> - shader crashing glsl_compiler (uniform block assigned to vec2, then component substraced by 1)</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107550">Bug 107550</a> - &quot;0[2]&quot; as function parameter hits assert</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107563">Bug 107563</a> - [RADV] Broken rendering in Unity demos</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107565">Bug 107565</a> - TypeError: __init__() got an unexpected keyword argument 'future_imports'</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107579">Bug 107579</a> - [SNB] The graphic corruption when we reuse the GS compiled and used for TFB when statebuffer contain magic trash in the unused space</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107601">Bug 107601</a> - Rise of the Tomb Raider Segmentation Fault when the game starts</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107610">Bug 107610</a> - Dolphin emulator mis-renders shadow overlay in Super Mario Sunshine</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107626">Bug 107626</a> - [SNB] The graphical corruption and GPU hang occur sometimes on the piglit test &quot;arb_texture_multisample-large-float-texture&quot; with parameter --fp16</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107658">Bug 107658</a> - [Regression] [bisected] [OpenGLES CTS] KHR-GLES3.packed_pixels.*rectangle.r*8_snorm</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107734">Bug 107734</a> - [GLSL] glsl-fface-invariant, glsl-fcoord-invariant and glsl-pcoord-invariant should fail</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107745">Bug 107745</a> - [bisected] [bdw bsw] piglit.­spec.­arb_fragment_shader_interlock.­arb_fragment_shader_interlock-image-load-store failure</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107760">Bug 107760</a> - GPU Hang when Playing DiRT 3 Complete Edition using Steam Play with DXVK</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107765">Bug 107765</a> - [regression] Batman Arkham City crashes with DXVK under wine</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107772">Bug 107772</a> - Mesa preprocessor matches if(def)s &amp; endifs incorrectly</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107779">Bug 107779</a> - Access violation with some games</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107786">Bug 107786</a> - [DXVK] MSAA reflections are broken in GTA V</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107806">Bug 107806</a> - glsl_get_natural_size_align_bytes() ABORT with GfxBench Vulkan AztecRuins</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107810">Bug 107810</a> - The 'va_end' call is missed after 'va_copy' in 'util_vsnprintf' function under windows</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107832">Bug 107832</a> - Gallium picking A16L16 formats when emulating INTENSITY16 conflicts with mesa</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107843">Bug 107843</a> - 32bit Mesa build failes with meson.</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107856">Bug 107856</a> - i965 incorrectly calculates the number of layers for texture views (assert)</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107857">Bug 107857</a> - GPU hang - GS_EMIT without shader outputs</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107865">Bug 107865</a> - swr fail to build with llvm-libs 6.0.1</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107869">Bug 107869</a> - u_thread.h:87:4: error: use of undeclared identifier 'cpu_set_t'</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107870">Bug 107870</a> - Undefined symbols for architecture x86_64: &quot;_util_cpu_caps&quot;</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107879">Bug 107879</a> - crash happens when link program</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107891">Bug 107891</a> - [wine, regression, bisected] RAGE, Wolfenstein The New Order hangs in menu</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107923">Bug 107923</a> - build_id.c:126: multiple definition of `build_id_length'</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107926">Bug 107926</a> - [anv] Rise of the Tomb Raider always misrendering, segfault and gpu hang.</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107941">Bug 107941</a> - GPU hang and system crash with Dota 2 using Vulkan</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107971">Bug 107971</a> - SPV_GOOGLE_hlsl_functionality1 / SPV_GOOGLE_decorate_string</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108012">Bug 108012</a> - Compiler crashes on access of non-existent member incremental operations</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108024">Bug 108024</a> - [Debian Stretch]Fail to build because &quot;xcb_randr_lease_t&quot;</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108082">Bug 108082</a> - warning: unknown warning option '-Wno-format-truncation' [-Wunknown-warning-option]</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108109">Bug 108109</a> - [GLSL] no-overloads.vert fails</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108112">Bug 108112</a> - [vulkancts] some of the coherent memory tests fail.</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108113">Bug 108113</a> - [vulkancts] r32g32b32 transfer operations not implemented</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108115">Bug 108115</a> - [vulkancts] dEQP-VK.subgroups.vote.graphics.subgroupallequal.* fails</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108164">Bug 108164</a> - [radv] VM faults since 5d6a560a2986c9ab421b3c7904d29bb7bc35e36f</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108245">Bug 108245</a> - RADV/Vega: Low mip levels of large BCn textures get corrupted by vkCmdCopyBufferToImage</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108272">Bug 108272</a> - [polaris10] opencl-mesa: Anything using OpenCL segfaults, XFX Radeon RX 580</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108311">Bug 108311</a> - Query buffer object support is broken on r600.</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108319">Bug 108319</a> - [GLK BXT BSW] Assertion in piglit.spec.arb_gpu_shader_fp64.execution.built-in-functions.vs-sign-sat-neg-abs</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108491">Bug 108491</a> - Commit baa38c14 causes output issues on my VEGA with RADV</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108524">Bug 108524</a> - [RADV]  GPU lockup on event synchronization</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108530">Bug 108530</a> - (mesa-18.3) [Tracker] Mesa 18.3 Release Tracker</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108532">Bug 108532</a> - make check nir_copy_prop_vars_test.store_store_load_different_components regression</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108560">Bug 108560</a> - Mesa 32 is built without sse</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108595">Bug 108595</a> - ir3_compiler valgrind build error</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108617">Bug 108617</a> - [deqp] Mesa fails conformance for egl_ext_device</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108630">Bug 108630</a> - [G965] piglit.spec.!opengl 1_2.tex3d-maxsize spins forever</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108635">Bug 108635</a> - Mesa master commit 68dc591af16ebb36814e4c187e4998948103c99c causes XWayland to segfault</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108713">Bug 108713</a> - Gallium: use after free with transform feedback</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108829">Bug 108829</a> - [meson] libglapi exports internal API</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108894">Bug 108894</a> - [anv] vkCmdCopyBuffer() and vkCmdCopyQueryPoolResults() write-after-write hazard</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108909">Bug 108909</a> - Vkd3d test failure test_resolve_non_issued_query_data()</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108914">Bug 108914</a> - blocky shadow artifacts in The Forest with DXVK, RADV_DEBUG=nohiz fixes this</li>

				<h2>Changes</h2>

				<ul>

				<li>TBD</li>

				</ul>

				</div>

				</body>

				</html>

									
										63

docs/relnotes/18.3.1.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,63 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 18.3.1 Release Notes / December 11, 2018</h1>

				<p>

				Mesa 18.3.1 is a bug fix release which fixes bugs found since the 18.3.0 release.

				</p>

				<p>

				Mesa 18.3.0 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				256d0c3d88e380c1b8e3fc5c6ac34001e3b7c30458b8b852407ec68b8ccd9fda  mesa-18.3.1.tar.gz

				5b1f827d28684a25f6657289f8b7d47ac56395988c7ac23e0ec9a62b644bdc63  mesa-18.3.1.tar.xz

				</pre>

				<h2>New features</h2>

				<p>None</p>

				<h2>Bug fixes</h2>

				<p>None</p>

				<h2>Changes</h2>

				<p>Emil Velikov (2):</p>

				<ul>

				  <li>docs: add sha256 checksums for 18.3.0</li>

				  <li>Update version to 18.3.1</li>

				</ul>

				<p>Jason Ekstrand (1):</p>

				<ul>

				  <li>anv,radv: Disable VK_EXT_pci_bus_info</li>

				</ul>

				</div>

				</body>

				</html>

									
										265

docs/relnotes/18.3.2.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,265 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 18.3.2 Release Notes / January 17, 2019</h1>

				<p>

				Mesa 18.3.2 is a bug fix release which fixes bugs found since the 18.3.1 release.

				</p>

				<p>

				Mesa 18.3.2 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				1cde4fafd40cd1ad4ee3a13b364b7a0175a08b7afdd127fb46f918c1e1dfd4b0  mesa-18.3.2.tar.gz

				f7ce7181c07b6d8e0132da879af1729523a6c8aa87f79a9d59dfd064024cfb35  mesa-18.3.2.tar.xz

				</pre>

				<h2>New features</h2>

				<p>None</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=106595">Bug 106595</a> - [RADV] Rendering distortions only when MSAA is enabled</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107728">Bug 107728</a> - Wrong background in Sascha Willem's Multisampling Demo</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108114">Bug 108114</a> - [vulkancts] new VK_KHR_16bit_storage tests fail.</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108116">Bug 108116</a> - [vulkancts] stencil partial clear tests fail.</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108624">Bug 108624</a> - [regression][bisected] &quot;nir: Copy propagation between blocks&quot; regression</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108910">Bug 108910</a> - Vkd3d test failure test_multisample_array_texture()</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108911">Bug 108911</a> - Vkd3d test failure test_clear_render_target_view()</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108943">Bug 108943</a> - Build fails on ppc64le with meson</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=109072">Bug 109072</a> - GPU hang in blender 2.80</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=109081">Bug 109081</a> - [bisected] [HSW] Regression in clipping.user_defined.clip_* vulkancts tests</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=109151">Bug 109151</a> - [KBL-G][vulkan] dEQP-VK.texture.explicit_lod.2d.sizes.31x55_nearest_linear_mipmap_nearest_repeat failed verification.</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=109202">Bug 109202</a> - nv50_ir.cpp:749:19: error: cannot use typeid with -fno-rtti</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=109204">Bug 109204</a> - [regression, bisected] retroarch's crt-royale shader crash radv</li>

				</ul>

				<h2>Changes</h2>

				<p>Alex Deucher (3):</p>

				<ul>

				  <li>pci_ids: add new vega10 pci ids</li>

				  <li>pci_ids: add new vega20 pci id</li>

				  <li>pci_ids: add new VegaM pci id</li>

				</ul>

				<p>Alexander von Gluck IV (1):</p>

				<ul>

				  <li>egl/haiku: Fix reference to disp vs dpy</li>

				</ul>

				<p>Andres Gomez (2):</p>

				<ul>

				  <li>glsl: correct typo in GLSL compilation error message</li>

				  <li>glsl/linker: specify proper direction in location aliasing error</li>

				</ul>

				<p>Axel Davy (3):</p>

				<ul>

				  <li>st/nine: Fix volumetexture dtor on ctor failure</li>

				  <li>st/nine: Bind src not dst in nine_context_box_upload</li>

				  <li>st/nine: Add src reference to nine_context_range_upload</li>

				</ul>

				<p>Bas Nieuwenhuizen (5):</p>

				<ul>

				  <li>radv: Do a cache flush if needed before reading predicates.</li>

				  <li>radv: Implement buffer stores with less than 4 components.</li>

				  <li>anv/android: Do not reject storage images.</li>

				  <li>radv: Fix rasterization precision bits.</li>

				  <li>spirv: Fix matrix parameters in function calls.</li>

				</ul>

				<p>Caio Marcelo de Oliveira Filho (3):</p>

				<ul>

				  <li>nir: properly clear the entry sources in copy_prop_vars</li>

				  <li>nir: properly find the entry to keep in copy_prop_vars</li>

				  <li>nir: remove dead code from copy_prop_vars</li>

				</ul>

				<p>Dave Airlie (2):</p>

				<ul>

				  <li>radv/xfb: fix counter buffer bounds checks.</li>

				  <li>virgl/vtest: fix front buffer flush with protocol version 0.</li>

				</ul>

				<p>Dylan Baker (6):</p>

				<ul>

				  <li>meson: Fix ppc64 little endian detection</li>

				  <li>meson: Add support for gnu hurd</li>

				  <li>meson: Add toggle for glx-direct</li>

				  <li>meson: Override C++ standard to gnu++11 when building with altivec on ppc64</li>

				  <li>meson: Error out if building nouveau and using LLVM without rtti</li>

				  <li>autotools: Remove tegra vdpau driver</li>

				</ul>

				<p>Emil Velikov (12):</p>

				<ul>

				  <li>docs: add sha256 checksums for 18.3.1</li>

				  <li>bin/get-pick-list.sh: rework handing of sha nominations</li>

				  <li>bin/get-pick-list.sh: warn when commit lists invalid sha</li>

				  <li>cherry-ignore: meson: libfreedreno depends upon libdrm (for fence support)</li>

				  <li>glx: mandate xf86vidmode only for "drm" dri platforms</li>

				  <li>meson: don't require glx/egl/gbm with gallium drivers</li>

				  <li>pipe-loader: meson: reference correct library</li>

				  <li>TODO: glx: meson: build dri based glx tests, only with -Dglx=dri</li>

				  <li>glx: meson: drop includes from a link-only library</li>

				  <li>glx: meson: wire up the dispatch-index-check test</li>

				  <li>glx/test: meson: assorted include fixes</li>

				  <li>Update version to 18.3.2</li>

				</ul>

				<p>Eric Anholt (6):</p>

				<ul>

				  <li>v3d: Fix a leak of the transfer helper on screen destroy.</li>

				  <li>vc4: Fix a leak of the transfer helper on screen destroy.</li>

				  <li>v3d: Fix a leak of the disassembled instruction string during debug dumps.</li>

				  <li>v3d: Make sure that a thrsw doesn't split a multop from its umul24.</li>

				  <li>v3d: Add missing flagging of SYNCB as a TSY op.</li>

				  <li>gallium/ttn: Fix setup of outputs_written.</li>

				</ul>

				<p>Erik Faye-Lund (2):</p>

				<ul>

				  <li>virgl: wrap vertex element state in a struct</li>

				  <li>virgl: work around bad assumptions in virglrenderer</li>

				</ul>

				<p>Francisco Jerez (5):</p>

				<ul>

				  <li>intel/fs: Handle source modifiers in lower_integer_multiplication().</li>

				  <li>intel/fs: Implement quad swizzles on ICL+.</li>

				  <li>intel/fs: Fix bug in lower_simd_width while splitting an instruction which was already split.</li>

				  <li>intel/eu/gen7: Fix brw_MOV() with DF destination and strided source.</li>

				  <li>intel/fs: Respect CHV/BXT regioning restrictions in copy propagation pass.</li>

				</ul>

				<p>Ian Romanick (2):</p>

				<ul>

				  <li>i965/vec4/dce: Don't narrow the write mask if the flags are used</li>

				  <li>Revert "nir/lower_indirect: Bail early if modes == 0"</li>

				</ul>

				<p>Jan Vesely (1):</p>

				<ul>

				  <li>clover: Fix build after clang r348827</li>

				</ul>

				<p>Jason Ekstrand (6):</p>

				<ul>

				  <li>nir/constant_folding: Fix source bit size logic</li>

				  <li>intel/blorp: Be more conservative about copying clear colors</li>

				  <li>spirv: Handle any bit size in vector_insert/extract</li>

				  <li>anv/apply_pipeline_layout: Set the cursor in lower_res_reindex_intrinsic</li>

				  <li>spirv: Sign-extend array indices</li>

				  <li>intel/peephole_ffma: Fix swizzle propagation</li>

				</ul>

				<p>Karol Herbst (1):</p>

				<ul>

				  <li>nv50/ir: fix use-after-free in ConstantFolding::visit</li>

				</ul>

				<p>Kirill Burtsev (1):</p>

				<ul>

				  <li>loader: free error state, when checking the drawable type</li>

				</ul>

				<p>Lionel Landwerlin (5):</p>

				<ul>

				  <li>anv: don't do partial resolve on layer &gt; 0</li>

				  <li>i965: include draw_params/derived_draw_params for VF cache workaround</li>

				  <li>i965: add CS stall on VF invalidation workaround</li>

				  <li>anv: explictly specify format for blorp ccs/mcs op</li>

				  <li>anv: flush fast clear colors into compressed surfaces</li>

				</ul>

				<p>Marek Olšák (1):</p>

				<ul>

				  <li>st/mesa: don't leak pipe_surface if pipe_context is not current</li>

				</ul>

				<p>Mario Kleiner (1):</p>

				<ul>

				  <li>radeonsi: Fix use of 1- or 2- component GL_DOUBLE vbo's.</li>

				</ul>

				<p>Nicolai Hähnle (1):</p>

				<ul>

				  <li>meson: link LLVM 'native' component when LLVM is available</li>

				</ul>

				<p>Rhys Perry (3):</p>

				<ul>

				  <li>radv: don't set surf_index for stencil-only images</li>

				  <li>ac/nir,radv,radeonsi/nir: use correct indices for interpolation intrinsics</li>

				  <li>ac: split 16-bit ssbo loads that may not be dword aligned</li>

				</ul>

				<p>Rob Clark (2):</p>

				<ul>

				  <li>freedreno/drm: fix memory leak</li>

				  <li>mesa/st/nir: fix missing nir_compact_varyings</li>

				</ul>

				<p>Samuel Pitoiset (1):</p>

				<ul>

				  <li>radv: switch on EOP when primitive restart is enabled with triangle strips</li>

				</ul>

				<p>Timothy Arceri (2):</p>

				<ul>

				  <li>tgsi/scan: fix loop exit point in tgsi_scan_tess_ctrl()</li>

				  <li>tgsi/scan: correctly walk instructions in tgsi_scan_tess_ctrl()</li>

				</ul>

				<p>Vinson Lee (2):</p>

				<ul>

				  <li>meson: Fix typo.</li>

				  <li>meson: Fix libsensors detection.</li>

				</ul>

				</div>

				</body>

				</html>

2475

docs/relnotes/19.0.0.html Normal file

View File

File diff suppressed because it is too large Load Diff

									
										159

docs/relnotes/19.0.1.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,159 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 19.0.1 Release Notes / March 27, 2019</h1>

				<p>

				Mesa 19.0.1 is a bug fix release which fixes bugs found since the 19.0.0 release.

				</p>

				<p>

				Mesa 19.0.1 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				f1dd1980ed628edea3935eed7974fbc5d8353e9578c562728b880d63ac613dbd  mesa-19.0.1.tar.gz

				6884163c0ea9e4c98378ab8fecd72fe7b5f437713a14471beda378df247999d4  mesa-19.0.1.tar.xz

				</pre>

				<h2>New features</h2>

				<p>None</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=100316">Bug 100316</a> - Linking GLSL 1.30 shaders with invariant and deprecated variables triggers an 'mismatching invariant qualifiers' error</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=107563">Bug 107563</a> - [RADV] Broken rendering in Unity demos</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=109698">Bug 109698</a> - dri.pc contents invalid when built with meson</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=109980">Bug 109980</a> - [i915 CI][HSW] spec&#64;arb_fragment_shader_interlock&#64;arb_fragment_shader_interlock-image-load-store - fail</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110211">Bug 110211</a> - If DESTDIR is set to an empty string, the dri drivers are not installed</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110221">Bug 110221</a> - build error with meson</li>

				</ul>

				<h2>Changes</h2>

				<p>Andres Gomez (4):</p>

				<ul>

				  <li>glsl: correctly validate component layout qualifier for dvec{3,4}</li>

				  <li>glsl/linker: don't fail non static used inputs without matching outputs</li>

				  <li>glsl/linker: simplify xfb_offset vs xfb_stride overflow check</li>

				  <li>Revert "glsl: relax input-&gt;output validation for SSO programs"</li>

				</ul>

				<p>Bas Nieuwenhuizen (2):</p>

				<ul>

				  <li>radv: Use correct image view comparison for fast clears.</li>

				  <li>ac/nir: Return frag_coord as integer.</li>

				</ul>

				<p>Danylo Piliaiev (2):</p>

				<ul>

				  <li>anv: Treat zero size XFB buffer as disabled</li>

				  <li>glsl: Cross validate variable's invariance by explicit invariance only</li>

				</ul>

				<p>Dave Airlie (1):</p>

				<ul>

				  <li>softpipe: fix texture view crashes</li>

				</ul>

				<p>Dylan Baker (5):</p>

				<ul>

				  <li>docs: Add SHA256 sums for 19.0.0</li>

				  <li>cherry-ignore: Add commit that doesn't apply</li>

				  <li>bin/install_megadrivers.py: Correctly handle DESTDIR=''</li>

				  <li>bin/install_megadrivers.py: Fix regression for set DESTDIR</li>

				  <li>bump version for 19.0.1</li>

				</ul>

				<p>Eric Anholt (1):</p>

				<ul>

				  <li>v3d: Fix leak of the renderonly struct on screen destruction.</li>

				</ul>

				<p>Jason Ekstrand (6):</p>

				<ul>

				  <li>glsl/lower_vector_derefs: Don't use a temporary for TCS outputs</li>

				  <li>glsl/list: Add a list variant of insert_after</li>

				  <li>anv/pass: Flag the need for a RT flush for resolve attachments</li>

				  <li>nir/builder: Add a vector extract helper</li>

				  <li>nir: Add a new pass to lower array dereferences on vectors</li>

				  <li>intel/nir: Lower array-deref-of-vector UBO and SSBO loads</li>

				</ul>

				<p>Józef Kucia (2):</p>

				<ul>

				  <li>radv: Fix driverUUID</li>

				  <li>mesa: Fix GL_NUM_DEVICE_UUIDS_EXT</li>

				</ul>

				<p>Kenneth Graunke (1):</p>

				<ul>

				  <li>intel/fs: Fix opt_peephole_csel to not throw away saturates.</li>

				</ul>

				<p>Kevin Strasser (1):</p>

				<ul>

				  <li>egl/dri: Avoid out of bounds array access</li>

				</ul>

				<p>Mark Janes (1):</p>

				<ul>

				  <li>mesa: properly report the length of truncated log messages</li>

				</ul>

				<p>Plamena Manolova (1):</p>

				<ul>

				  <li>i965: Disable ARB_fragment_shader_interlock for platforms prior to GEN9</li>

				</ul>

				<p>Samuel Pitoiset (3):</p>

				<ul>

				  <li>radv: set the maximum number of IBs per submit to 192</li>

				  <li>radv: always initialize HTILE when the src layout is UNDEFINED</li>

				  <li>radv: fix binding transform feedback buffers</li>

				</ul>

				<p>Sergii Romantsov (1):</p>

				<ul>

				  <li>d3d: meson: do not prefix user provided d3d-drivers-path</li>

				</ul>

				<p>Tapani Pälli (2):</p>

				<ul>

				  <li>isl: fix automake build when sse41 is not supported</li>

				  <li>anv/radv: release memory allocated by glsl types during spirv_to_nir</li>

				</ul>

				</div>

				</body>

				</html>

									
										122

docs/relnotes/19.0.2.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,122 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 19.0.2 Release Notes / April 10, 2019</h1>

				<p>

				Mesa 19.0.2 is a bug fix release which fixes bugs found since the 19.0.1 release.

				</p>

				<p>

				Mesa 19.0.2 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				SHA256: eb972fc11d4e1261d34ec0b91a701f158d4870c0428fb108353ae7eab64b1118  mesa-19.0.2.tar.gz

				SHA256: 1a2edc3ce56906a676c91e6851298db45903df1f5cb9827395a922c1452db802  mesa-19.0.2.tar.xz

				</pre>

				<h2>New features</h2>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108766">Bug 108766</a> - Mesa built with meson has RPATH entries</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=109648">Bug 109648</a> - AMD Raven hang during va-api decoding</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110257">Bug 110257</a> - Major artifacts in mpeg2 vaapi hw decoding</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110259">Bug 110259</a> - radv: Sampling depth-stencil image in GENERAL layout returns nothing but zero (regression, bisected)</li>

				</ul>

				<h2>Changes</h2>

				<p>Boyuan Zhang (1):</p>

				<ul>

				  <li>st/va: reverse qt matrix back to its original order</li>

				</ul>

				<p>Caio Marcelo de Oliveira Filho (1):</p>

				<ul>

				  <li>nir: Take if_uses into account when repairing SSA</li>

				</ul>

				<p>Dylan Baker (2):</p>

				<ul>

				  <li>docs: Add SHA256 sums for mesa 19.0.1</li>

				  <li>VERSION: bump version for 19.0.2</li>

				</ul>

				<p>Eric Anholt (3):</p>

				<ul>

				  <li>dri3: Return the current swap interval from glXGetSwapIntervalMESA().</li>

				  <li>v3d: Bump the maximum texture size to 4k for V3D 4.x.</li>

				  <li>v3d: Don't try to use the TFU blit path if a scissor is enabled.</li>

				</ul>

				<p>Eric Engestrom (1):</p>

				<ul>

				  <li>meson: strip rpath from megadrivers</li>

				</ul>

				<p>Jason Ekstrand (1):</p>

				<ul>

				  <li>Revert "anv/radv: release memory allocated by glsl types during spirv_to_nir"</li>

				</ul>

				<p>Karol Herbst (1):</p>

				<ul>

				  <li>nir/print: fix printing the image_array intrinsic index</li>

				</ul>

				<p>Leo Liu (2):</p>

				<ul>

				  <li>radeon/vcn: add H.264 constrained baseline support</li>

				  <li>radeon/vcn/vp9: search the render target from the whole list</li>

				</ul>

				<p>Lionel Landwerlin (1):</p>

				<ul>

				  <li>intel: add dependency on genxml generated files</li>

				</ul>

				<p>Marek Olšák (1):</p>

				<ul>

				  <li>radeonsi: fix assertion failure by using the correct type</li>

				</ul>

				<p>Samuel Pitoiset (2):</p>

				<ul>

				  <li>radv: skip updating depth/color metadata for conditional rendering</li>

				  <li>radv: do not always initialize HTILE in compressed state</li>

				</ul>

				</div>

				</body>

				</html>

									
										148

docs/relnotes/19.0.3.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,148 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 19.0.3 Release Notes / April 24, 2019</h1>

				<p>

				Mesa 19.0.3 is a bug fix release which fixes bugs found since the l9.0.2 release.

				</p>

				<p>

				Mesa 19.0.3 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				59543ec3c9f8c72990e77887f13d1678cb6739e5d5f56abc21ebf9e772389c5e  mesa-19.0.3.tar.gz

				f027244e38dc309a4c12db45ef79be81ab62c797a50a88d566e4edb6159fc4d5  mesa-19.0.3.tar.xz

				</pre>

				<h2>New features</h2>

				<p>N/A</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108879">Bug 108879</a> - [CIK] [regression] All opencl apps hangs indefinitely in si_create_context</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110201">Bug 110201</a> - [ivb] mesa 19.0.0 breaks rendering in kitty</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110356">Bug 110356</a> - install_megadrivers.py creates new dangling symlink [bisected]</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110441">Bug 110441</a> - [llvmpipe] complex-loop-analysis-bug regression</li>

				</ul>

				<h2>Changes</h2>

				<p>Andres Gomez (1):</p>

				<ul>

				  <li>glsl/linker: location aliasing requires types to have the same width</li>

				</ul>

				<p>Bas Nieuwenhuizen (1):</p>

				<ul>

				  <li>ac: Move has_local_buffers disable to radeonsi.</li>

				</ul>

				<p>Chia-I Wu (1):</p>

				<ul>

				  <li>virgl: fix fence fd version check</li>

				</ul>

				<p>Danylo Piliaiev (1):</p>

				<ul>

				  <li>intel/compiler: Do not reswizzle dst if instruction writes to flag register</li>

				</ul>

				<p>Dylan Baker (2):</p>

				<ul>

				  <li>docs: Add sha256 sums for 19.0.2</li>

				  <li>Bump version for 19.0.3</li>

				</ul>

				<p>Eric Anholt (1):</p>

				<ul>

				  <li>nir: Fix deref offset calculation for structs.</li>

				</ul>

				<p>Eric Engestrom (1):</p>

				<ul>

				  <li>meson: remove meson-created megadrivers symlinks</li>

				</ul>

				<p>Jason Ekstrand (2):</p>

				<ul>

				  <li>anv/pipeline: Fix MEDIA_VFE_STATE::PerThreadScratchSpace on gen7</li>

				  <li>anv: Add a #define for the max binding table size</li>

				</ul>

				<p>Juan A. Suarez Romero (1):</p>

				<ul>

				  <li>meson: Add dependency on genxml to anvil genfiles</li>

				</ul>

				<p>Kenneth Graunke (2):</p>

				<ul>

				  <li>glsl: Set location on structure-split sampler uniform variables</li>

				  <li>Revert "glsl: Set location on structure-split sampler uniform variables"</li>

				</ul>

				<p>Lionel Landwerlin (2):</p>

				<ul>

				  <li>anv: fix uninitialized pthread cond clock domain</li>

				  <li>intel/devinfo: fix missing num_thread_per_eu on ICL</li>

				</ul>

				<p>Lubomir Rintel (2):</p>

				<ul>

				  <li>gallivm: guess CPU features also on ARM</li>

				  <li>gallivm: disable NEON instructions if they are not supported</li>

				</ul>

				<p>Marek Olšák (1):</p>

				<ul>

				  <li>radeonsi: use CP DMA for the null const buffer clear on CIK</li>

				</ul>

				<p>Rhys Perry (1):</p>

				<ul>

				  <li>nir,ac/nir: fix cube_face_coord</li>

				</ul>

				<p>Roland Scheidegger (1):</p>

				<ul>

				  <li>gallivm: fix bogus assert in get_indirect_index</li>

				</ul>

				<p>Samuel Pitoiset (2):</p>

				<ul>

				  <li>ac/nir: only use the new raw/struct image atomic intrinsics with LLVM 9+</li>

				  <li>radv: do not load vertex attributes that are not provided by the pipeline</li>

				</ul>

				</div>

				</body>

				</html>

									
										243

docs/relnotes/19.0.4.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,243 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 19.0.4 Release Notes / May 9, 2019</h1>

				<p>

				Mesa 19.0.4 is a bug fix release which fixes bugs found since the 19.0.3 release.

				</p>

				<p>

				Mesa 19.0.4 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				de361c76bf7aae09219f571b9ae77a34864a1cd9f6ba24c845b18b3cd5e4b9a2  mesa-19.0.4.tar.gz

				39f9f32f448d77388ef817c6098d50eb0c1595815ce7e895dec09dd68774ce47  mesa-19.0.4.tar.xz

				</pre>

				<h2>New features</h2>

				<p>N/A</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=99781">Bug 99781</a> - Some Unity games fail assertion on startup in glXCreateContextAttribsARB</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=100239">Bug 100239</a> - Incorrect rendering in CS:GO</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=108540">Bug 108540</a> - vkAcquireNextImageKHR blocks when timeout=0 in Wayland</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110143">Bug 110143</a> - Doom 3: BFG Edition - Steam and GOG.com - white flickering screen</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110291">Bug 110291</a> - Vega 64 GPU hang running Space Engineers</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110355">Bug 110355</a> - radeonsi: GTK elements become invisible in some applications (GIMP, LibreOffice)</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110573">Bug 110573</a> - Mesa vulkan-radeon 19.0.3 system freeze and visual artifacts (RADV)</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110590">Bug 110590</a> - [Regression][Bisected] GTAⅣ under wine fails with GLXBadFBConfig</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110632">Bug 110632</a> - &quot;glx: Fix synthetic error generation in __glXSendError&quot; broke wine games on 32-bit</li>

				</ul>

				<h2>Changes</h2>

				<p>Alejandro Piñeiro (1):</p>

				<ul>

				  <li>docs: document MESA_GLSL=errors keyword</li>

				</ul>

				<p>Andrii Simiklit (1):</p>

				<ul>

				  <li>egl: return correct error code for a case req ver &lt; 3 with forward-compatible</li>

				</ul>

				<p>Axel Davy (1):</p>

				<ul>

				  <li>st/nine: Fix D3DWindowBuffer_release for old wine nine support</li>

				</ul>

				<p>Bas Nieuwenhuizen (1):</p>

				<ul>

				  <li>radv: Disable VK_EXT_descriptor_indexing.</li>

				</ul>

				<p>Brian Paul (1):</p>

				<ul>

				  <li>svga: add SVGA_NO_LOGGING env var (v2)</li>

				</ul>

				<p>Caio Marcelo de Oliveira Filho (1):</p>

				<ul>

				  <li>spirv: Handle SpvOpDecorateId</li>

				</ul>

				<p>Charmaine Lee (1):</p>

				<ul>

				  <li>svga: move host logging to winsys</li>

				</ul>

				<p>Chuck Atkins (1):</p>

				<ul>

				  <li>meson: Fix missing glproto dependency for gallium-glx</li>

				</ul>

				<p>Daniel Stone (1):</p>

				<ul>

				  <li>vulkan/wsi/wayland: Respect non-blocking AcquireNextImage</li>

				</ul>

				<p>Dave Airlie (2):</p>

				<ul>

				  <li>r600: reset tex array override even when no view bound</li>

				  <li>util/bitset: fix bitset range mask calculations.</li>

				</ul>

				<p>Dylan Baker (7):</p>

				<ul>

				  <li>docs: Add SHA256 sums for mesa 19.0.3</li>

				  <li>cherry-ignore: Add a patch that was manually backported</li>

				  <li>cherry-ignore: Add more backported patches</li>

				  <li>cherry-ignore: Add another patch</li>

				  <li>cherry-ignore: Add more patches</li>

				  <li>meson: Force the use of config-tool for llvm</li>

				  <li>VERSION: bump for 19.0.4 release</li>

				</ul>

				<p>Emil Velikov (3):</p>

				<ul>

				  <li>vulkan/wsi: check if the display_fd given is master</li>

				  <li>vulkan/wsi: don't use DUMB_CLOSE for normal GEM handles</li>

				  <li>configure.ac: check for libdrm when using VL with X11</li>

				</ul>

				<p>Erik Faye-Lund (2):</p>

				<ul>

				  <li>softpipe: setup pixel_offset for all primitive types</li>

				  <li>draw: flush when setting stream-out targets</li>

				</ul>

				<p>Francisco Jerez (2):</p>

				<ul>

				  <li>intel/fs: Lower integer multiply correctly when destination stride equals 4.</li>

				  <li>intel/fs: Cap dst-aligned region stride to maximum representable hstride value.</li>

				</ul>

				<p>Hal Gentz (1):</p>

				<ul>

				  <li>glx: Fix synthetic error generation in __glXSendError</li>

				</ul>

				<p>Ian Romanick (2):</p>

				<ul>

				  <li>glsl: Silence may unused parameter warnings in glsl/ir.h</li>

				  <li>mesa: Add missing display list support for GL_FOG_COORDINATE_SOURCE</li>

				</ul>

				<p>Jason Ekstrand (1):</p>

				<ul>

				  <li>anv/descriptor_set: Destroy sets before pool finalization</li>

				</ul>

				<p>Jon Turney (1):</p>

				<ul>

				  <li>meson: Force '.so' extension for DRI drivers</li>

				</ul>

				<p>Juan A. Suarez Romero (2):</p>

				<ul>

				  <li>spirv: add missing SPV_EXT_descriptor_indexing capabilities</li>

				  <li>radv: enable descriptor indexing capabilities</li>

				</ul>

				<p>Kenneth Graunke (6):</p>

				<ul>

				  <li>glsl: Allow gl_nir_lower_samplers*() without a gl_shader_program</li>

				  <li>glsl: Don't look at sampler uniform storage for internal vars</li>

				  <li>i965: Ignore uniform storage for samplers or images, use binding info</li>

				  <li>i965: Fix BRW_MEMZONE_LOW_4G heap size.</li>

				  <li>i965: Force VMA alignment to be a multiple of the page size.</li>

				  <li>i965: leave the top 4Gb of the high heap VMA unused</li>

				</ul>

				<p>Lionel Landwerlin (4):</p>

				<ul>

				  <li>anv: store heap address bounds when initializing physical device</li>

				  <li>anv: leave the top 4Gb of the high heap VMA unused</li>

				  <li>anv: fix argument name for vkCmdEndQuery</li>

				  <li>anv: rework queries writes to ensure ordering memory writes</li>

				</ul>

				<p>Marek Olšák (2):</p>

				<ul>

				  <li>radeonsi/gfx9: set that window_rectangles always roll the context</li>

				  <li>radeonsi/gfx9: rework the gfx9 scissor bug workaround (v2)</li>

				</ul>

				<p>Nicolai Hähnle (1):</p>

				<ul>

				  <li>radeonsi: add si_debug_options for convenient adding/removing of options</li>

				</ul>

				<p>Rhys Perry (1):</p>

				<ul>

				  <li>radv: fix set_output_usage_mask() with composite and 64-bit types</li>

				</ul>

				<p>Ross Burton (1):</p>

				<ul>

				  <li>Revert "meson: drop GLESv1 .so version back to 1.0.0"</li>

				</ul>

				<p>Samuel Pitoiset (8):</p>

				<ul>

				  <li>radv: add missing VEGA20 chip in radv_get_device_name()</li>

				  <li>radv: do not need to force emit the TCS regs on Vega20</li>

				  <li>radv: fix color conversions for normalized uint/sint formats</li>

				  <li>radv: implement a workaround for VK_EXT_conditional_rendering</li>

				  <li>radv: set WD_SWITCH_ON_EOP=1 when drawing primitives from a stream output buffer</li>

				  <li>radv: only need to force emit the TCS regs on Vega10 and Raven1</li>

				  <li>radv: apply the indexing workaround for atomic buffer operations on GFX9</li>

				  <li>radv: fix setting the number of rectangles when it's dyanmic</li>

				</ul>

				<p>Tapani Pälli (1):</p>

				<ul>

				  <li>anv: expose VK_EXT_queue_family_foreign on Android</li>

				</ul>

				<p>Timothy Arceri (4):</p>

				<ul>

				  <li>nir: fix nir_remove_unused_varyings()</li>

				  <li>util/drirc: add workarounds for bugs in Doom 3: BFG</li>

				  <li>radeonsi: add config entry for Counter-Strike Global Offensive</li>

				  <li>Revert "glx: Fix synthetic error generation in __glXSendError"</li>

				</ul>

				</div>

				</body>

				</html>

									
										137

docs/relnotes/19.0.5.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,137 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 19.0.5 Release Notes / May 21, 2019</h1>

				<p>

				Mesa 19.0.5 is a bug fix release which fixes bugs found since the 19.0.4 release.

				</p>

				<p>

				Mesa 19.0.5 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				b6e6b78c23bec15d1e7887c78b7ad00ce395ea1b20ad8aab6ce441f55f724e70  mesa-19.0.5.tar.gz

				6aecb7f67c136768692fb3c33a54196186c6c4fcafab7973516a355e1a54f831  mesa-19.0.5.tar.xz

				</pre>

				<h2>New features</h2>

				<p>N/A</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=109659">Bug 109659</a> - Missing OpenGL symbols in OSMesa Gallium when building with meson</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110134">Bug 110134</a> - SIGSEGV while playing large hevc video in mpv</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110648">Bug 110648</a> - Dota2 will not open using vulkan since 19.0 series</li>

				</ul>

				<h2>Changes</h2>

				<p>Caio Marcelo de Oliveira Filho (2):</p>

				<ul>

				  <li>nir: Fix nir_opt_idiv_const when negatives are involved</li>

				  <li>nir: Fix clone of nir_variable state slots</li>

				</ul>

				<p>Charmaine Lee (2):</p>

				<ul>

				  <li>st/mesa: purge framebuffers with current context after unbinding winsys buffers</li>

				  <li>mesa: unreference current winsys buffers when unbinding winsys buffers</li>

				</ul>

				<p>Dylan Baker (4):</p>

				<ul>

				  <li>docs: Add SHA256 sums for mesa 19.0.4</li>

				  <li>cherry-ignore: add patches for panfrost</li>

				  <li>cherry-ignore: Add more 19.1 patches</li>

				  <li>bump version to 19.0.5</li>

				</ul>

				<p>Eric Engestrom (1):</p>

				<ul>

				  <li>meson: expose glapi through osmesa</li>

				</ul>

				<p>Gert Wollny (2):</p>

				<ul>

				  <li>softpipe/buffer: load only as many components as the the buffer resource type provides</li>

				  <li>Revert "softpipe/buffer: load only as many components as the the buffer resource type provides"</li>

				</ul>

				<p>Ian Romanick (1):</p>

				<ul>

				  <li>Revert "nir: add late opt to turn inot/b2f combos back to bcsel"</li>

				</ul>

				<p>Jason Ekstrand (3):</p>

				<ul>

				  <li>intel/fs/ra: Only add dest interference to sources that exist</li>

				  <li>intel/fs/ra: Stop adding RA interference to too many SENDS nodes</li>

				  <li>anv: Only consider minSampleShading when sampleShadingEnable is set</li>

				</ul>

				<p>Józef Kucia (1):</p>

				<ul>

				  <li>radv: clear vertex bindings while resetting command buffer</li>

				</ul>

				<p>Kenneth Graunke (1):</p>

				<ul>

				  <li>i965: Fix memory leaks in brw_upload_cs_work_groups_surface().</li>

				</ul>

				<p>Leo Liu (1):</p>

				<ul>

				  <li>winsys/amdgpu: add VCN JPEG to no user fence group</li>

				</ul>

				<p>Lionel Landwerlin (1):</p>

				<ul>

				  <li>anv: Use corresponding type from the vector allocation</li>

				</ul>

				<p>Marek Olšák (1):</p>

				<ul>

				  <li>st/mesa: fix 2 crashes in st_tgsi_lower_yuv</li>

				</ul>

				<p>Nanley Chery (1):</p>

				<ul>

				  <li>anv: Fix some depth buffer sampling cases on ICL+</li>

				</ul>

				<p>Samuel Pitoiset (1):</p>

				<ul>

				  <li>radv: add a workaround for Monster Hunter World and LLVM 7&amp;8</li>

				</ul>

				</div>

				</body>

				</html>

									
										153

docs/relnotes/19.0.6.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,153 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 19.0.5 Release Notes / May 21, 2019</h1>

				<p>

				Mesa 19.0.6 is a bug fix release which fixes bugs found since the 19.0.5 release.

				</p>

				<p>

				Mesa 19.0.6 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				SHA256: ac8e9ea388ec5c69f5a690190edf8ede602afdbaeea62d49e108057737430ac7  mesa-19.0.6.tar.gz

				SHA256: 2db2f2fcaa4048b16e066fad76b8a93944f7d06d329972b0f5fd5ce692ce3d24  mesa-19.0.6.tar.xz

				</pre>

				<h2>New features</h2>

				<p>N/A</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110721">Bug 110721</a> - graphics corruption on steam client with mesa 19.1.0 rc3 on polaris</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110761">Bug 110761</a> - Huge problems between Mesa and Electron engine apps</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110784">Bug 110784</a> - [regression][bisected] Reverting 'expose 0 shader binary formats for compat profiles for Qt' causes get_program_binary failures on Iris</li>

				</ul>

				<h2>Changes</h2>

				<p>Alok Hota (2):</p>

				<ul>

				  <li>gallium/swr: Param defaults for unhandled PIPE_CAPs</li>

				  <li>gallium/aux: add PIPE_CAP_MAX_VARYINGS to u_screen</li>

				</ul>

				<p>Bas Nieuwenhuizen (1):</p>

				<ul>

				  <li>nir: Actually propagate progress in nir_opt_move_load_ubo.</li>

				</ul>

				<p>Chenglei Ren (1):</p>

				<ul>

				  <li>anv/android: fix missing dependencies issue during parallel build</li>

				</ul>

				<p>Christian Gmeiner (1):</p>

				<ul>

				  <li>etnaviv: use the correct uniform dirty bits</li>

				</ul>

				<p>Dave Airlie (1):</p>

				<ul>

				  <li>Revert "mesa: unreference current winsys buffers when unbinding winsys buffers"</li>

				</ul>

				<p>Deepak Rawat (1):</p>

				<ul>

				  <li>winsys/drm: Fix out of scope variable usage</li>

				</ul>

				<p>Dylan Baker (6):</p>

				<ul>

				  <li>docs: Add Sha256 sums for 19.0.5</li>

				  <li>cherry-ignore: Add a commit that was manually backported</li>

				  <li>cherry-ignore: add another 19.1 only patch</li>

				  <li>cherry-ignore: add another 19.1 only patch</li>

				  <li>gallium: wrap u_screen in extern "C" for c++</li>

				  <li>VERSION: bump to 19.0.6</li>

				</ul>

				<p>Emil Velikov (1):</p>

				<ul>

				  <li>egl/dri: flesh out and use dri2_create_drawable()</li>

				</ul>

				<p>Jan Zielinski (1):</p>

				<ul>

				  <li>swr/rast: fix 32-bit compilation on Linux</li>

				</ul>

				<p>Lionel Landwerlin (1):</p>

				<ul>

				  <li>vulkan: fix build dependency issue with generated files</li>

				</ul>

				<p>Marek Olšák (2):</p>

				<ul>

				  <li>u_blitter: don't fail mipmap generation for depth formats containing stencil</li>

				  <li>ac: fix a typo in ac_build_wg_scan_bottom</li>

				</ul>

				<p>Philipp Zabel (1):</p>

				<ul>

				  <li>etnaviv: fill missing offset in etna_resource_get_handle</li>

				</ul>

				<p>Rob Clark (3):</p>

				<ul>

				  <li>freedreno/ir3: dynamic UBO indexing vs 64b pointers</li>

				  <li>freedreno/ir3: set more barrier bits</li>

				  <li>freedreno/a6xx: fix GPU crash on small render targets</li>

				</ul>

				<p>Sagar Ghuge (1):</p>

				<ul>

				  <li>intel/compiler: Fix assertions in brw_alu3</li>

				</ul>

				<p>Samuel Pitoiset (2):</p>

				<ul>

				  <li>radv: allocate more space in the CS when emitting events</li>

				  <li>radv: do not use gfx fast depth clears for layered depth/stencil images</li>

				</ul>

				<p>Timothy Arceri (2):</p>

				<ul>

				  <li>Revert "st/mesa: expose 0 shader binary formats for compat profiles for Qt"</li>

				  <li>st/glsl: make sure to propagate initialisers to driver storage</li>

				</ul>

				</div>

				</body>

				</html>

									
										150

docs/relnotes/19.0.7.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,150 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 19.0.6 Release Notes / June 24, 2019</h1>

				<p>

				Mesa 19.0.7 is a bug fix release which fixes bugs found since the 19.0.6 release.

				</p>

				<p>

				Mesa 19.0.7 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				81119f0cbbd1fbe7c0574e1e2690e0dae8868124d24c875f5fb76f165db3a54d  mesa-19.0.7.tar.gz

				d7bf3db2e442fe5eeb96144f8508d94f04aededdf37af477e644638d366b2b28  mesa-19.0.7.tar.xz

				</pre>

				<h2>New features</h2>

				<p>N/A</p>

				<h2>Bug fixes</h2>

				<ul>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110302">Bug 110302</a> - [bisected][regression] piglit egl-create-pbuffer-surface and egl-gl-colorspace regressions</li>

				<li><a href="https://bugs.freedesktop.org/show_bug.cgi?id=110921">Bug 110921</a> - virgl on OpenGL 3.3 host regressed to OpenGL 2.1</li>

				</ul>

				<h2>Changes</h2>

				<p>Bas Nieuwenhuizen (5):</p>

				<ul>

				  <li>radv: Prevent out of bound shift on 32-bit builds.</li>

				  <li>radv: Decompress DCC when the image format is not allowed for buffers.</li>

				  <li>radv: Fix vulkan build in meson.</li>

				  <li>anv: Fix vulkan build in meson.</li>

				  <li>meson: Allow building radeonsi with just the android platform.</li>

				</ul>

				<p>Charmaine Lee (1):</p>

				<ul>

				  <li>svga: Remove unnecessary check for the pre flush bit for setting vertex buffers</li>

				</ul>

				<p>Deepak Rawat (1):</p>

				<ul>

				  <li>winsys/svga/drm: Fix 32-bit RPCI send message</li>

				</ul>

				<p>Dylan Baker (3):</p>

				<ul>

				  <li>docs: Add SHA256 sums for 19.0.6</li>

				  <li>cherry-ignore: add additional 19.1 only patches</li>

				  <li>Bump version for 19.0.7 release</li>

				</ul>

				<p>Emil Velikov (1):</p>

				<ul>

				  <li>mapi: correctly handle the full offset table</li>

				</ul>

				<p>Gert Wollny (2):</p>

				<ul>

				  <li>virgl: Add a caps feature check version</li>

				  <li>virgl: Assume sRGB write control for older guest kernels or virglrenderer hosts</li>

				</ul>

				<p>Haihao Xiang (1):</p>

				<ul>

				  <li>i965: support UYVY for external import only</li>

				</ul>

				<p>Jason Ekstrand (2):</p>

				<ul>

				  <li>nir/propagate_invariant: Don't add NULL vars to the hash table</li>

				  <li>anv: Set STATE_BASE_ADDRESS upper bounds on gen7</li>

				</ul>

				<p>Kenneth Graunke (1):</p>

				<ul>

				  <li>glsl: Fix out of bounds read in shader_cache_read_program_metadata</li>

				</ul>

				<p>Kevin Strasser (2):</p>

				<ul>

				  <li>gallium/winsys/kms: Fix dumb buffer bpp</li>

				  <li>st/mesa: Add rgbx handling for fp formats</li>

				</ul>

				<p>Lionel Landwerlin (2):</p>

				<ul>

				  <li>intel/perf: fix EuThreadsCount value in performance equations</li>

				  <li>intel/perf: improve dynamic loading config detection</li>

				</ul>

				<p>Mathias Fröhlich (1):</p>

				<ul>

				  <li>egl: Don't add hardware device if there is no render node v2.</li>

				</ul>

				<p>Nanley Chery (1):</p>

				<ul>

				  <li>anv/cmd_buffer: Initalize the clear color struct for CNL+</li>

				</ul>

				<p>Nataraj Deshpande (1):</p>

				<ul>

				  <li>anv: Fix check for isl_fmt in assert</li>

				</ul>

				<p>Samuel Pitoiset (5):</p>

				<ul>

				  <li>radv: fix alpha-to-coverage when there is unused color attachments</li>

				  <li>radv: fix setting CB_SHADER_MASK for dual source blending</li>

				  <li>radv: fix occlusion queries on VegaM</li>

				  <li>radv: fix VK_EXT_memory_budget if one heap isn't available</li>

				  <li>radv: fix FMASK expand with SRGB formats</li>

				</ul>

				</div>

				</body>

				</html>

									
										62

docs/relnotes/19.0.8.html
									
										Normal file
									
												View File
												
				@@ -0,0 +1,62 @@

				<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

				<html lang="en">

				<head>

				  <meta http-equiv="content-type" content="text/html; charset=utf-8">

				  <title>Mesa Release Notes</title>

				  <link rel="stylesheet" type="text/css" href="../mesa.css">

				</head>

				<body>

				<div class="header">

				  <h1>The Mesa 3D Graphics Library</h1>

				</div>

				<iframe src="../contents.html"></iframe>

				<div class="content">

				<h1>Mesa 19.0.8 Release Notes / June 26, 2019</h1>

				<p>

				Mesa 19.0.8 is an emergency bug fix release which fixes a critical bug found in the 19.0.7 release.

				</p>

				<p>

				Mesa 19.0.8 implements the OpenGL 4.5 API, but the version reported by

				glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /

				glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.

				Some drivers don't support all the features required in OpenGL 4.5.  OpenGL

				4.5 is <strong>only</strong> available if requested at context creation.

				Compatibility contexts may report a lower version depending on each driver.

				</p>

				<h2>SHA256 checksums</h2>

				<pre>

				1a3dc3f2af853c76aadb4a1e03c9ba420361c04a742d457a702b781671a96a57  mesa-19.0.8.tar.gz

				d017eb53a810c32dabeedf6ca2238ae1e897ce9090e470e9ce1d6c9e3f1b0862  mesa-19.0.8.tar.xz

				</pre>

				<h2>New features</h2>

				<p>N/A</p>

				<h2>Bug fixes</h2>

				<p>None</p>

				<h2>Changes</h2>

				<p>Dylan Baker (2):</p>

				<ul>

				  <li>docs: Add SHA256 sums for 19.0.7</li>

				  <li>version: bump to 19.0.8</li>

				</ul>

				<p>Kenneth Graunke (1):</p>

				<ul>

				  <li>egl/x11: calloc dri2_surf so it's properly zeroed</li>

				</ul>

				</div>

				</body>

				</html>

									
										39

docs/repository.html
									
												View File
												
				@@ -35,9 +35,9 @@ You may access the repository either as an

				<p>

				You may also 

				<a href="https://cgit.freedesktop.org/mesa/mesa/"

				<a href="https://gitlab.freedesktop.org/mesa/mesa"

				>browse the main Mesa git repository</a> and the

				<a href="https://cgit.freedesktop.org/mesa/demos"

				<a href="https://gitlab.freedesktop.org/mesa/demos"

				>Mesa demos and tests git repository</a>.

				</p>

				@@ -52,7 +52,7 @@ To get the Mesa sources anonymously (read-only):

				<li>Install the git software on your computer if needed.<br><br>

				<li>Get an initial, local copy of the repository with:

				    <pre>

				    git clone git://anongit.freedesktop.org/git/mesa/mesa

				    git clone https://gitlab.freedesktop.org/mesa/mesa.git

				    </pre>

				<li>Later, you can update your tree from the master repository with:

				    <pre>

				@@ -60,7 +60,7 @@ To get the Mesa sources anonymously (read-only):

				    </pre>

				<li>If you also want the Mesa demos/tests repository:

				    <pre>

				    git clone git://anongit.freedesktop.org/git/mesa/demos

				    git clone https://gitlab.freedesktop.org/mesa/demos.git

				    </pre>

				</ol>

				@@ -98,24 +98,17 @@ on a particular driver, add a new extension, etc.) in the bugzilla record.

				</ol>

				<p>

				Once your account is established:

				</p>

				Once your account is established, you can update your push url to use SSH:

				<pre>

				git remote set-url --push <em>origin</em> git@gitlab.freedesktop.org:mesa/mesa.git

				</pre>

				<ol>

				<li>Get an initial, local copy of the repository with:

				    <pre>

				    git clone git+ssh://username@git.freedesktop.org/git/mesa/mesa

				    </pre>

				    Replace <em>username</em> with your actual login name.<br><br>

				<li>Later, you can update your tree from the master repository with:

				    <pre>

				    git pull origin

				    </pre>

				<li>If you also want the Mesa demos/tests repository:

				    <pre>

				    git clone git+ssh://username@git.freedesktop.org/git/mesa/demos

				    </pre>

				</ol>

				You can also use <a href="https://gitlab.freedesktop.org/profile/personal_access_tokens">personal access tokens</a>

				to push over HTTPS instead (useful for people behind strict proxies).

				In this case, create a token, and put it in the url as shown here:

				<pre>

				git remote set-url --push <em>origin</em> https://<em>USER</em>:<em>TOKEN</em>@gitlab.freedesktop.org/mesa/mesa.git

				</pre>

				<h2>Windows Users</h2>

				@@ -149,12 +142,12 @@ code while a branch has the latest stable code.

				</p>

				<p>

				The command <code>git-branch</code> will list all available branches.

				The command <code>git branch</code> will list all available branches.

				</p>

				<p>

				Questions about branch status/activity should be posted to the

				mesa3d-dev mailing list.

				mesa-dev mailing list.

				</p>

				<h2>Developer Git Tips</h2>

									
										3

docs/shading.html
									
												View File
												
				@@ -59,6 +59,7 @@ execution.  These are generally used for debugging.

				<li><b>nopfrag</b> - force fragment shader to be a simple shader that passes

				    through the color attribute.

				<li><b>useprog</b> - log glUseProgram calls to stderr

				<li><b>errors</b> - GLSL compilation and link errors will be reported to stderr.

				</ul>

				<p>

				Example:  export MESA_GLSL=dump,nopt

				@@ -85,7 +86,7 @@ should match the filenames of the corresponding dumped shaders.

				<p>

				Setting <b>MESA_SHADER_CAPTURE_PATH</b> to a directory will cause the compiler

				to write <tt>.shader_test</tt> files for use with

				<a href="https://cgit.freedesktop.org/mesa/shader-db">shader-db</a>, a tool

				<a href="https://gitlab.freedesktop.org/mesa/shader-db">shader-db</a>, a tool

				which compiler developers can use to gather statistics about shaders

				(instructions, cycles, memory accesses, and so on).

				</p>

									
										2

docs/sourcedocs.html
									
												View File
												
				@@ -31,7 +31,7 @@ the <code>doxygen</code> directory and run <code>make</code>.

				<p>

				For an example of Doxygen usage in Mesa, see a recent source file

				such as <a href="https://cgit.freedesktop.org/mesa/mesa/tree/src/mesa/main/bufferobj.c">bufferobj.c</a>.

				such as <a href="https://gitlab.freedesktop.org/mesa/mesa/blob/master/src/mesa/main/bufferobj.c">bufferobj.c</a>.

				</p>

82

docs/specs/EGL_MESA_device_software.txt Normal file

View File

@@ -0,0 +1,82 @@
 Name
     MESA_device_software
 Name Strings
     EGL_MESA_device_software
 Contributors
     Adam Jackson <ajax@redhat.com>
     Emil Velikov <emil.velikov@collabora.com>
 Contacts
     Adam Jackson <ajax@redhat.com>
 Status
     DRAFT
 Version
     Version 2, 2018-10-03
 Number
     EGL Extension #TODO
 Extension Type
     EGL device extension
 Dependencies
     Requires EGL_EXT_device_query.
     This extension is written against the EGL 1.5 Specification.
 Overview
     This extension defines a software EGL "device". The device is not backed by
     any actual device node and simply renders into client memory.
     By defining this as an extension, EGL_EXT_device_enumeration is able to
     sanely enumerate a software device.
 New Types
     None
 New Procedures and Functions
     None
 New Tokens
     None
 Additions to the EGL Specification
     None
 New Behavior
     The device list produced by eglQueryDevicesEXT will include a software
     device. This can be distinguished from other device classes in the usual
     way by calling eglQueryDeviceStringEXT(EGL_EXTENSIONS) and matching this
     extension's string in the result.
 Issues
     None
 Revision History
     Version 2, 2018-10-03 (Emil Velikov)
         - Drop "fallback" from "software fallback device"
         - Add Emil Velikov as contributor
     Version 1, 2017-07-06 (Adam Jackson)
         - Initial version

95

docs/specs/EGL_MESA_query_driver.txt Normal file

View File

@@ -0,0 +1,95 @@
 Name
     MESA_query_driver
 Name Strings
     EGL_MESA_query_driver
 Contact
     Rob Clark      <robdclark 'at' gmail.com>
     Nicolai Hähnle <Nicolai.Haehnle 'at' amd.com>
 Contibutors
     Veluri Mithun <velurimithun38 'at' gmail.com>
 Status
     Complete
 Version
     Version 3, 2019-01-24
 Number
     EGL Extension 131
 Dependencies
     EGL 1.0 is required.
 Overview
     When an application has to query the name of a driver and for
     obtaining driver's option list (UTF-8 encoded XML) of a driver
     the below functions are useful.
     XML file formally describes all available options and also
     includes verbal descriptions in multiple languages. Its main purpose
     is to be automatically processed by configuration GUIs.
     The XML shall respect the following DTD:
     <!ELEMENT driinfo      (section*)>
     <!ELEMENT section      (description+, option+)>
     <!ELEMENT description  (enum*)>
     <!ATTLIST description  lang CDATA #REQUIRED
                            text CDATA #REQUIRED>
     <!ELEMENT option       (description+)>
     <!ATTLIST option       name CDATA #REQUIRED
                            type (bool|enum|int|float) #REQUIRED
                            default CDATA #REQUIRED
                            valid CDATA #IMPLIED>
     <!ELEMENT enum         EMPTY>
     <!ATTLIST enum         value CDATA #REQUIRED
                            text CDATA #REQUIRED>
 New Procedures and Functions
     char* eglGetDisplayDriverConfig(EGLDisplay dpy);
     const char* eglGetDisplayDriverName(EGLDisplay dpy);
 Description
     By passing EGLDisplay as parameter to `eglGetDisplayDriverName` one can retrieve
     driverName. Similarly passing EGLDisplay to `eglGetDisplayDriverConfig` we can retrieve
     driverConfig options of the driver in XML format.
     The string returned by `eglGetDisplayDriverConfig` is heap-allocated and caller
     is responsible for freeing it.
     EGL_BAD_DISPLAY is generated if `disp` is not an EGL display connection.
     EGL_NOT_INITIALIZED is generated if `disp` has not been initialized.
     If the implementation does not have enough resources to allocate the XML then an
     EGL_BAD_ALLOC error is generated.
 New Tokens
     No new tokens
 Issues
     None
 Revision History
     Version 1, 2018-11-05 - First draft (Veluri Mithun)
     Version 2, 2019-01-23 - Final version (Veluri Mithun)
     Version 3, 2019-01-24 - Mark as complete, add Khronos extension
                             number, fix parameter name in prototypes,
                             write revision history (Eric Engestrom)

200

docs/specs/INTEL_shader_atomic_float_minmax.txt Normal file

View File

@@ -0,0 +1,200 @@
 Name
     INTEL_shader_atomic_float_minmax
 Name Strings
     GL_INTEL_shader_atomic_float_minmax
 Contact
     Ian Romanick (ian . d . romanick 'at' intel . com)
 Contributors
 Status
     In progress
 Version
     Last Modified Date: 06/22/2018
     Revision: 4
 Number
     TBD
 Dependencies
     OpenGL 4.2, OpenGL ES 3.1, ARB_shader_storage_buffer_object, or
     ARB_compute_shader is required.
     This extension is written against version 4.60 of the OpenGL Shading
     Language Specification.
 Overview
     This extension provides GLSL built-in functions allowing shaders to
     perform atomic read-modify-write operations to floating-point buffer
     variables and shared variables.  Minimum, maximum, exchange, and
     compare-and-swap are enabled.
 New Procedures and Functions
     None.
 New Tokens
     None.
 IP Status
     None.
 Modifications to the OpenGL Shading Language Specification, Version 4.60
     Including the following line in a shader can be used to control the
     language features described in this extension:
       #extension GL_INTEL_shader_atomic_float_minmax : <behavior>
     where <behavior> is as specified in section 3.3.
     New preprocessor #defines are added to the OpenGL Shading Language:
       #define GL_INTEL_shader_atomic_float_minmax   1
 Additions to Chapter 8 of the OpenGL Shading Language Specification
 (Built-in Functions)
     Modify Section 8.11, "Atomic Memory Functions"
     (add a new row after the existing "atomicMin" table row, p. 179)
         float atomicMin(inout float mem, float data)
         Computes a new value by taking the minimum of the value of data and
         the contents of mem.  If one of these is an IEEE signaling NaN (i.e.,
         a NaN with the most-significant bit of the mantissa cleared), it is
         always considered smaller.  If one of these is an IEEE quiet NaN
         (i.e., a NaN with the most-significant bit of the mantissa set), it is
         always considered larger.  If both are IEEE quiet NaNs or both are
         IEEE signaling NaNs, the result of the comparison is undefined.
     (add a new row after the exiting "atomicMax" table row, p. 179)
         float atomicMax(inout float mem, float data)
         Computes a new value by taking the maximum of the value of data and
         the contents of mem.  If one of these is an IEEE signaling NaN (i.e.,
         a NaN with the most-significant bit of the mantissa cleared), it is
         always considered larger.  If one of these is an IEEE quiet NaN (i.e.,
         a NaN with the most-significant bit of the mantissa set), it is always
         considered smaller.  If both are IEEE quiet NaNs or both are IEEE
         signaling NaNs, the result of the comparison is undefined.
     (add to "atomicExchange" table cell, p. 180)
         float atomicExchange(inout float mem, float data)
     (add to "atomicCompSwap" table cell, p. 180)
         float atomicCompSwap(inout float mem, float compare, float data)
 Interactions with OpenGL 4.6 and ARB_gl_spirv
     If OpenGL 4.6 or ARB_gl_spirv is supported, then
     SPV_INTEL_shader_atomic_float_minmax must also be supported.
     The AtomicFloatMinmaxINTEL capability is available whenever the OpenGL or
     OpenGL ES implementation supports INTEL_shader_atomic_float_minmax.
 Issues
 ) Why call this extension INTEL_shader_atomic_float_minmax?
     RESOLVED: Several other extensions already set the precedent of
     VENDOR_shader_atomic_float and VENDOR_shader_atomic_float64 for extensions
     that enable floating-point atomic operations.  Using that as a base for
     the name seems logical.
     There already exists NV_shader_atomic_float, but the two extensions have
     nearly zero overlap in functionality.  NV_shader_atomic_float adds
     atomicAdd and image atomic operations that currently shipping Intel GPUs
     do not support.  Calling this extension INTEL_shader_atomic_float would
     likely have been confusing.
     Adding something to describe the actual functions added by this extension
     seemed reasonable.  INTEL_shader_atomic_float_compare was considered, but
     that name was deemed to be not properly descriptive.  Calling this
     extension INTEL_shader_atomic_float_min_max_exchange_compswap is right
     out.
 ) What atomic operations should we support for floating-point targets?
     RESOLVED.  Exchange, min, max, and compare-swap make sense, and these are
     all supported by the hardware.  Future extensions may add other functions.
     For buffer variables and shared variables it is not possible to bit-cast
     the memory location in GLSL, so existing integer operations, such as
     atomicOr, cannot be used.  However, the underlying hardware implementation
     can do this by treating the memory as an integer.  It would be possible to
     implement atomicNegate using this technique with atomicXor.  It is unclear
     whether this provides any actual utility.
 ) What should be said about the NaN behavior?
     RESOLVED.  There are several aspects of NaN behavior that should be
     documented in this extension.  However, some of this behavior varies based
     on NaN concepts that do not exist in the GLSL specification.
     * atomicCompSwap performs the comparison as the floating-point equality
       operator (==).  That is, if either 'mem' or 'compare' is NaN, the
       comparison result is always false.
     * atomicMin and atomicMax implement the IEEE specification with respect to
       NaN.  IEEE considers two different kinds of NaN: signaling NaN and quiet
       NaN.  A quiet NaN has the most significant bit of the mantissa set, and
       a signaling NaN does not.  This concept does not exist in SPIR-V,
       Vulkan, or OpenGL.  Let qNaN denote a quiet NaN and sNaN denote a
       signaling NaN.  atomicMin and atomicMax specifically implement
       - fmin(qNaN, x) = fmin(x, qNaN) = fmax(qNaN, x) = fmax(x, qNaN) = x
       - fmin(sNaN, x) = fmin(x, sNaN) = fmax(sNaN, x) = fmax(x, sNaN) = sNaN
       - fmin(sNaN, qNaN) = fmin(qNaN, sNaN) = fmax(sNaN, qNaN) =
         fmax(qNaN, sNaN) = sNaN
       - fmin(sNaN, sNaN) = sNaN.  This specification does not define which of
         the two arguments is stored.
       - fmax(sNaN, sNaN) = sNaN.  This specification does not define which of
         the two arguments is stored.
       - fmin(qNaN, qNaN) = qNaN.  This specification does not define which of
         the two arguments is stored.
       - fmax(qNaN, qNaN) = qNaN.  This specification does not define which of
         the two arguments is stored.
     Further details are available in the Skylake Programmer's Reference
     Manuals available at
     https://01.org/linuxgraphics/documentation/hardware-specification-prms.
 ) What about atomicMin and atomicMax with (+0.0, -0.0) or (-0.0, +0.0)
     arguments?
     RESOLVED.  atomicMin should store -0.0, and atomicMax should store +0.0.
     Due to a known issue in shipping Skylake GPUs, the incorrectly signed 0 is
     stored.  This behavior may change in later GPUs.
 Revision History
     Rev  Date        Author    Changes
     ---  ----------  --------  ---------------------------------------------
 04/19/2018  idr       Initial version
 05/05/2018  idr       Describe interactions with the capabilities
                                added by SPV_INTEL_shader_atomic_float_minmax.
 05/29/2018  idr       Remove mention of 64-bit float support.
 06/22/2018  idr       Resolve issue #2.
                                Add issue #3 (regarding NaN behavior).
                                Add issue #4 (regarding atomicMin(-0, +0).

									
										43

docs/specs/MESA_query_renderer.spec
									
												View File
												
				@@ -20,11 +20,11 @@ Status

				Version

				    Version 8, 14-February-2014

				    Version 9, 09 November 2018

				Number

				    TBD.

				    OpenGL Extension #446

				Dependencies

				@@ -32,9 +32,6 @@ Dependencies

				    GLX_ARB_create_context and GLX_ARB_create_context_profile are required.

				    This extension interacts with GLX_EXT_create_context_es2_profile and

				    GLX_EXT_create_context_es_profile.

				Overview

				    In many situations, applications want to detect characteristics of a

				@@ -95,18 +92,13 @@ New Tokens

				        GLX_RENDERER_VENDOR_ID_MESA

				        GLX_RENDERER_DEVICE_ID_MESA

				    Accepted as an attribute name in <*attrib_list> in

				    glXCreateContextAttribsARB:

				        GLX_RENDERER_ID_MESA                             0x818E

				Additions to the OpenGL / WGL Specifications

				    None. This specification is written for GLX.

				Additions to the GLX 1.4 Specification

				    [Add the following to Section X.Y.Z of the GLX Specification]

				    [Add to Section 3.3.2 "GLX Versioning" of the GLX Specification]

				    To obtain information about the available renderers for a particular

				    display and screen,

				@@ -206,29 +198,6 @@ Additions to the GLX 1.4 Specification

				    format as the string that would be returned by glGetString of GL_RENDERER.

				    It may, however, have a different value.

				    [Add to section section 3.3.7 "Rendering Contexts"]

				    The attribute name GLX_RENDERER_ID_MESA specified the index of the render

				    against which the context should be created.  The default value of

				    GLX_RENDERER_ID_MESA is 0.

				    [Add to list of errors for glXCreateContextAttribsARB in section section

				    3.3.7 "Rendering Contexts"]

				      * If the value of GLX_RENDERER_ID_MESA specifies a non-existent

				        renderer, BadMatch is generated.

				Dependencies on GLX_EXT_create_context_es_profile and

				GLX_EXT_create_context_es2_profile

				    If neither extension is supported, remove all mention of

				    GLX_RENDERER_OPENGL_ES2_PROFILE_VERSION_MESA from the spec.

				    If GLX_EXT_create_context_es_profile is not supported, remove all mention of

				    GLX_RENDERER_OPENGL_ES_PROFILE_VERSION_MESA from the spec.

				Issues

				    1) How should the difference between on-card and GART memory be exposed?

				@@ -408,3 +377,9 @@ Revision History

				                            read GLX_RENDERER_ID_MESA. The VENDOR/DEVICE_ID

				                            example given in issue #17 should be 0x5143 and

				                            0xFFFFFFFF respectively.

				    Version 9, 2018/11/09 - Remove GLX_RENDERER_ID_MESA, which has never been

				                            implemented. Remove the unnecessary interactions

				                            with the GLX GLES profile extensions. Note the

				                            official GL extension number. Specify the section

				                            of the GLX spec to modify.

									
										106

docs/submittingpatches.html
									
												View File
												
				@@ -21,7 +21,7 @@

				<li><a href="#guidelines">Basic guidelines</a>

				<li><a href="#formatting">Patch formatting</a>

				<li><a href="#testing">Testing Patches</a>

				<li><a href="#mailing">Mailing Patches</a>

				<li><a href="#submit">Submitting Patches</a>

				<li><a href="#reviewing">Reviewing Patches</a>

				<li><a href="#nominations">Nominating a commit for a stable branch</a>

				<li><a href="#criteria">Criteria for accepting patches to the stable branch</a>

				@@ -42,8 +42,10 @@ components.

				<code>git bisect</code>.)

				<li>Patches should be properly <a href="#formatting">formatted</a>.

				<li>Patches should be sufficiently <a href="#testing">tested</a> before submitting.

				<li>Patches should be submitted to <a href="#mailing">mesa-dev</a>

				for <a href="#reviewing">review</a> using <code>git send-email</code>.

				<li>Patches should be <a href="#submit">submitted</a>

				to <a href="#mailing">mesa-dev</a> or with

				a <a href="#merge-request">merge request</a>

				for <a href="#reviewing">review</a>.

				</ul>

				@@ -156,18 +158,29 @@ As mentioned at the begining, patches should be bisectable.

				A good way to test this is to make use of the `git rebase` command,

				to run your tests on each commit. Assuming your branch is based off

				<code>origin/master</code>, you can run:

				</p>

				<pre>

				$ git rebase --interactive --exec "make check" origin/master

				</pre>

				<p>

				replacing <code>"make check"</code> with whatever other test you want to

				run.

				</p>

				<h2 id="mailing">Mailing Patches</h2>

				<h2 id="submit">Submitting Patches</h2>

				<p>

				Patches should be sent to the mesa-dev mailing list for review:

				Patches may be submitted to the Mesa project by

				<a href="#mailing">email</a> or with a

				GitLab <a href="#merge-request">merge request</a>. To prevent

				duplicate code review, only use one method to submit your changes.

				</p>

				<h3 id="mailing">Mailing Patches</h3>

				<p>

				Patches may be sent to the mesa-dev mailing list for review:

				<a href="https://lists.freedesktop.org/mailman/listinfo/mesa-dev">

				mesa-dev@lists.freedesktop.org</a>.

				When submitting a patch make sure to use

				@@ -201,8 +214,65 @@ disabled before sending your patches. (Note that you may need to contact

				your email administrator for this.)

				</p>

				<h3 id="merge-request">GitLab Merge Requests</h3>

				<p>

				  <a href="https://gitlab.freedesktop.org/mesa/mesa">GitLab</a> Merge

				  Requests (MR) can also be used to submit patches for Mesa.

				</p>

				<p>

				  If the MR may have interest for most of the Mesa community, you can

				  send an email to the mesa-dev email list including a link to the MR.

				  Don't send the patch to mesa-dev, just the MR link.

				</p>

				<p>

				  Add labels to your MR to help reviewers find it. For example:

				  <ul>

				    <li>Mesa changes affecting all drivers: mesa

				    <li>Hardware vendor specific code: amd, intel, nvidia, ...

				    <li>Driver specific code: anvil, freedreno, i965, iris, radeonsi,

				      radv, vc4, ...

				    <li>Other tag examples: gallium, util

				  </ul>

				</p>

				<p>

				  If you revise your patches based on code review and push an update

				  to your branch, you should maintain a <strong>clean</strong> history

				  in your patches. There should not be "fixup" patches in the history.

				  The series should be buildable and functional after every commit

				  whenever you push the branch.

				</p>

				<p>

				  It is your responsibility to keep the MR alive and making progress,

				  as there are no guarantees that a Mesa dev will independently take

				  interest in it.

				</p>

				<p>

				  Some other notes:

				  <ul>

				    <li>Make changes and update your branch based on feedback

				    <li>Old, stale MR may be closed, but you can reopen it if you

				      still want to pursue the changes

				    <li>You should periodically check to see if your MR needs to be

				      rebased

				    <li>Make sure your MR is closed if your patches get pushed outside

				      of GitLab

				    <li>Please send MRs from a personal fork rather than from the main

				      Mesa repository, as it clutters it unnecessarily.

				  </ul>

				</p>

				<h2 id="reviewing">Reviewing Patches</h2>

				<p>

				  To participate in code review, you should monitor the

				  <a href="https://lists.freedesktop.org/mailman/listinfo/mesa-dev">

				  mesa-dev</a> email list and the GitLab

				  Mesa <a href="https://gitlab.freedesktop.org/mesa/mesa/merge_requests">Merge

				  Requests</a> page.

				</p>

				<p>

				When you've reviewed a patch on the mailing list, please be unambiguous

				about your review.  That is, state either

				@@ -229,6 +299,29 @@ which tells the patch author that the patch can be committed, as long

				as the issues are resolved first.

				</p>

				<p>

				These Reviewed-by, Acked-by, and Tested-by tags should also be amended

				into commits in a MR before it is merged.

				</p>

				<p>

				When providing a Reviewed-by, Acked-by, or Tested-by tag in a gitlab MR,

				enclose the tag in backticks:

				</p>

				<pre>

				  `Reviewed-by: Joe Hacker &lt;jhacker@example.com&gt;`</pre>

				<p>

				This is the markdown format for literal, and will prevent gitlab from hiding

				the &lt; and &gt; symbols.

				</p>

				<p>

				Review by non-experts is encouraged.  Understanding how someone else

				goes about solving a problem is a great way to learn your way around

				the project.  The submitter is expected to evaluate whether they have

				an appropriate amount of review feedback from people who also

				understand the code before merging their patches.

				</p>

				<h2 id="nominations">Nominating a commit for a stable branch</h2>

				@@ -251,6 +344,9 @@ If you are not the author of the original patch, please Cc: them in your

				nomination request.

				</p>

				<p>

				The current patch status can be observed in the <a href="releasing.html#stagingbranch">staging branch</a>.

				</p>

				<h3 id="thetag">The stable tag</h3>

									
										2

docs/utilities.html
									
												View File
												
				@@ -17,7 +17,7 @@

				<h1>Development Utilities</h1>

				<dl>

				  <dt><a href="https://cgit.freedesktop.org/mesa/demos">Mesa demos collection</a></dt>

				  <dt><a href="https://gitlab.freedesktop.org/mesa/demos">Mesa demos collection</a></dt>

				  <dd>includes several utility routines in the <code>src/util/</code>

				  directory.</dd>

									
										21

docs/vmware-guest.html
									
												View File
												
				@@ -43,6 +43,23 @@ This requires:

				Otherwise, OpenGL 2.1 is supported.

				</p>

				<p>

				With the Fall 2018 Workstation 15 / Fusion 11 releases, additional

				features are supported in the driver:

				<ul>

				<li>Multisample antialiasing (2x, 4x)

				<li>GL_ARB/AMD_draw_buffers_blend

				<li>GL_ARB_sample_shading

				<li>GL_ARB_texture_cube_map_array

				<li>GL_ARB_texture_gather

				<li>GL_ARB_texture_query_lod

				<li>GL_EXT/OES_draw_buffers_indexed

				</ul>

				<p>

				This requires version 2.15.0 or later of the vmwgfx kernel module and

				the VM must be configured for hardware version 16 or later.

				</p>

				<p>

				OpenGL 3.3 support can be disabled by setting the environment variable

				SVGA_VGPU10=0.

				@@ -126,7 +143,7 @@ Begin by saving your current directory location:

				<ul>

				<li>Mesa/Gallium master branch. This code is used to build libGL, and the direct rendering svga driver for libGL, vmwgfx_dri.so, and the X acceleration library libxatracker.so.x.x.x. 

				  <pre>

				  git clone git://anongit.freedesktop.org/git/mesa/mesa

				  git clone https://gitlab.freedesktop.org/mesa/mesa.git

				  </pre>

				<li>VMware Linux guest kernel module. Note that this repo contains the complete DRM and TTM code. The vmware-specific driver is really only the files prefixed with vmwgfx. 

				  <pre>

				@@ -136,7 +153,7 @@ Begin by saving your current directory location:

				Most distros ship with this but it's safest to install a newer version.

				To get the latest code from git:

				  <pre>

				  git clone git://anongit.freedesktop.org/git/mesa/drm

				  git clone https://gitlab.freedesktop.org/mesa/drm.git

				  </pre>

				<li>xf86-video-vmware. The chainloading driver, vmware_drv.so, the legacy driver vmwlegacy_drv.so, and the vmwgfx driver vmwgfx_drv.so. 

				  <pre>

									
										8

include/EGL/egl.h
									
												View File
												
				@@ -28,17 +28,17 @@ extern "C" {

				** MATERIALS OR THE USE OR OTHER DEALINGS IN THE MATERIALS.

				*/

				/*

				** This header is generated from the Khronos OpenGL / OpenGL ES XML

				** API Registry. The current version of the Registry, generator scripts

				** This header is generated from the Khronos EGL XML API Registry.

				** The current version of the Registry, generator scripts

				** used to make the header, and the header can be found at

				**   http://www.khronos.org/registry/egl

				**

				** Khronos $Git commit SHA1: a732b061e7 $ on $Git commit date: 2017-06-17 23:27:53 +0100 $

				** Khronos $Git commit SHA1: 9ed2ec4c67 $ on $Git commit date: 2019-01-09 17:54:35 -0800 $

				*/

				#include <EGL/eglplatform.h>

				/* Generated on date 20170627 */

				/* Generated on date 20190124 */

				/* Generated C header for:

				 * API: egl

									
										106

include/EGL/eglext.h
									
												View File
												
				@@ -28,17 +28,17 @@ extern "C" {

				** MATERIALS OR THE USE OR OTHER DEALINGS IN THE MATERIALS.

				*/

				/*

				** This header is generated from the Khronos OpenGL / OpenGL ES XML

				** API Registry. The current version of the Registry, generator scripts

				** This header is generated from the Khronos EGL XML API Registry.

				** The current version of the Registry, generator scripts

				** used to make the header, and the header can be found at

				**   http://www.khronos.org/registry/egl

				**

				** Khronos $Git commit SHA1: a732b061e7 $ on $Git commit date: 2017-06-17 23:27:53 +0100 $

				** Khronos $Git commit SHA1: 9ed2ec4c67 $ on $Git commit date: 2019-01-09 17:54:35 -0800 $

				*/

				#include <EGL/eglplatform.h>

				#define EGL_EGLEXT_VERSION 20170627

				#define EGL_EGLEXT_VERSION 20190124

				/* Generated C header for:

				 * API: egl

				@@ -495,6 +495,47 @@ EGLAPI EGLClientBuffer EGLAPIENTRY eglCreateNativeClientBufferANDROID (const EGL

				#define EGL_FRONT_BUFFER_AUTO_REFRESH_ANDROID 0x314C

				#endif /* EGL_ANDROID_front_buffer_auto_refresh */

				#ifndef EGL_ANDROID_get_frame_timestamps

				#define EGL_ANDROID_get_frame_timestamps 1

				typedef khronos_stime_nanoseconds_t EGLnsecsANDROID;

				#define EGL_TIMESTAMP_PENDING_ANDROID     EGL_CAST(EGLnsecsANDROID,-2)

				#define EGL_TIMESTAMP_INVALID_ANDROID     EGL_CAST(EGLnsecsANDROID,-1)

				#define EGL_TIMESTAMPS_ANDROID            0x3430

				#define EGL_COMPOSITE_DEADLINE_ANDROID    0x3431

				#define EGL_COMPOSITE_INTERVAL_ANDROID    0x3432

				#define EGL_COMPOSITE_TO_PRESENT_LATENCY_ANDROID 0x3433

				#define EGL_REQUESTED_PRESENT_TIME_ANDROID 0x3434

				#define EGL_RENDERING_COMPLETE_TIME_ANDROID 0x3435

				#define EGL_COMPOSITION_LATCH_TIME_ANDROID 0x3436

				#define EGL_FIRST_COMPOSITION_START_TIME_ANDROID 0x3437

				#define EGL_LAST_COMPOSITION_START_TIME_ANDROID 0x3438

				#define EGL_FIRST_COMPOSITION_GPU_FINISHED_TIME_ANDROID 0x3439

				#define EGL_DISPLAY_PRESENT_TIME_ANDROID  0x343A

				#define EGL_DEQUEUE_READY_TIME_ANDROID    0x343B

				#define EGL_READS_DONE_TIME_ANDROID       0x343C

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLGETCOMPOSITORTIMINGSUPPORTEDANDROIDPROC) (EGLDisplay dpy, EGLSurface surface, EGLint name);

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLGETCOMPOSITORTIMINGANDROIDPROC) (EGLDisplay dpy, EGLSurface surface, EGLint numTimestamps,  const EGLint *names, EGLnsecsANDROID *values);

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLGETNEXTFRAMEIDANDROIDPROC) (EGLDisplay dpy, EGLSurface surface, EGLuint64KHR *frameId);

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLGETFRAMETIMESTAMPSUPPORTEDANDROIDPROC) (EGLDisplay dpy, EGLSurface surface, EGLint timestamp);

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLGETFRAMETIMESTAMPSANDROIDPROC) (EGLDisplay dpy, EGLSurface surface, EGLuint64KHR frameId, EGLint numTimestamps,  const EGLint *timestamps, EGLnsecsANDROID *values);

				#ifdef EGL_EGLEXT_PROTOTYPES

				EGLAPI EGLBoolean EGLAPIENTRY eglGetCompositorTimingSupportedANDROID (EGLDisplay dpy, EGLSurface surface, EGLint name);

				EGLAPI EGLBoolean EGLAPIENTRY eglGetCompositorTimingANDROID (EGLDisplay dpy, EGLSurface surface, EGLint numTimestamps,  const EGLint *names, EGLnsecsANDROID *values);

				EGLAPI EGLBoolean EGLAPIENTRY eglGetNextFrameIdANDROID (EGLDisplay dpy, EGLSurface surface, EGLuint64KHR *frameId);

				EGLAPI EGLBoolean EGLAPIENTRY eglGetFrameTimestampSupportedANDROID (EGLDisplay dpy, EGLSurface surface, EGLint timestamp);

				EGLAPI EGLBoolean EGLAPIENTRY eglGetFrameTimestampsANDROID (EGLDisplay dpy, EGLSurface surface, EGLuint64KHR frameId, EGLint numTimestamps,  const EGLint *timestamps, EGLnsecsANDROID *values);

				#endif

				#endif /* EGL_ANDROID_get_frame_timestamps */

				#ifndef EGL_ANDROID_get_native_client_buffer

				#define EGL_ANDROID_get_native_client_buffer 1

				struct AHardwareBuffer;

				typedef EGLClientBuffer (EGLAPIENTRYP PFNEGLGETNATIVECLIENTBUFFERANDROIDPROC) (const struct AHardwareBuffer *buffer);

				#ifdef EGL_EGLEXT_PROTOTYPES

				EGLAPI EGLClientBuffer EGLAPIENTRY eglGetNativeClientBufferANDROID (const struct AHardwareBuffer *buffer);

				#endif

				#endif /* EGL_ANDROID_get_native_client_buffer */

				#ifndef EGL_ANDROID_image_native_buffer

				#define EGL_ANDROID_image_native_buffer 1

				#define EGL_NATIVE_BUFFER_ANDROID         0x3140

				@@ -514,7 +555,6 @@ EGLAPI EGLint EGLAPIENTRY eglDupNativeFenceFDANDROID (EGLDisplay dpy, EGLSyncKHR

				#ifndef EGL_ANDROID_presentation_time

				#define EGL_ANDROID_presentation_time 1

				typedef khronos_stime_nanoseconds_t EGLnsecsANDROID;

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLPRESENTATIONTIMEANDROIDPROC) (EGLDisplay dpy, EGLSurface surface, EGLnsecsANDROID time);

				#ifdef EGL_EGLEXT_PROTOTYPES

				EGLAPI EGLBoolean EGLAPIENTRY eglPresentationTimeANDROID (EGLDisplay dpy, EGLSurface surface, EGLnsecsANDROID time);

				@@ -578,6 +618,16 @@ EGLAPI EGLBoolean EGLAPIENTRY eglQuerySurfacePointerANGLE (EGLDisplay dpy, EGLSu

				#define EGL_EXT_client_extensions 1

				#endif /* EGL_EXT_client_extensions */

				#ifndef EGL_EXT_client_sync

				#define EGL_EXT_client_sync 1

				#define EGL_SYNC_CLIENT_EXT               0x3364

				#define EGL_SYNC_CLIENT_SIGNAL_EXT        0x3365

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLCLIENTSIGNALSYNCEXTPROC) (EGLDisplay dpy, EGLSync sync, const EGLAttrib *attrib_list);

				#ifdef EGL_EGLEXT_PROTOTYPES

				EGLAPI EGLBoolean EGLAPIENTRY eglClientSignalSyncEXT (EGLDisplay dpy, EGLSync sync, const EGLAttrib *attrib_list);

				#endif

				#endif /* EGL_EXT_client_sync */

				#ifndef EGL_EXT_compositor

				#define EGL_EXT_compositor 1

				#define EGL_PRIMARY_COMPOSITOR_CONTEXT_EXT 0x3460

				@@ -631,6 +681,7 @@ EGLAPI EGLBoolean EGLAPIENTRY eglQueryDisplayAttribEXT (EGLDisplay dpy, EGLint a

				#ifndef EGL_EXT_device_drm

				#define EGL_EXT_device_drm 1

				#define EGL_DRM_DEVICE_FILE_EXT           0x3233

				#define EGL_DRM_MASTER_FD_EXT             0x333C

				#endif /* EGL_EXT_device_drm */

				#ifndef EGL_EXT_device_enumeration

				@@ -666,6 +717,11 @@ EGLAPI EGLBoolean EGLAPIENTRY eglQueryDisplayAttribEXT (EGLDisplay dpy, EGLint a

				#define EGL_GL_COLORSPACE_DISPLAY_P3_LINEAR_EXT 0x3362

				#endif /* EGL_EXT_gl_colorspace_display_p3_linear */

				#ifndef EGL_EXT_gl_colorspace_display_p3_passthrough

				#define EGL_EXT_gl_colorspace_display_p3_passthrough 1

				#define EGL_GL_COLORSPACE_DISPLAY_P3_PASSTHROUGH_EXT 0x3490

				#endif /* EGL_EXT_gl_colorspace_display_p3_passthrough */

				#ifndef EGL_EXT_gl_colorspace_scrgb

				#define EGL_EXT_gl_colorspace_scrgb 1

				#define EGL_GL_COLORSPACE_SCRGB_EXT       0x3351

				@@ -723,6 +779,11 @@ EGLAPI EGLBoolean EGLAPIENTRY eglQueryDmaBufModifiersEXT (EGLDisplay dpy, EGLint

				#endif

				#endif /* EGL_EXT_image_dma_buf_import_modifiers */

				#ifndef EGL_EXT_image_gl_colorspace

				#define EGL_EXT_image_gl_colorspace 1

				#define EGL_GL_COLORSPACE_DEFAULT_EXT     0x314D

				#endif /* EGL_EXT_image_gl_colorspace */

				#ifndef EGL_EXT_image_implicit_sync_control

				#define EGL_EXT_image_implicit_sync_control 1

				#define EGL_IMPORT_SYNC_TYPE_EXT          0x3470

				@@ -858,6 +919,14 @@ EGLAPI EGLBoolean EGLAPIENTRY eglSwapBuffersWithDamageEXT (EGLDisplay dpy, EGLSu

				#endif

				#endif /* EGL_EXT_swap_buffers_with_damage */

				#ifndef EGL_EXT_sync_reuse

				#define EGL_EXT_sync_reuse 1

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLUNSIGNALSYNCEXTPROC) (EGLDisplay dpy, EGLSync sync, const EGLAttrib *attrib_list);

				#ifdef EGL_EGLEXT_PROTOTYPES

				EGLAPI EGLBoolean EGLAPIENTRY eglUnsignalSyncEXT (EGLDisplay dpy, EGLSync sync, const EGLAttrib *attrib_list);

				#endif

				#endif /* EGL_EXT_sync_reuse */

				#ifndef EGL_EXT_yuv_surface

				#define EGL_EXT_yuv_surface 1

				#define EGL_YUV_ORDER_EXT                 0x3301

				@@ -962,6 +1031,16 @@ EGLAPI EGLBoolean EGLAPIENTRY eglExportDMABUFImageMESA (EGLDisplay dpy, EGLImage

				#define EGL_PLATFORM_SURFACELESS_MESA     0x31DD

				#endif /* EGL_MESA_platform_surfaceless */

				#ifndef EGL_MESA_query_driver

				#define EGL_MESA_query_driver 1

				typedef char *(EGLAPIENTRYP PFNEGLGETDISPLAYDRIVERCONFIGPROC) (EGLDisplay dpy);

				typedef const char *(EGLAPIENTRYP PFNEGLGETDISPLAYDRIVERNAMEPROC) (EGLDisplay dpy);

				#ifdef EGL_EGLEXT_PROTOTYPES

				EGLAPI char *EGLAPIENTRY eglGetDisplayDriverConfig (EGLDisplay dpy);

				EGLAPI const char *EGLAPIENTRY eglGetDisplayDriverName (EGLDisplay dpy);

				#endif

				#endif /* EGL_MESA_query_driver */

				#ifndef EGL_NOK_swap_region

				#define EGL_NOK_swap_region 1

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLSWAPBUFFERSREGIONNOKPROC) (EGLDisplay dpy, EGLSurface surface, EGLint numRects, const EGLint *rects);

				@@ -988,6 +1067,11 @@ EGLAPI EGLBoolean EGLAPIENTRY eglSwapBuffersRegion2NOK (EGLDisplay dpy, EGLSurfa

				#define EGL_AUTO_STEREO_NV                0x3136

				#endif /* EGL_NV_3dvision_surface */

				#ifndef EGL_NV_context_priority_realtime

				#define EGL_NV_context_priority_realtime 1

				#define EGL_CONTEXT_PRIORITY_REALTIME_NV  0x3357

				#endif /* EGL_NV_context_priority_realtime */

				#ifndef EGL_NV_coverage_sample

				#define EGL_NV_coverage_sample 1

				#define EGL_COVERAGE_BUFFERS_NV           0x30E0

				@@ -1055,9 +1139,9 @@ EGLAPI EGLBoolean EGLAPIENTRY eglPostSubBufferNV (EGLDisplay dpy, EGLSurface sur

				#define EGL_YUV_PLANE0_TEXTURE_UNIT_NV    0x332C

				#define EGL_YUV_PLANE1_TEXTURE_UNIT_NV    0x332D

				#define EGL_YUV_PLANE2_TEXTURE_UNIT_NV    0x332E

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLSTREAMCONSUMERGLTEXTUREEXTERNALATTRIBSNVPROC) (EGLDisplay dpy, EGLStreamKHR stream, EGLAttrib *attrib_list);

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLSTREAMCONSUMERGLTEXTUREEXTERNALATTRIBSNVPROC) (EGLDisplay dpy, EGLStreamKHR stream, const EGLAttrib *attrib_list);

				#ifdef EGL_EGLEXT_PROTOTYPES

				EGLAPI EGLBoolean EGLAPIENTRY eglStreamConsumerGLTextureExternalAttribsNV (EGLDisplay dpy, EGLStreamKHR stream, EGLAttrib *attrib_list);

				EGLAPI EGLBoolean EGLAPIENTRY eglStreamConsumerGLTextureExternalAttribsNV (EGLDisplay dpy, EGLStreamKHR stream, const EGLAttrib *attrib_list);

				#endif

				#endif /* EGL_NV_stream_consumer_gltexture_yuv */

				@@ -1097,6 +1181,14 @@ EGLAPI EGLBoolean EGLAPIENTRY eglStreamConsumerGLTextureExternalAttribsNV (EGLDi

				#define EGL_STREAM_FIFO_SYNCHRONOUS_NV    0x3336

				#endif /* EGL_NV_stream_fifo_synchronous */

				#ifndef EGL_NV_stream_flush

				#define EGL_NV_stream_flush 1

				typedef EGLBoolean (EGLAPIENTRYP PFNEGLSTREAMFLUSHNVPROC) (EGLDisplay dpy, EGLStreamKHR stream);

				#ifdef EGL_EGLEXT_PROTOTYPES

				EGLAPI EGLBoolean EGLAPIENTRY eglStreamFlushNV (EGLDisplay dpy, EGLStreamKHR stream);

				#endif

				#endif /* EGL_NV_stream_flush */

				#ifndef EGL_NV_stream_frame_limits

				#define EGL_NV_stream_frame_limits 1

				#define EGL_PRODUCER_MAX_FRAME_HINT_NV    0x3337

									
										2

include/GL/gl.h
									
												View File
												
				@@ -2086,7 +2086,7 @@ typedef void (APIENTRYP PFNGLBLENDEQUATIONSEPARATEATIPROC) (GLenum modeRGB, GLen

				/* GL_OES_EGL_image */

				#ifndef GL_OES_EGL_image

				#if !defined(GL_OES_EGL_image) && !defined(GL_EXT_EGL_image_storage)

				typedef void* GLeglImageOES;

				#endif

									
										158

include/GL/internal/dri_interface.h
									
												View File
												
				@@ -589,7 +589,7 @@ struct __DRIdamageExtensionRec {

				 * SWRast Loader extension.

				 */

				#define __DRI_SWRAST_LOADER "DRI_SWRastLoader"

				#define __DRI_SWRAST_LOADER_VERSION 4

				#define __DRI_SWRAST_LOADER_VERSION 5

				struct __DRIswrastLoaderExtensionRec {

				    __DRIextension base;

				@@ -649,6 +649,23 @@ struct __DRIswrastLoaderExtensionRec {

				    void (*getImageShm)(__DRIdrawable *readable,

				                        int x, int y, int width, int height,

				                        int shmid, void *loaderPrivate);

				   /**

				     * Put shm image to drawable (v2)

				     *

				     * The original version fixes srcx/y to 0, and expected

				     * the offset to be adjusted. This version allows src x,y

				     * to not be included in the offset. This is needed to

				     * avoid certain overflow checks in the X server, that

				     * result in lost rendering.

				     *

				     * \since 5

				     */

				    void (*putImageShm2)(__DRIdrawable *drawable, int op,

				                         int x, int y,

				                         int width, int height, int stride,

				                         int shmid, char *shmaddr, unsigned offset,

				                         void *loaderPrivate);

				};

				/**

				@@ -746,7 +763,8 @@ struct __DRIuseInvalidateExtensionRec {

				#define __DRI_ATTRIB_BIND_TO_TEXTURE_TARGETS	46

				#define __DRI_ATTRIB_YINVERTED			47

				#define __DRI_ATTRIB_FRAMEBUFFER_SRGB_CAPABLE	48

				#define __DRI_ATTRIB_MAX			(__DRI_ATTRIB_FRAMEBUFFER_SRGB_CAPABLE + 1)

				#define __DRI_ATTRIB_MUTABLE_RENDER_BUFFER	49 /* EGL_MUTABLE_RENDER_BUFFER_BIT_KHR */

				#define __DRI_ATTRIB_MAX			50

				/* __DRI_ATTRIB_RENDER_TYPE */

				#define __DRI_ATTRIB_RGBA_BIT			0x01	

				@@ -1326,6 +1344,7 @@ struct __DRIdri2ExtensionRec {

				#define __DRI_IMAGE_FOURCC_NV16		0x3631564e

				#define __DRI_IMAGE_FOURCC_YUYV		0x56595559

				#define __DRI_IMAGE_FOURCC_UYVY		0x59565955

				#define __DRI_IMAGE_FOURCC_AYUV		0x56555941

				#define __DRI_IMAGE_FOURCC_YVU410	0x39555659

				#define __DRI_IMAGE_FOURCC_YVU411	0x31315659

				@@ -1352,6 +1371,7 @@ struct __DRIdri2ExtensionRec {

				#define __DRI_IMAGE_COMPONENTS_Y_UV	0x3004

				#define __DRI_IMAGE_COMPONENTS_Y_XUXV	0x3005

				#define __DRI_IMAGE_COMPONENTS_Y_UXVX	0x3008

				#define __DRI_IMAGE_COMPONENTS_AYUV	0x3009

				#define __DRI_IMAGE_COMPONENTS_R	0x3006

				#define __DRI_IMAGE_COMPONENTS_RG	0x3007

				@@ -1888,9 +1908,57 @@ struct __DRI2rendererQueryExtensionRec {

				 * Image Loader extension. Drivers use this to allocate color buffers

				 */

				/**

				 * See __DRIimageLoaderExtensionRec::getBuffers::buffer_mask.

				 */

				enum __DRIimageBufferMask {

				   __DRI_IMAGE_BUFFER_BACK = (1 << 0),

				   __DRI_IMAGE_BUFFER_FRONT = (1 << 1)

				   __DRI_IMAGE_BUFFER_FRONT = (1 << 1),

				   /**

				    * A buffer shared between application and compositor. The buffer may be

				    * simultaneously accessed by each.

				    *

				    * A shared buffer is equivalent to an EGLSurface whose EGLConfig contains

				    * EGL_MUTABLE_RENDER_BUFFER_BIT_KHR and whose active EGL_RENDER_BUFFER (as

				    * opposed to any pending, requested change to EGL_RENDER_BUFFER) is

				    * EGL_SINGLE_BUFFER.

				    *

				    * If buffer_mask contains __DRI_IMAGE_BUFFER_SHARED, then must contains no

				    * other bits. As a corollary, a __DRIdrawable that has a "shared" buffer

				    * has no front nor back buffer.

				    *

				    * The loader returns __DRI_IMAGE_BUFFER_SHARED in buffer_mask if and only

				    * if:

				    *     - The loader supports __DRI_MUTABLE_RENDER_BUFFER_LOADER.

				    *     - The driver supports __DRI_MUTABLE_RENDER_BUFFER_DRIVER.

				    *     - The EGLConfig of the drawable EGLSurface contains

				    *       EGL_MUTABLE_RENDER_BUFFER_BIT_KHR.

				    *     - The EGLContext's EGL_RENDER_BUFFER is EGL_SINGLE_BUFFER.

				    *       Equivalently, the EGLSurface's active EGL_RENDER_BUFFER (as

				    *       opposed to any pending, requested change to EGL_RENDER_BUFFER) is

				    *       EGL_SINGLE_BUFFER. (See the EGL 1.5 and

				    *       EGL_KHR_mutable_render_buffer spec for details about "pending" vs

				    *       "active" EGL_RENDER_BUFFER state).

				    *

				    * A shared buffer is similar to a front buffer in that all rendering to the

				    * buffer should appear promptly on the screen. It is different from

				    * a front buffer in that its behavior is independent from the

				    * GL_DRAW_BUFFER state. Specifically, if GL_DRAW_FRAMEBUFFER is 0 and the

				    * __DRIdrawable's buffer_mask is __DRI_IMAGE_BUFFER_SHARED, then all

				    * rendering should appear promptly on the screen if GL_DRAW_BUFFER is not

				    * GL_NONE.

				    *

				    * The difference between a shared buffer and a front buffer is motivated

				    * by the constraints of Android and OpenGL ES. OpenGL ES does not support

				    * front-buffer rendering. Android's SurfaceFlinger protocol provides the

				    * EGL driver only a back buffer and no front buffer. The shared buffer

				    * mode introduced by EGL_KHR_mutable_render_buffer is a backdoor though

				    * EGL that allows Android OpenGL ES applications to render to what is

				    * effectively the front buffer, a backdoor that required no change to the

				    * OpenGL ES API and little change to the SurfaceFlinger API.

				    */

				   __DRI_IMAGE_BUFFER_SHARED = (1 << 2),

				};

				struct __DRIimageList {

				@@ -1915,7 +1983,8 @@ struct __DRIimageLoaderExtensionRec {

				    * \param stamp              Address of variable to be updated when

				    *                           getBuffers must be called again

				    * \param loaderPrivate      The loaderPrivate for driDrawable

				    * \param buffer_mask        Set of buffers to allocate

				    * \param buffer_mask        Set of buffers to allocate. A bitmask of

				    *                           __DRIimageBufferMask.

				    * \param buffers            Returned buffers

				    */

				   int (*getBuffers)(__DRIdrawable *driDrawable,

				@@ -2029,4 +2098,85 @@ struct __DRIbackgroundCallableExtensionRec {

				   GLboolean (*isThreadSafe)(void *loaderPrivate);

				};

				/**

				 * The driver portion of EGL_KHR_mutable_render_buffer.

				 *

				 * If the driver creates a __DRIconfig with

				 * __DRI_ATTRIB_MUTABLE_RENDER_BUFFER, then it must support this extension.

				 *

				 * To support this extension:

				 *

				 *    - The driver should create at least one __DRIconfig with

				 *      __DRI_ATTRIB_MUTABLE_RENDER_BUFFER. This is strongly recommended but

				 *      not required.

				 *

				 *    - The driver must be able to handle __DRI_IMAGE_BUFFER_SHARED if

				 *      returned by __DRIimageLoaderExtension:getBuffers().

				 *

				 *    - When rendering to __DRI_IMAGE_BUFFER_SHARED, it must call

				 *      __DRImutableRenderBufferLoaderExtension::displaySharedBuffer() in

				 *      response to glFlush and glFinish.  (This requirement is not documented

				 *      in EGL_KHR_mutable_render_buffer, but is a de-facto requirement in the

				 *      Android ecosystem. Android applications expect that glFlush will

				 *      immediately display the buffer when in shared buffer mode, and Android

				 *      drivers comply with this expectation).  It :may: call

				 *      displaySharedBuffer() more often than required.

				 *

				 *    - When rendering to __DRI_IMAGE_BUFFER_SHARED, it must ensure that the

				 *      buffer is always in a format compatible for display because the

				 *      display engine (usually SurfaceFlinger or hwcomposer) may display the

				 *      image at any time, even concurrently with 3D rendering. For example,

				 *      display hardware and the GL hardware may be able to access the buffer

				 *      simultaneously. In particular, if the buffer is compressed then take

				 *      care that SurfaceFlinger and hwcomposer can consume the compression

				 *      format.

				 *

				 * \see __DRI_IMAGE_BUFFER_SHARED

				 * \see __DRI_ATTRIB_MUTABLE_RENDER_BUFFER

				 * \see __DRI_MUTABLE_RENDER_BUFFER_LOADER

				 */

				#define __DRI_MUTABLE_RENDER_BUFFER_DRIVER "DRI_MutableRenderBufferDriver"

				#define __DRI_MUTABLE_RENDER_BUFFER_DRIVER_VERSION 1

				typedef struct __DRImutableRenderBufferDriverExtensionRec __DRImutableRenderBufferDriverExtension;

				struct __DRImutableRenderBufferDriverExtensionRec {

				   __DRIextension base;

				};

				/**

				 * The loader portion of EGL_KHR_mutable_render_buffer.

				 *

				 * Requires loader extension DRI_IMAGE_LOADER, through which the loader sends

				 * __DRI_IMAGE_BUFFER_SHARED to the driver.

				 *

				 * \see __DRI_MUTABLE_RENDER_BUFFER_DRIVER

				 */

				#define __DRI_MUTABLE_RENDER_BUFFER_LOADER "DRI_MutableRenderBufferLoader"

				#define __DRI_MUTABLE_RENDER_BUFFER_LOADER_VERSION 1

				typedef struct __DRImutableRenderBufferLoaderExtensionRec __DRImutableRenderBufferLoaderExtension;

				struct __DRImutableRenderBufferLoaderExtensionRec {

				   __DRIextension base;

				   /**

				    * Inform the display engine (that is, SurfaceFlinger and/or hwcomposer)

				    * that the __DRIdrawable has new content.

				    *

				    * The display engine may ignore this call, for example, if it continually

				    * refreshes and displays the buffer on every frame, as in

				    * EGL_ANDROID_front_buffer_auto_refresh. On the other extreme, the display

				    * engine may refresh and display the buffer only in frames in which the

				    * driver calls this.

				    *

				    * If the fence_fd is not -1, then the display engine will display the

				    * buffer only after the fence signals.

				    *

				    * The drawable's current __DRIimageBufferMask, as returned by

				    * __DRIimageLoaderExtension::getBuffers(), must be

				    * __DRI_IMAGE_BUFFER_SHARED.

				    */

				   void (*displaySharedBuffer)(__DRIdrawable *drawable, int fence_fd,

				                               void *loaderPrivate);

				};

				#endif

									
										12

include/c11/threads_win32.h
									
												View File
												
				@@ -76,18 +76,8 @@ Configuration macro:

				#endif

				/* Visual Studio 2015 and later */

				#if _MSC_VER >= 1900

				#define HAVE_TIMESPEC

				#ifdef _MSC_VER

				#define HAVE_TIMESPEC_GET

				#elif defined(__MINGW32__)

				#define HAVE_TIMESPEC

				#endif

				#ifndef HAVE_TIMESPEC

				struct timespec {

				    time_t tv_sec;

				    long tv_nsec;

				};

				#endif

				/*---------------------------- macros ----------------------------*/

									
										4

include/c99_compat.h
									
												View File
												
				@@ -36,8 +36,8 @@

				 */

				#if defined(_MSC_VER)

				#  if _MSC_VER < 1800 || (_MSC_FULL_VER < 180031101 && !defined(__clang__))

				#    error "Microsoft Visual Studio 2013 Update 4 or higher required"

				#  if _MSC_VER < 1900

				#    error "Microsoft Visual Studio 2015 or higher required"

				#  endif

				   /*

									
										2

include/d3dadapter/present.h
									
												View File
												
				@@ -125,7 +125,7 @@ struct ID3DPresent

				#define ID3DPresent_SetCursorPos(p,a) (p)->lpVtbl->SetCursorPos(p,a)

				#define ID3DPresent_SetCursor(p,a,b,c) (p)->lpVtbl->SetCursor(p,a,b,c)

				#define ID3DPresent_SetGammaRamp(p,a,b) (p)->lpVtbl->SetGammaRamp(p,a,b)

				#define ID3DPresent_GetWindowInfo(p,a,b,c,d) (p)->lpVtbl->GetWindowSize(p,a,b,c,d)

				#define ID3DPresent_GetWindowInfo(p,a,b,c,d) (p)->lpVtbl->GetWindowInfo(p,a,b,c,d)

				#define ID3DPresent_GetWindowOccluded(p) (p)->lpVtbl->GetWindowOccluded(p)

				#define ID3DPresent_ResolutionMismatch(p) (p)->lpVtbl->ResolutionMismatch(p)

				#define ID3DPresent_CreateThread(p,a,b) (p)->lpVtbl->CreateThread(p,a,b)

									
										13

include/drm-uapi/drm_fourcc.h
									
												View File
												
				@@ -298,6 +298,19 @@ extern "C" {

				 */

				#define DRM_FORMAT_MOD_SAMSUNG_64_32_TILE	fourcc_mod_code(SAMSUNG, 1)

				/*

				 * Qualcomm Compressed Format

				 *

				 * Refers to a compressed variant of the base format that is compressed.

				 * Implementation may be platform and base-format specific.

				 *

				 * Each macrotile consists of m x n (mostly 4 x 4) tiles.

				 * Pixel data pitch/stride is aligned with macrotile width.

				 * Pixel data height is aligned with macrotile height.

				 * Entire pixel data buffer is aligned with 4k(bytes).

				 */

				#define DRM_FORMAT_MOD_QCOM_COMPRESSED  fourcc_mod_code(QCOM, 1)

				/* Vivante framebuffer modifiers */

				/*

									
										39

include/drm-uapi/v3d_drm.h
									
												View File
												
				@@ -36,6 +36,7 @@ extern "C" {

				#define DRM_V3D_MMAP_BO                           0x03

				#define DRM_V3D_GET_PARAM                         0x04

				#define DRM_V3D_GET_BO_OFFSET                     0x05

				#define DRM_V3D_SUBMIT_TFU                        0x06

				#define DRM_IOCTL_V3D_SUBMIT_CL           DRM_IOWR(DRM_COMMAND_BASE + DRM_V3D_SUBMIT_CL, struct drm_v3d_submit_cl)

				#define DRM_IOCTL_V3D_WAIT_BO             DRM_IOWR(DRM_COMMAND_BASE + DRM_V3D_WAIT_BO, struct drm_v3d_wait_bo)

				@@ -43,6 +44,7 @@ extern "C" {

				#define DRM_IOCTL_V3D_MMAP_BO             DRM_IOWR(DRM_COMMAND_BASE + DRM_V3D_MMAP_BO, struct drm_v3d_mmap_bo)

				#define DRM_IOCTL_V3D_GET_PARAM           DRM_IOWR(DRM_COMMAND_BASE + DRM_V3D_GET_PARAM, struct drm_v3d_get_param)

				#define DRM_IOCTL_V3D_GET_BO_OFFSET       DRM_IOWR(DRM_COMMAND_BASE + DRM_V3D_GET_BO_OFFSET, struct drm_v3d_get_bo_offset)

				#define DRM_IOCTL_V3D_SUBMIT_TFU          DRM_IOW(DRM_COMMAND_BASE + DRM_V3D_SUBMIT_TFU, struct drm_v3d_submit_tfu)

				/**

				 * struct drm_v3d_submit_cl - ioctl argument for submitting commands to the 3D

				@@ -58,10 +60,15 @@ struct drm_v3d_submit_cl {

					 * coordinate shader to determine where primitives land on the screen,

					 * then writes out the state updates and draw calls necessary per tile

					 * to the tile allocation BO.

					 *

					 * This BCL will block on any previous BCL submitted on the

					 * same FD, but not on any RCL or BCLs submitted by other

					 * clients -- that is left up to the submitter to control

					 * using in_sync_bcl if necessary.

					 */

					__u32 bcl_start;

					 /** End address of the BCL (first byte after the BCL) */

					/** End address of the BCL (first byte after the BCL) */

					__u32 bcl_end;

					/* Offset of the render command list.

				@@ -69,10 +76,15 @@ struct drm_v3d_submit_cl {

					 * This is the second set of commands executed, which will either

					 * execute the tiles that have been set up by the BCL, or a fixed set

					 * of tiles (in the case of RCL-only blits).

					 *

					 * This RCL will block on this submit's BCL, and any previous

					 * RCL submitted on the same FD, but not on any RCL or BCLs

					 * submitted by other clients -- that is left up to the

					 * submitter to control using in_sync_rcl if necessary.

					 */

					__u32 rcl_start;

					 /** End address of the RCL (first byte after the RCL) */

					/** End address of the RCL (first byte after the RCL) */

					__u32 rcl_end;

					/** An optional sync object to wait on before starting the BCL. */

				@@ -169,6 +181,7 @@ enum drm_v3d_param {

					DRM_V3D_PARAM_V3D_CORE0_IDENT0,

					DRM_V3D_PARAM_V3D_CORE0_IDENT1,

					DRM_V3D_PARAM_V3D_CORE0_IDENT2,

					DRM_V3D_PARAM_SUPPORTS_TFU,

				};

				struct drm_v3d_get_param {

				@@ -187,6 +200,28 @@ struct drm_v3d_get_bo_offset {

					__u32 offset;

				};

				struct drm_v3d_submit_tfu {

					__u32 icfg;

					__u32 iia;

					__u32 iis;

					__u32 ica;

					__u32 iua;

					__u32 ioa;

					__u32 ios;

					__u32 coef[4];

					/* First handle is the output BO, following are other inputs.

					 * 0 for unused.

					 */

					__u32 bo_handles[4];

					/* sync object to block on before running the TFU job.  Each TFU

					 * job will execute in the order submitted to its FD.  Synchronization

					 * against rendering jobs requires using sync objects.

					 */

					__u32 in_sync;

					/* Sync object to signal when the TFU job is done. */

					__u32 out_sync;

				};

				#if defined(__cplusplus)

				}

				#endif

									
										5

include/pci_ids/i965_pci_ids.h
									
												View File
												
				@@ -171,6 +171,7 @@ CHIPSET(0x3185, glk_2x6, "Intel(R) UHD Graphics 600 (Geminilake 2x6)")

				CHIPSET(0x3E90, cfl_gt1, "Intel(R) UHD Graphics 610 (Coffeelake 2x6 GT1)")

				CHIPSET(0x3E93, cfl_gt1, "Intel(R) UHD Graphics 610 (Coffeelake 2x6 GT1)")

				CHIPSET(0x3E99, cfl_gt1, "Intel(R) HD Graphics (Coffeelake 2x6 GT1)")

				CHIPSET(0x3E9C, cfl_gt1, "Intel(R) HD Graphics (Coffeelake 2x6 GT1)")

				CHIPSET(0x3E91, cfl_gt2, "Intel(R) UHD Graphics 630 (Coffeelake 3x8 GT2)")

				CHIPSET(0x3E92, cfl_gt2, "Intel(R) UHD Graphics 630 (Coffeelake 3x8 GT2)")

				CHIPSET(0x3E96, cfl_gt2, "Intel(R) HD Graphics (Coffeelake 3x8 GT2)")

				@@ -203,6 +204,10 @@ CHIPSET(0x5A54, cnl_5x8, "Intel(R) HD Graphics (Cannonlake 5x8 GT2)")

				CHIPSET(0x8A50, icl_8x8, "Intel(R) HD Graphics (Ice Lake 8x8 GT2)")

				CHIPSET(0x8A51, icl_8x8, "Intel(R) HD Graphics (Ice Lake 8x8 GT2)")

				CHIPSET(0x8A52, icl_8x8, "Intel(R) HD Graphics (Ice Lake 8x8 GT2)")

				CHIPSET(0x8A56, icl_4x8, "Intel(R) HD Graphics (Ice Lake 4x8 GT1)")

				CHIPSET(0x8A57, icl_6x8, "Intel(R) HD Graphics (Ice Lake 6x8 GT1.5)")

				CHIPSET(0x8A58, icl_4x8, "Intel(R) HD Graphics (Ice Lake 4x8 GT1)")

				CHIPSET(0x8A59, icl_6x8, "Intel(R) HD Graphics (Ice Lake 6x8 GT1.5)")

				CHIPSET(0x8A5A, icl_6x8, "Intel(R) HD Graphics (Ice Lake 6x8 GT1.5)")

				CHIPSET(0x8A5B, icl_4x8, "Intel(R) HD Graphics (Ice Lake 4x8 GT1)")

				CHIPSET(0x8A5C, icl_6x8, "Intel(R) HD Graphics (Ice Lake 6x8 GT1.5)")

									
										12

include/vulkan/vulkan.h
									
												View File
												
				@@ -2,7 +2,7 @@

				#define VULKAN_H_ 1

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

				@@ -24,6 +24,10 @@

				#include "vulkan_android.h"

				#endif

				#ifdef VK_USE_PLATFORM_FUCHSIA

				#include <zircon/types.h>

				#include "vulkan_fuchsia.h"

				#endif

				#ifdef VK_USE_PLATFORM_IOS_MVK

				#include "vulkan_ios.h"

				@@ -35,12 +39,6 @@

				#endif

				#ifdef VK_USE_PLATFORM_MIR_KHR

				#include <mir_toolkit/client_types.h>

				#include "vulkan_mir.h"

				#endif

				#ifdef VK_USE_PLATFORM_VI_NN

				#include "vulkan_vi.h"

				#endif

									
										2

include/vulkan/vulkan_android.h
									
												View File
												
				@@ -6,7 +6,7 @@ extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

1236

include/vulkan/vulkan_core.h

View File

File diff suppressed because it is too large Load Diff

									
										58

include/vulkan/vulkan_fuchsia.h
									
										Normal file
									
												View File
												
				@@ -0,0 +1,58 @@

				#ifndef VULKAN_FUCHSIA_H_

				#define VULKAN_FUCHSIA_H_ 1

				#ifdef __cplusplus

				extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

				** You may obtain a copy of the License at

				**

				**     http://www.apache.org/licenses/LICENSE-2.0

				**

				** Unless required by applicable law or agreed to in writing, software

				** distributed under the License is distributed on an "AS IS" BASIS,

				** WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.

				** See the License for the specific language governing permissions and

				** limitations under the License.

				*/

				/*

				** This header is generated from the Khronos Vulkan XML API Registry.

				**

				*/

				#define VK_FUCHSIA_imagepipe_surface 1

				#define VK_FUCHSIA_IMAGEPIPE_SURFACE_SPEC_VERSION 1

				#define VK_FUCHSIA_IMAGEPIPE_SURFACE_EXTENSION_NAME "VK_FUCHSIA_imagepipe_surface"

				typedef VkFlags VkImagePipeSurfaceCreateFlagsFUCHSIA;

				typedef struct VkImagePipeSurfaceCreateInfoFUCHSIA {

				    VkStructureType                         sType;

				    const void*                             pNext;

				    VkImagePipeSurfaceCreateFlagsFUCHSIA    flags;

				    zx_handle_t                             imagePipeHandle;

				} VkImagePipeSurfaceCreateInfoFUCHSIA;

				typedef VkResult (VKAPI_PTR *PFN_vkCreateImagePipeSurfaceFUCHSIA)(VkInstance instance, const VkImagePipeSurfaceCreateInfoFUCHSIA* pCreateInfo, const VkAllocationCallbacks* pAllocator, VkSurfaceKHR* pSurface);

				#ifndef VK_NO_PROTOTYPES

				VKAPI_ATTR VkResult VKAPI_CALL vkCreateImagePipeSurfaceFUCHSIA(

				    VkInstance                                  instance,

				    const VkImagePipeSurfaceCreateInfoFUCHSIA*  pCreateInfo,

				    const VkAllocationCallbacks*                pAllocator,

				    VkSurfaceKHR*                               pSurface);

				#endif

				#ifdef __cplusplus

				}

				#endif

				#endif

									
										2

include/vulkan/vulkan_ios.h
									
												View File
												
				@@ -6,7 +6,7 @@ extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

									
										2

include/vulkan/vulkan_macos.h
									
												View File
												
				@@ -6,7 +6,7 @@ extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

									
										65

include/vulkan/vulkan_mir.h
									
												View File
											
				@@ -1,65 +0,0 @@

				#ifndef VULKAN_MIR_H_

				#define VULKAN_MIR_H_ 1

				#ifdef __cplusplus

				extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

				** You may obtain a copy of the License at

				**

				**     http://www.apache.org/licenses/LICENSE-2.0

				**

				** Unless required by applicable law or agreed to in writing, software

				** distributed under the License is distributed on an "AS IS" BASIS,

				** WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.

				** See the License for the specific language governing permissions and

				** limitations under the License.

				*/

				/*

				** This header is generated from the Khronos Vulkan XML API Registry.

				**

				*/

				#define VK_KHR_mir_surface 1

				#define VK_KHR_MIR_SURFACE_SPEC_VERSION   4

				#define VK_KHR_MIR_SURFACE_EXTENSION_NAME "VK_KHR_mir_surface"

				typedef VkFlags VkMirSurfaceCreateFlagsKHR;

				typedef struct VkMirSurfaceCreateInfoKHR {

				    VkStructureType               sType;

				    const void*                   pNext;

				    VkMirSurfaceCreateFlagsKHR    flags;

				    MirConnection*                connection;

				    MirSurface*                   mirSurface;

				} VkMirSurfaceCreateInfoKHR;

				typedef VkResult (VKAPI_PTR *PFN_vkCreateMirSurfaceKHR)(VkInstance instance, const VkMirSurfaceCreateInfoKHR* pCreateInfo, const VkAllocationCallbacks* pAllocator, VkSurfaceKHR* pSurface);

				typedef VkBool32 (VKAPI_PTR *PFN_vkGetPhysicalDeviceMirPresentationSupportKHR)(VkPhysicalDevice physicalDevice, uint32_t queueFamilyIndex, MirConnection* connection);

				#ifndef VK_NO_PROTOTYPES

				VKAPI_ATTR VkResult VKAPI_CALL vkCreateMirSurfaceKHR(

				    VkInstance                                  instance,

				    const VkMirSurfaceCreateInfoKHR*            pCreateInfo,

				    const VkAllocationCallbacks*                pAllocator,

				    VkSurfaceKHR*                               pSurface);

				VKAPI_ATTR VkBool32 VKAPI_CALL vkGetPhysicalDeviceMirPresentationSupportKHR(

				    VkPhysicalDevice                            physicalDevice,

				    uint32_t                                    queueFamilyIndex,

				    MirConnection*                              connection);

				#endif

				#ifdef __cplusplus

				}

				#endif

				#endif

									
										2

include/vulkan/vulkan_vi.h
									
												View File
												
				@@ -6,7 +6,7 @@ extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

									
										2

include/vulkan/vulkan_wayland.h
									
												View File
												
				@@ -6,7 +6,7 @@ extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

									
										2

include/vulkan/vulkan_win32.h
									
												View File
												
				@@ -6,7 +6,7 @@ extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

									
										2

include/vulkan/vulkan_xcb.h
									
												View File
												
				@@ -6,7 +6,7 @@ extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

									
										2

include/vulkan/vulkan_xlib.h
									
												View File
												
				@@ -6,7 +6,7 @@ extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

									
										2

include/vulkan/vulkan_xlib_xrandr.h
									
												View File
												
				@@ -6,7 +6,7 @@ extern "C" {

				#endif

				/*

				** Copyright (c) 2015-2018 The Khronos Group Inc.

				** Copyright (c) 2015-2019 The Khronos Group Inc.

				**

				** Licensed under the Apache License, Version 2.0 (the "License");

				** you may not use this file except in compliance with the License.

7

m4/ax_check_python_mako_module.m4

View File

@@ -45,13 +45,14 @@ AC_DEFUN([AX_CHECK_PYTHON_MAKO_MODULE],
 try:
     import sys
     import mako
     import distutils.version
 except ImportError as err:
     sys.exit(err)
 else:
     ver_req = map(int, '$1'.split('.'))
     ver_act = map(int, mako.__version__.split('.'))
     ver_req = distutils.version.LooseVersion('$1')
     ver_act = distutils.version.LooseVersion(mako.__version__)
     sys.exit(int(ver_req > ver_act))
     " | $PYTHON2 -
     " | $PYTHON -
     if test $? -ne 0 ; then
        AC_MSG_RESULT(no)

									
										200

meson.build
									
												View File
												
				@@ -1,4 +1,4 @@

				# Copyright © 2017-2018 Intel Corporation

				# Copyright © 2017-2019 Intel Corporation

				# Permission is hereby granted, free of charge, to any person obtaining a copy

				# of this software and associated documentation files (the "Software"), to deal

				@@ -25,8 +25,8 @@ project(

				    [find_program('python', 'python2', 'python3'), 'bin/meson_get_version.py']

				  ).stdout(),

				  license : 'MIT',

				  meson_version : '>= 0.44.1',

				  default_options : ['buildtype=debugoptimized', 'c_std=c99', 'cpp_std=c++11']

				  meson_version : '>= 0.45',

				  default_options : ['buildtype=debugoptimized', 'b_ndebug=if-release', 'c_std=c99', 'cpp_std=c++11']

				)

				cc = meson.get_compiler('c')

				@@ -34,8 +34,6 @@ cpp = meson.get_compiler('cpp')

				null_dep = dependency('', required : false)

				system_has_kms_drm = ['openbsd', 'netbsd', 'freebsd', 'dragonfly', 'linux'].contains(host_machine.system())

				# Arguments for the preprocessor, put these in a separate array from the C and

				# C++ (cpp in meson terminology) arguments since they need to be added to the

				# default arguments for both C and C++.

				@@ -43,8 +41,7 @@ pre_args = [

				  '-D__STDC_CONSTANT_MACROS',

				  '-D__STDC_FORMAT_MACROS',

				  '-D__STDC_LIMIT_MACROS',

				  '-DVERSION="@0@"'.format(meson.project_version()),

				  '-DPACKAGE_VERSION=VERSION',

				  '-DPACKAGE_VERSION="@0@"'.format(meson.project_version()),

				  '-DPACKAGE_BUGREPORT="https://bugs.freedesktop.org/enter_bug.cgi?product=Mesa"',

				]

				@@ -54,20 +51,21 @@ with_valgrind = get_option('valgrind')

				with_libunwind = get_option('libunwind')

				with_asm = get_option('asm')

				with_glx_read_only_text = get_option('glx-read-only-text')

				with_glx_direct = get_option('glx-direct')

				with_osmesa = get_option('osmesa')

				with_swr_arches = get_option('swr-arches')

				with_tools = get_option('tools')

				if with_tools.contains('all')

				  with_tools = ['freedreno', 'glsl', 'intel', 'nir', 'nouveau', 'xvmc']

				  with_tools = ['etnaviv', 'freedreno', 'glsl', 'intel', 'nir', 'nouveau', 'xvmc']

				endif

				dri_drivers_path = get_option('dri-drivers-path')

				if dri_drivers_path == ''

				  dri_drivers_path = join_paths(get_option('libdir'), 'dri')

				  dri_drivers_path = join_paths(get_option('prefix'), get_option('libdir'), 'dri')

				endif

				dri_search_path = get_option('dri-search-path')

				if dri_search_path == ''

				  dri_search_path = join_paths(get_option('prefix'), dri_drivers_path)

				  dri_search_path = dri_drivers_path

				endif

				with_gles1 = get_option('gles1')

				@@ -102,13 +100,15 @@ if _drivers.contains('auto')

				    elif ['arm', 'aarch64'].contains(host_machine.cpu_family())

				      _drivers = []

				    else

				      error('Unknown architecture. Please pass -Ddri-drivers to set driver options. Patches gladly accepted to fix this.')

				      error('Unknown architecture @0@. Please pass -Ddri-drivers to set driver options. Patches gladly accepted to fix this.'.format(

				            host_machine.cpu_family()))

				    endif

				  elif ['darwin', 'windows', 'cygwin', 'haiku'].contains(host_machine.system())

				    # only swrast would make sense here, but gallium swrast is a much better default

				    _drivers = []

				  else

				    error('Unknown OS. Please pass -Ddri-drivers to set driver options. Patches gladly accepted to fix this.')

				    error('Unknown OS @0@. Please pass -Ddri-drivers to set driver options. Patches gladly accepted to fix this.'.format(

				          host_machine.system()))

				  endif

				endif

				@@ -131,19 +131,21 @@ if _drivers.contains('auto')

				      ]

				    elif ['arm', 'aarch64'].contains(host_machine.cpu_family())

				      _drivers = [

				        'pl111', 'v3d', 'vc4', 'freedreno', 'etnaviv', 'imx', 'nouveau',

				        'kmsro', 'v3d', 'vc4', 'freedreno', 'etnaviv', 'nouveau',

				        'tegra', 'virgl', 'swrast',

				      ]

				    else

				      error('Unknown architecture. Please pass -Dgallium-drivers to set driver options. Patches gladly accepted to fix this.')

				      error('Unknown architecture @0@. Please pass -Dgallium-drivers to set driver options. Patches gladly accepted to fix this.'.format(

				            host_machine.cpu_family()))

				    endif

				  elif ['darwin', 'windows', 'cygwin', 'haiku'].contains(host_machine.system())

				    _drivers = ['swrast']

				  else

				    error('Unknown OS. Please pass -Dgallium-drivers to set driver options. Patches gladly accepted to fix this.')

				    error('Unknown OS @0@. Please pass -Dgallium-drivers to set driver options. Patches gladly accepted to fix this.'.format(

				          host_machine.system()))

				  endif

				endif

				with_gallium_pl111 = _drivers.contains('pl111')

				with_gallium_kmsro = _drivers.contains('kmsro')

				with_gallium_radeonsi = _drivers.contains('radeonsi')

				with_gallium_r300 = _drivers.contains('r300')

				with_gallium_r600 = _drivers.contains('r600')

				@@ -153,13 +155,20 @@ with_gallium_softpipe = _drivers.contains('swrast')

				with_gallium_vc4 = _drivers.contains('vc4')

				with_gallium_v3d = _drivers.contains('v3d')

				with_gallium_etnaviv = _drivers.contains('etnaviv')

				with_gallium_imx = _drivers.contains('imx')

				with_gallium_tegra = _drivers.contains('tegra')

				with_gallium_i915 = _drivers.contains('i915')

				with_gallium_svga = _drivers.contains('svga')

				with_gallium_virgl = _drivers.contains('virgl')

				with_gallium_swr = _drivers.contains('swr')

				if cc.get_id() == 'intel'

				  if meson.version().version_compare('< 0.49.0')

				    error('Meson does not have sufficient support of ICC before 0.49.0 to compile mesa')

				  elif with_gallium_swr and meson.version().version_compare('== 0.49.0')

				    warning('Meson as of 0.49.0 is sufficient for compiling mesa with ICC, but there are some caveats with SWR. 0.49.1 should resolve all of these')

				  endif

				endif

				with_gallium = _drivers.length() != 0 and _drivers != ['']

				if with_gallium and system_has_kms_drm

				@@ -175,14 +184,18 @@ if _vulkan_drivers.contains('auto')

				  if system_has_kms_drm

				    if host_machine.cpu_family().startswith('x86')

				      _vulkan_drivers = ['amd', 'intel']

				    elif ['arm', 'aarch64'].contains(host_machine.cpu_family())

				      _vulkan_drivers = []

				    else

				      error('Unknown architecture. Please pass -Dvulkan-drivers to set driver options. Patches gladly accepted to fix this.')

				      error('Unknown architecture @0@. Please pass -Dvulkan-drivers to set driver options. Patches gladly accepted to fix this.'.format(

				            host_machine.cpu_family()))

				    endif

				  elif ['darwin', 'windows', 'cygwin', 'haiku'].contains(host_machine.system())

				    # No vulkan driver supports windows or macOS currently

				    _vulkan_drivers = []

				  else

				    error('Unknown OS. Please pass -Dvulkan-drivers to set driver options. Patches gladly accepted to fix this.')

				    error('Unknown OS @0@. Please pass -Dvulkan-drivers to set driver options. Patches gladly accepted to fix this.'.format(

				          host_machine.system()))

				  endif

				endif

				@@ -196,11 +209,8 @@ endif

				if with_dri_i915 and with_gallium_i915

				  error('Only one i915 provider can be built')

				endif

				if with_gallium_imx and not with_gallium_etnaviv

				  error('IMX driver requires etnaviv driver')

				endif

				if with_gallium_pl111 and not with_gallium_vc4

				  error('pl111 driver requires vc4 driver')

				if with_gallium_kmsro and not (with_gallium_vc4 or with_gallium_etnaviv or with_gallium_freedreno)

				  error('kmsro driver requires one or more renderonly drivers (vc4, etnaviv, freedreno)')

				endif

				if with_gallium_tegra and not with_gallium_nouveau

				  error('tegra driver requires nouveau driver')

				@@ -215,8 +225,6 @@ elif system_has_kms_drm

				else

				  # FIXME: haiku doesn't use dri, and xlib doesn't use dri, probably should

				  # assert here that one of those cases has been met.

				  # FIXME: GNU (hurd) ends up here as well, but meson doesn't officially

				  # support Hurd at time of writing (2017/11)

				  # FIXME: illumos ends up here as well

				  with_dri_platform = 'none'

				endif

				@@ -230,7 +238,8 @@ if _platforms.contains('auto')

				  elif ['haiku'].contains(host_machine.system())

				    _platforms = ['haiku']

				  else

				    error('Unknown OS. Please pass -Dplatforms to set platforms. Patches gladly accepted to fix this.')

				    error('Unknown OS @0@. Please pass -Dplatforms to set platforms. Patches gladly accepted to fix this.'.format(

				          host_machine.system()))

				  endif

				endif

				@@ -307,7 +316,7 @@ elif _egl == 'true'

				  elif not with_shared_glapi

				    error('EGL requires shared-glapi')

				  elif not with_platforms

				    error('No platforms specified, consider -Dplatforms=drm,x11 at least')

				    error('No platforms specified, consider -Dplatforms=drm,x11,surfaceless at least')

				  elif not ['disabled', 'dri'].contains(with_glx)

				    error('EGL requires dri, but a GLX is being built without dri')

				  elif ['darwin', 'windows'].contains(host_machine.system())

				@@ -318,12 +327,12 @@ else

				  with_egl = false

				endif

				if with_egl and not (with_platform_drm or with_platform_surfaceless)

				if with_egl and not (with_platform_drm or with_platform_surfaceless or with_platform_android)

				  if with_gallium_radeonsi

				    error('RadeonSI requires drm or surfaceless platform when using EGL')

				    error('RadeonSI requires the drm, surfaceless or android platform when using EGL')

				  endif

				  if with_gallium_virgl

				    error('Virgl requires drm or surfaceless platform when using EGL')

				    error('Virgl requires the drm, surfaceless or android platform when using EGL')

				  endif

				endif

				@@ -361,9 +370,6 @@ if with_glvnd

				  endif

				endif

				# TODO: toggle for this

				with_glx_direct = true

				if with_vulkan_icd_dir == ''

				  with_vulkan_icd_dir = join_paths(get_option('datadir'), 'vulkan/icd.d')

				endif

				@@ -380,8 +386,8 @@ if with_any_vk and (with_platform_x11 and not with_dri3)

				  error('Vulkan drivers require dri3 for X11 support')

				endif

				if with_dri

				  if with_glx == 'disabled' and not with_egl and not with_gbm

				    error('building dri drivers require at least one windowing system')

				  if with_glx == 'disabled' and not with_egl and not with_gbm and with_osmesa != 'classic'

				    error('building dri drivers require at least one windowing system or classic osmesa')

				  endif

				endif

				@@ -602,7 +608,7 @@ with_gallium_xa = _xa != 'false'

				d3d_drivers_path = get_option('d3d-drivers-path')

				if d3d_drivers_path == ''

				  d3d_drivers_path = join_paths(get_option('libdir'), 'd3d')

				  d3d_drivers_path = join_paths(get_option('prefix'), get_option('libdir'), 'd3d')

				endif

				with_gallium_st_nine =  get_option('gallium-nine')

				@@ -646,6 +652,7 @@ if get_option('power8') != 'false'

				endif

				_opencl = get_option('gallium-opencl')

				clover_cpp_std = []

				if _opencl != 'disabled'

				  if not with_gallium

				    error('OpenCL Clover implementation requires at least one gallium driver.')

				@@ -654,10 +661,18 @@ if _opencl != 'disabled'

				  dep_clc = dependency('libclc')

				  with_gallium_opencl = true

				  with_opencl_icd = _opencl == 'icd'

				  if host_machine.cpu_family().startswith('ppc') and cpp.compiles('''

				      #if !defined(__VEC__) || !defined(__ALTIVEC__)

				      #error "AltiVec not enabled"

				      #endif''',

				      name : 'Altivec')

				    clover_cpp_std += ['cpp_std=gnu++11']

				  endif

				else

				  dep_clc = null_dep

				  with_gallium_opencl = false

				  with_gallium_icd = false

				  with_opencl_icd = false

				endif

				gl_pkgconfig_c_flags = []

				@@ -705,10 +720,16 @@ if with_platform_haiku

				  pre_args += '-DHAVE_HAIKU_PLATFORM'

				endif

				prog_python2 = find_program('python2')

				has_mako = run_command(prog_python2, '-c', 'import mako')

				prog_python = import('python3').find_python()

				has_mako = run_command(

				  prog_python, '-c',

				  '''

				from distutils.version import StrictVersion

				import mako

				assert StrictVersion(mako.__version__) > StrictVersion("0.8.0")

				  ''')

				if has_mako.returncode() != 0

				  error('Python (2.x) mako module required to build mesa.')

				  error('Python (3.x) mako module >= 0.8.0 required to build mesa.')

				endif

				if cc.get_id() == 'gcc' and cc.version().version_compare('< 4.4.6')

				@@ -771,22 +792,26 @@ if cc.compiles('int foo(void) __attribute__((__noreturn__));',

				endif

				# TODO: this is very incomplete

				if ['linux', 'cygwin'].contains(host_machine.system())

				if ['linux', 'cygwin', 'gnu'].contains(host_machine.system())

				  pre_args += '-D_GNU_SOURCE'

				endif

				# Check for generic C arguments

				c_args = []

				foreach a : ['-Werror=implicit-function-declaration',

				             '-Werror=missing-prototypes', '-fno-math-errno',

				             '-Werror=missing-prototypes', '-Werror=return-type',

				             '-fno-math-errno',

				             '-fno-trapping-math', '-Qunused-arguments']

				  if cc.has_argument(a)

				    c_args += a

				  endif

				endforeach

				if cc.has_argument('-Wmissing-field-initializers')

				  c_args += '-Wno-missing-field-initializers'

				endif

				foreach a : ['missing-field-initializers', 'format-truncation']

				  if cc.has_argument('-W' + a)

				    c_args += '-Wno-' + a

				  endif

				endforeach

				c_vis_args = []

				if cc.has_argument('-fvisibility=hidden')

				@@ -795,7 +820,8 @@ endif

				# Check for generic C++ arguments

				cpp_args = []

				foreach a : ['-fno-math-errno', '-fno-trapping-math',

				foreach a : ['-Werror=return-type',

				             '-fno-math-errno', '-fno-trapping-math',

				             '-Qunused-arguments']

				  if cpp.has_argument(a)

				    cpp_args += a

				@@ -805,7 +831,7 @@ endforeach

				# For some reason, the test for -Wno-foo always succeeds with gcc, even if the

				# option is not supported. Hence, check for -Wfoo instead.

				foreach a : ['non-virtual-dtor', 'missing-field-initializers']

				foreach a : ['non-virtual-dtor', 'missing-field-initializers', 'format-truncation']

				  if cpp.has_argument('-W' + a)

				    cpp_args += '-Wno-' + a

				  endif

				@@ -903,7 +929,7 @@ endif

				# case of cross compiling where we can use asm, and that's x86_64 -> x86 when

				# host OS == build OS, since in that case the build machine can run the host's

				# binaries.

				if meson.is_cross_build() 

				if with_asm and meson.is_cross_build()

				  if build_machine.system() != host_machine.system()

				    # TODO: It may be possible to do this with an exe_wrapper (like wine).

				    message('Cross compiling from one OS to another, disabling assembly.')

				@@ -925,7 +951,7 @@ endif

				with_asm_arch = ''

				if with_asm

				  if host_machine.cpu_family() == 'x86'

				    if system_has_kms_drm

				    if system_has_kms_drm or host_machine.system() == 'gnu'

				      with_asm_arch = 'x86'

				      pre_args += ['-DUSE_X86_ASM', '-DUSE_MMX_ASM', '-DUSE_3DNOW_ASM',

				                   '-DUSE_SSE_ASM']

				@@ -1064,6 +1090,13 @@ pre_args += '-DHAVE_ZLIB'

				dep_thread = dependency('threads')

				if dep_thread.found() and host_machine.system() != 'windows'

				  pre_args += '-DHAVE_PTHREAD'

				  if cc.has_function(

				      'pthread_setaffinity_np',

				      dependencies : dep_thread,

				      prefix : '#include <pthread.h>',

				      args : '-D_GNU_SOURCE')

				    pre_args += '-DHAVE_PTHREAD_SETAFFINITY'

				  endif

				endif

				dep_expat = dependency('expat')

				# this only exists on linux so either this is linux and it will be found, or

				@@ -1078,14 +1111,12 @@ dep_libdrm_amdgpu = null_dep

				dep_libdrm_radeon = null_dep

				dep_libdrm_nouveau = null_dep

				dep_libdrm_etnaviv = null_dep

				dep_libdrm_freedreno = null_dep

				dep_libdrm_intel = null_dep

				_drm_amdgpu_ver = '2.4.91'

				_drm_amdgpu_ver = '2.4.97'

				_drm_radeon_ver = '2.4.71'

				_drm_nouveau_ver = '2.4.66'

				_drm_etnaviv_ver = '2.4.89'

				_drm_freedreno_ver = '2.4.92'

				_drm_intel_ver = '2.4.75'

				_drm_ver = '2.4.75'

				@@ -1096,7 +1127,6 @@ _libdrm_checks = [

				              with_gallium_r300 or with_gallium_r600)],

				  ['nouveau', (with_gallium_nouveau or with_dri_nouveau)],

				  ['etnaviv', with_gallium_etnaviv],

				  ['freedreno', with_gallium_freedreno],

				]

				# VC4 only needs core libdrm support of this version, not a libdrm_vc4

				@@ -1154,45 +1184,41 @@ if with_gallium_opencl

				    'all-targets', 'linker', 'coverage', 'instrumentation', 'ipo', 'irreader',

				    'lto', 'option', 'objcarcopts', 'profiledata',

				  ]

				  llvm_optional_modules += ['coroutines', 'opencl']

				  llvm_optional_modules += ['coroutines']

				endif

				if with_amd_vk or with_gallium_radeonsi or with_gallium_swr

				  _llvm_version = '>= 5.0.0'

				if with_amd_vk or with_gallium_radeonsi

				  _llvm_version = '>= 7.0.0'

				elif with_gallium_swr

				  _llvm_version = '>= 6.0.0'

				elif with_gallium_opencl or with_gallium_r600

				  _llvm_version = '>= 3.9.0'

				else

				  _llvm_version = '>= 3.3.0'

				endif

				_shared_llvm = get_option('shared-llvm')

				_llvm = get_option('llvm')

				if _llvm == 'auto'

				dep_llvm = null_dep

				with_llvm = false

				if _llvm != 'false'

				  dep_llvm = dependency(

				    'llvm',

				    version : _llvm_version,

				    modules : llvm_modules,

				    optional_modules : llvm_optional_modules,

				    required : with_amd_vk or with_gallium_radeonsi or with_gallium_swr or with_gallium_opencl,

				    required : (

				      with_amd_vk or with_gallium_radeonsi or with_gallium_swr or

				      with_gallium_opencl or _llvm == 'true'

				    ),

				    static : not _shared_llvm,

				    method : 'config-tool',

				  )

				  with_llvm = dep_llvm.found()

				elif _llvm == 'true'

				  dep_llvm = dependency(

				    'llvm',

				    version : _llvm_version,

				    modules : llvm_modules,

				    optional_modules : llvm_optional_modules,

				  )

				  with_llvm = true

				else

				  dep_llvm = null_dep

				  with_llvm = false

				endif

				if with_llvm

				  _llvm_version = dep_llvm.version().split('.')

				  # Development versions of LLVM have an 'svn' or 'git' suffix, we don't want

				  # that for our version checks.

				  # svn suffixes are stripped by meson as of 0.43, and git suffixes are

				  # strippped as of 0.44, but we support older meson versions.

				  # 3 digits versions in LLVM only started from 3.4.1 on

				  if dep_llvm.version().version_compare('>= 3.4.1')

				@@ -1201,11 +1227,6 @@ if with_llvm

				    _llvm_patch = '0'

				  endif

				  if _llvm_patch.endswith('svn')

				    _llvm_patch = _llvm_patch.split('s')[0]

				  elif _llvm_patch.contains('git')

				    _llvm_patch = _llvm_patch.split('g')[0]

				  endif

				  pre_args += [

				    '-DHAVE_LLVM=0x0@0@0@1@'.format(_llvm_version[0], _llvm_version[1]),

				    '-DMESA_LLVM_VERSION_PATCH=@0@'.format(_llvm_patch),

				@@ -1215,6 +1236,9 @@ if with_llvm

				  # programs, so we need to build all C++ code in mesa without rtti as well to

				  # ensure that linking works.

				  if dep_llvm.get_configtool_variable('has-rtti') == 'NO'

				    if with_gallium_nouveau

				      error('The Nouveau driver requires rtti. You either need to turn off nouveau or use an LLVM built with LLVM_ENABLE_RTTI.')

				    endif

				    cpp_args += '-fno-rtti'

				  endif

				elif with_amd_vk or with_gallium_radeonsi or with_gallium_swr

				@@ -1289,7 +1313,13 @@ endif

				# TODO: symbol mangling

				if with_platform_wayland

				  prog_wl_scanner = find_program('wayland-scanner')

				  dep_wl_scanner = dependency('wayland-scanner', native: true)

				  prog_wl_scanner = find_program(dep_wl_scanner.get_pkgconfig_variable('wayland_scanner'))

				  if dep_wl_scanner.version().version_compare('>= 1.15')

				    wl_scanner_arg = 'private-code'

				  else

				    wl_scanner_arg = 'code'

				  endif

				  dep_wl_protocols = dependency('wayland-protocols', version : '>= 1.8')

				  dep_wayland_client = dependency('wayland-client', version : '>=1.11')

				  dep_wayland_server = dependency('wayland-server', version : '>=1.11')

				@@ -1336,7 +1366,7 @@ if with_platform_x11

				    dep_xfixes = dependency('xfixes')

				    dep_xcb_glx = dependency('xcb-glx', version : '>= 1.8.1')

				  endif

				  if (with_any_vk or with_glx == 'dri' or

				  if (with_any_vk or with_glx == 'dri' or with_egl or

				       (with_gallium_vdpau or with_gallium_xvmc or with_gallium_va or

				        with_gallium_omx != 'disabled'))

				    dep_xcb = dependency('xcb')

				@@ -1358,12 +1388,14 @@ if with_platform_x11

				      dep_xshmfence = dependency('xshmfence', version : '>= 1.1')

				    endif

				  endif

				  if with_glx == 'dri'

				  if with_glx == 'dri' or with_glx == 'gallium-xlib'

				    dep_glproto = dependency('glproto', version : '>= 1.4.14')

				  endif

				  if with_glx == 'dri' 

				    if with_dri_platform == 'drm'

				      dep_dri2proto = dependency('dri2proto', version : '>= 2.8')

				      dep_xxf86vm = dependency('xxf86vm')

				    endif

				    dep_glproto = dependency('glproto', version : '>= 1.4.14')

				  endif

				  if (with_egl or (

				      with_gallium_vdpau or with_gallium_xvmc or with_gallium_xa or

				@@ -1371,7 +1403,7 @@ if with_platform_x11

				    dep_xcb_xfixes = dependency('xcb-xfixes')

				  endif

				  if with_xlib_lease

				    dep_xcb_xrandr = dependency('xcb-randr', version : '>= 1.12')

				    dep_xcb_xrandr = dependency('xcb-randr')

				    dep_xlib_xrandr = dependency('xrandr', version : '>= 1.3')

				  endif

				endif

									
										22

meson_options.txt
									
												View File
												
				@@ -25,7 +25,7 @@ option(

				  choices : [

				    '', 'auto', 'x11', 'wayland', 'drm', 'surfaceless', 'haiku', 'android',

				  ],

				  description : 'comma separated list of window systems to support. If this is set to auto all platforms applicable to the OS will be enabled.'

				  description : 'window systems to support. If this is set to `auto`, all platforms applicable will be enabled.'

				)

				option(

				  'dri3',

				@@ -58,8 +58,8 @@ option(

				  type : 'array',

				  value : ['auto'],

				  choices : [

				    '', 'auto', 'pl111', 'radeonsi', 'r300', 'r600', 'nouveau', 'freedreno',

				    'swrast', 'v3d', 'vc4', 'etnaviv', 'imx', 'tegra', 'i915', 'svga', 'virgl',

				    '', 'auto', 'kmsro', 'radeonsi', 'r300', 'r600', 'nouveau', 'freedreno',

				    'swrast', 'v3d', 'vc4', 'etnaviv', 'tegra', 'i915', 'svga', 'virgl',

				    'swr',

				  ],

				  description : 'List of gallium drivers to build. If this is set to auto all drivers applicable to the target OS/architecture will be built'

				@@ -237,6 +237,12 @@ option(

				  choices : ['auto', 'true', 'false'],

				  description : 'Build with LLVM support.'

				)

				option(

				  'shared-llvm',

				  type : 'boolean',

				  value : true,

				  description : 'Whether to link llvm shared or statically.'

				)

				option(

				  'valgrind',

				  type : 'combo',

				@@ -295,8 +301,8 @@ option(

				  'tools',

				  type : 'array',

				  value : [],

				  choices : ['freedreno', 'glsl', 'intel', 'nir', 'nouveau', 'xvmc', 'all'],

				  description : 'List of tools to build.',

				  choices : ['etnaviv', 'freedreno', 'glsl', 'intel', 'intel-ui', 'nir', 'nouveau', 'xvmc', 'all'],

				  description : 'List of tools to build. (Note: `intel-ui` selects `intel`)',

				)

				option(

				  'power8',

				@@ -312,3 +318,9 @@ option(

				  choices : ['auto', 'true', 'false'],

				  description : 'Enable VK_EXT_acquire_xlib_display.'

				)

				option(

				  'glx-direct',

				  type : 'boolean',

				  value : true,

				  description : 'Enable direct rendering in GLX and EGL for DRI',

				)

									
										7

scons/custom.py
									
												View File
												
				@@ -48,7 +48,12 @@ import source_list

				# a path directly. We want to support both, so we need to detect the SCons version,

				# for which no API is provided by SCons 8-P

				scons_version = tuple(map(int, SCons.__version__.split('.')))

				# Scons version string has consistently been in this format:

				# MajorVersion.MinorVersion.Patch[.alpha/beta.yyyymmdd]

				# so this formula should cover all versions regardless of type

				# stable, alpha or beta.

				# For simplicity alpha and beta flags are removed.

				scons_version = tuple(map(int, SCons.__version__.split('.')[:3]))

				def quietCommandLines(env):

				    # Quiet command lines

									
										32

scons/gallium.py
									
												View File
												
				@@ -29,6 +29,7 @@ Frontend-tool for Gallium3D architecture.

				# SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

				#

				from __future__ import print_function

				import distutils.version

				import os

				@@ -221,10 +222,6 @@ def generate(env):

				    env['suncc'] = env['platform'] == 'sunos' and os.path.basename(env['CC']) == 'cc'

				    env['icc'] = 'icc' == os.path.basename(env['CC'])

				    if env['msvc'] and env['toolchain'] == 'default' and env['machine'] == 'x86_64':

				        # MSVC x64 support is broken in earlier versions of scons

				        env.EnsurePythonVersion(2, 0)

				    # shortcuts

				    machine = env['machine']

				    platform = env['platform']

				@@ -311,7 +308,20 @@ def generate(env):

				    if env.GetOption('num_jobs') <= 1:

				        env.SetOption('num_jobs', num_jobs())

				    env.Decider('MD5-timestamp')

				    # Speed up dependency checking.  See

				    # - https://github.com/SCons/scons/wiki/GoFastButton

				    # - https://bugs.freedesktop.org/show_bug.cgi?id=109443

				    # Scons version string has consistently been in this format:

				    # MajorVersion.MinorVersion.Patch[.alpha/beta.yyyymmdd]

				    # so this formula should cover all versions regardless of type

				    # stable, alpha or beta.

				    # For simplicity alpha and beta flags are removed.

				    scons_version = distutils.version.StrictVersion('.'.join(SCons.__version__.split('.')[:3]))

				    if scons_version < distutils.version.StrictVersion('3.0.2') or \

				       scons_version > distutils.version.StrictVersion('3.0.4'):

				        env.Decider('MD5-timestamp')

				    env.SetOption('max_drift', 60)

				    # C preprocessor options

				@@ -679,6 +689,18 @@ def generate(env):

				    env.PkgCheckModules('XF86VIDMODE', ['xxf86vm'])

				    env.PkgCheckModules('DRM', ['libdrm >= 2.4.75'])

				    if not os.path.exists("src/util/format_srgb.c"):

				        print("Checking for Python Mako module (>= 0.8.0)... ", end='')

				        try:

				            import mako

				        except ImportError:

				            print("no")

				            exit(1)

				        if distutils.version.StrictVersion(mako.__version__) < distutils.version.StrictVersion('0.8.0'):

				            print("no")

				            exit(1)

				        print("yes")

				    if env['x11']:

				        env.Append(CPPPATH = env['X11_CPPPATH'])

									
										3

scons/llvm.py
									
												View File
												
				@@ -99,9 +99,6 @@ def generate(env):

				            return

				        env.Prepend(CPPPATH = [os.path.join(llvm_dir, 'include')])

				        env.AppendUnique(CPPDEFINES = [

				            'HAVE_STDINT_H',

				        ])

				        env.Prepend(LIBPATH = [os.path.join(llvm_dir, 'lib')])

				        # LIBS should match the output of `llvm-config --libs engine mcjit bitwriter x86asmprinter irreader`

				        if llvm_version >= distutils.version.LooseVersion('5.0'):

									
										11

src/Makefile.am
									
												View File
												
				@@ -20,13 +20,13 @@

				# IN THE SOFTWARE.

				.PHONY: git_sha1.h

				git_sha1.h: $(top_srcdir)/src/git_sha1.h.in

				git_sha1.h:

					@echo "updating $@"

					@$(PYTHON2) $(top_srcdir)/bin/git_sha1_gen.py --output $@

					@$(PYTHON) $(top_srcdir)/bin/git_sha1_gen.py --output $@

				BUILT_SOURCES = git_sha1.h

				CLEANFILES = $(BUILT_SOURCES)

				EXTRA_DIST = git_sha1.h.in meson.build

				EXTRA_DIST = meson.build

				SUBDIRS = . gtest util mapi/glapi/gen mapi

				@@ -39,6 +39,7 @@ gl_HEADERS = \

				  $(top_srcdir)/include/GL/gl_mangle.h

				endif

				# TODO: When building with glvnd, glvnd itself should be providing gl.pc and the headers

				if HAVE_GLX

				glxdir = $(includedir)/GL

				glx_HEADERS = \

				@@ -80,6 +81,10 @@ if HAVE_BROADCOM_DRIVERS

				SUBDIRS += broadcom

				endif

				if HAVE_FREEDRENO_DRIVERS

				SUBDIRS += freedreno

				endif

				if NEED_OPENGL_COMMON

				SUBDIRS += mesa

				endif

									
										7

src/SConscript
									
												View File
												
				@@ -42,10 +42,6 @@ env.Append(CPPPATH = ["#" + env['build_dir']])

				if env['platform'] != 'windows':

				    SConscript('loader/SConscript')

				# When env['gles'] is set, the targets defined in mapi/glapi/SConscript are not

				# used.  libgl-xlib and libgl-gdi adapt themselves to use the targets defined

				# in mapi/glapi-shared/SConscript.  mesa/SConscript also adapts itself to

				# enable OpenGL ES support.

				SConscript('mapi/glapi/gen/SConscript')

				SConscript('mapi/glapi/SConscript')

				@@ -61,8 +57,5 @@ if not env['embedded']:

				    if env['platform'] == 'haiku':

				        SConscript('egl/SConscript')

				    if env['gles']:

				        SConscript('mapi/shared-glapi/SConscript')

				SConscript('gallium/SConscript')

									
										11

src/amd/Android.addrlib.mk
									
												View File
												
				@@ -33,12 +33,11 @@ LOCAL_SRC_FILES := $(ADDRLIB_FILES)

				LOCAL_C_INCLUDES := \

					$(MESA_TOP)/src \

					$(MESA_TOP)/src/amd/common \

					$(MESA_TOP)/src/amd/addrlib \

					$(MESA_TOP)/src/amd/addrlib/core \

					$(MESA_TOP)/src/amd/addrlib/inc/chip/gfx9 \

					$(MESA_TOP)/src/amd/addrlib/inc/chip/r800 \

					$(MESA_TOP)/src/amd/addrlib/gfx9/chip \

					$(MESA_TOP)/src/amd/addrlib/r800/chip

					$(MESA_TOP)/src/amd/addrlib/inc \

					$(MESA_TOP)/src/amd/addrlib/src \

					$(MESA_TOP)/src/amd/addrlib/src/core \

					$(MESA_TOP)/src/amd/addrlib/src/chip/gfx9 \

					$(MESA_TOP)/src/amd/addrlib/src/chip/r800

				LOCAL_EXPORT_C_INCLUDE_DIRS := \

					$(LOCAL_PATH) \

									
										12

src/amd/Makefile.addrlib.am
									
												View File
												
				@@ -24,13 +24,13 @@ ADDRLIB_LIBS = addrlib/libamdgpu_addrlib.la

				addrlib_libamdgpu_addrlib_la_CPPFLAGS = \

					$(DEFINES) \

					-I$(top_srcdir)/src/ \

					-I$(top_srcdir)/include \

					-I$(srcdir)/common \

					-I$(srcdir)/addrlib \

					-I$(srcdir)/addrlib/core \

					-I$(srcdir)/addrlib/inc/chip/gfx9 \

					-I$(srcdir)/addrlib/inc/chip/r800 \

					-I$(srcdir)/addrlib/gfx9/chip \

					-I$(srcdir)/addrlib/r800/chip

					-I$(srcdir)/addrlib/inc \

					-I$(srcdir)/addrlib/src \

					-I$(srcdir)/addrlib/src/core \

					-I$(srcdir)/addrlib/src/chip/gfx9 \

					-I$(srcdir)/addrlib/src/chip/r800

				addrlib_libamdgpu_addrlib_la_CXXFLAGS = \

					$(VISIBILITY_CXXFLAGS) $(CXX11_CXXFLAGS)

									
										2

src/amd/Makefile.common.am
									
												View File
												
				@@ -66,6 +66,6 @@ common_libamd_common_la_LIBADD = $(LIBELF_LIBS)

				common/sid_tables.h: $(srcdir)/common/sid_tables.py $(srcdir)/common/sid.h $(srcdir)/common/gfx9d.h

					$(AM_V_at)$(MKDIR_P) $(@D)

					$(AM_V_GEN) $(PYTHON2) $(srcdir)/common/sid_tables.py $(srcdir)/common/sid.h $(srcdir)/common/gfx9d.h > $@

					$(AM_V_GEN) $(PYTHON) $(srcdir)/common/sid_tables.py $(srcdir)/common/sid.h $(srcdir)/common/gfx9d.h > $@

				BUILT_SOURCES = $(AMD_GENERATED_FILES)

									
										56

src/amd/Makefile.sources
									
												View File
												
				@@ -5,35 +5,33 @@ COMMON_HEADER_FILES = \

					common/amd_kernel_code_t.h

				ADDRLIB_FILES = \

					addrlib/addrinterface.cpp \

					addrlib/addrinterface.h \

					addrlib/addrtypes.h \

					addrlib/amdgpu_asic_addr.h \

					addrlib/core/addrcommon.h \

					addrlib/core/addrelemlib.cpp \

					addrlib/core/addrelemlib.h \

					addrlib/core/addrlib.cpp \

					addrlib/core/addrlib.h \

					addrlib/core/addrlib1.cpp \

					addrlib/core/addrlib1.h \

					addrlib/core/addrlib2.cpp \

					addrlib/core/addrlib2.h \

					addrlib/core/addrobject.cpp \

					addrlib/core/addrobject.h \

					addrlib/gfx9/chip/gfx9_enum.h \

					addrlib/gfx9/coord.cpp \

					addrlib/gfx9/coord.h \

					addrlib/gfx9/gfx9addrlib.cpp \

					addrlib/gfx9/gfx9addrlib.h \

					addrlib/inc/chip/gfx9/gfx9_gb_reg.h \

					addrlib/inc/chip/r800/si_gb_reg.h \

					addrlib/r800/chip/si_ci_vi_merged_enum.h \

					addrlib/r800/ciaddrlib.cpp \

					addrlib/r800/ciaddrlib.h \

					addrlib/r800/egbaddrlib.cpp \

					addrlib/r800/egbaddrlib.h \

					addrlib/r800/siaddrlib.cpp \

					addrlib/r800/siaddrlib.h

					addrlib/inc/addrinterface.h \

					addrlib/inc/addrtypes.h \

					addrlib/src/addrinterface.cpp \

					addrlib/src/amdgpu_asic_addr.h \

					addrlib/src/core/addrcommon.h \

					addrlib/src/core/addrelemlib.cpp \

					addrlib/src/core/addrelemlib.h \

					addrlib/src/core/addrlib.cpp \

					addrlib/src/core/addrlib.h \

					addrlib/src/core/addrlib1.cpp \

					addrlib/src/core/addrlib1.h \

					addrlib/src/core/addrlib2.cpp \

					addrlib/src/core/addrlib2.h \

					addrlib/src/core/addrobject.cpp \

					addrlib/src/core/addrobject.h \

					addrlib/src/core/coord.cpp \

					addrlib/src/core/coord.h \

					addrlib/src/gfx9/gfx9addrlib.cpp \

					addrlib/src/gfx9/gfx9addrlib.h \

					addrlib/src/chip/gfx9/gfx9_gb_reg.h \

					addrlib/src/chip/r800/si_gb_reg.h \

					addrlib/src/r800/ciaddrlib.cpp \

					addrlib/src/r800/ciaddrlib.h \

					addrlib/src/r800/egbaddrlib.cpp \

					addrlib/src/r800/egbaddrlib.h \

					addrlib/src/r800/siaddrlib.cpp \

					addrlib/src/r800/siaddrlib.h

				AMD_COMPILER_FILES = \

					common/ac_binary.c \

10535

src/amd/addrlib/gfx9/chip/gfx9_enum.h

View File

File diff suppressed because it is too large Load Diff

									
										162

src/amd/addrlib/addrinterface.h → src/amd/addrlib/inc/addrinterface.h
									
												View File
												
				@@ -1,5 +1,5 @@

				/*

				 * Copyright © 2014 Advanced Micro Devices, Inc.

				 * Copyright © 2007-2018 Advanced Micro Devices, Inc.

				 * All Rights Reserved.

				 *

				 * Permission is hereby granted, free of charge, to any person obtaining

				@@ -177,7 +177,6 @@ typedef struct _ADDR_EQUATION

				                                                       ///< stacked vertically prior to swizzling

				} ADDR_EQUATION;

				/**

				****************************************************************************************************

				* @brief Alloc system memory flags.

				@@ -409,8 +408,6 @@ ADDR_E_RETURNCODE ADDR_API AddrCreate(

				    const ADDR_CREATE_INPUT*    pAddrCreateIn,

				    ADDR_CREATE_OUTPUT*         pAddrCreateOut);

				/**

				****************************************************************************************************

				*   AddrDestroy

				@@ -425,8 +422,6 @@ ADDR_E_RETURNCODE ADDR_API AddrCreate(

				ADDR_E_RETURNCODE ADDR_API AddrDestroy(

				    ADDR_HANDLE hLib);

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                    Surface functions

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -658,8 +653,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeSurfaceInfo(

				    const ADDR_COMPUTE_SURFACE_INFO_INPUT*  pIn,

				    ADDR_COMPUTE_SURFACE_INFO_OUTPUT*       pOut);

				/**

				****************************************************************************************************

				*   ADDR_COMPUTE_SURFACE_ADDRFROMCOORD_INPUT

				@@ -748,8 +741,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeSurfaceAddrFromCoord(

				    const ADDR_COMPUTE_SURFACE_ADDRFROMCOORD_INPUT* pIn,

				    ADDR_COMPUTE_SURFACE_ADDRFROMCOORD_OUTPUT*      pOut);

				/**

				****************************************************************************************************

				*   ADDR_COMPUTE_SURFACE_COORDFROMADDR_INPUT

				@@ -931,8 +922,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeHtileInfo(

				    const ADDR_COMPUTE_HTILE_INFO_INPUT*    pIn,

				    ADDR_COMPUTE_HTILE_INFO_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR_COMPUTE_HTILE_ADDRFROMCOORD_INPUT

				@@ -995,8 +984,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeHtileAddrFromCoord(

				    const ADDR_COMPUTE_HTILE_ADDRFROMCOORD_INPUT*   pIn,

				    ADDR_COMPUTE_HTILE_ADDRFROMCOORD_OUTPUT*        pOut);

				/**

				****************************************************************************************************

				*   ADDR_COMPUTE_HTILE_COORDFROMADDR_INPUT

				@@ -1057,8 +1044,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeHtileCoordFromAddr(

				    const ADDR_COMPUTE_HTILE_COORDFROMADDR_INPUT*   pIn,

				    ADDR_COMPUTE_HTILE_COORDFROMADDR_OUTPUT*        pOut);

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     C-mask functions

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -1146,8 +1131,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeCmaskInfo(

				    const ADDR_COMPUTE_CMASK_INFO_INPUT*    pIn,

				    ADDR_COMPUTE_CMASK_INFO_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR_COMPUTE_CMASK_ADDRFROMCOORD_INPUT

				@@ -1208,8 +1191,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeCmaskAddrFromCoord(

				    const ADDR_COMPUTE_CMASK_ADDRFROMCOORD_INPUT*   pIn,

				    ADDR_COMPUTE_CMASK_ADDRFROMCOORD_OUTPUT*        pOut);

				/**

				****************************************************************************************************

				*   ADDR_COMPUTE_CMASK_COORDFROMADDR_INPUT

				@@ -1268,8 +1249,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeCmaskCoordFromAddr(

				    const ADDR_COMPUTE_CMASK_COORDFROMADDR_INPUT*   pIn,

				    ADDR_COMPUTE_CMASK_COORDFROMADDR_OUTPUT*        pOut);

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     F-mask functions

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -1350,8 +1329,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeFmaskInfo(

				    const ADDR_COMPUTE_FMASK_INFO_INPUT*    pIn,

				    ADDR_COMPUTE_FMASK_INFO_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR_COMPUTE_FMASK_ADDRFROMCOORD_INPUT

				@@ -1428,8 +1405,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeFmaskAddrFromCoord(

				    const ADDR_COMPUTE_FMASK_ADDRFROMCOORD_INPUT*   pIn,

				    ADDR_COMPUTE_FMASK_ADDRFROMCOORD_OUTPUT*        pOut);

				/**

				****************************************************************************************************

				*   ADDR_COMPUTE_FMASK_COORDFROMADDR_INPUT

				@@ -1503,8 +1478,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeFmaskCoordFromAddr(

				    const ADDR_COMPUTE_FMASK_COORDFROMADDR_INPUT*   pIn,

				    ADDR_COMPUTE_FMASK_COORDFROMADDR_OUTPUT*        pOut);

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                          Element/utility functions

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -1593,7 +1566,6 @@ ADDR_E_RETURNCODE ADDR_API AddrExtractBankPipeSwizzle(

				    const ADDR_EXTRACT_BANKPIPE_SWIZZLE_INPUT*  pIn,

				    ADDR_EXTRACT_BANKPIPE_SWIZZLE_OUTPUT*       pOut);

				/**

				****************************************************************************************************

				*   ADDR_COMBINE_BANKPIPE_SWIZZLE_INPUT

				@@ -1651,8 +1623,6 @@ ADDR_E_RETURNCODE ADDR_API AddrCombineBankPipeSwizzle(

				    const ADDR_COMBINE_BANKPIPE_SWIZZLE_INPUT*  pIn,

				    ADDR_COMBINE_BANKPIPE_SWIZZLE_OUTPUT*       pOut);

				/**

				****************************************************************************************************

				*   ADDR_COMPUTE_SLICESWIZZLE_INPUT

				@@ -1679,8 +1649,6 @@ typedef struct _ADDR_COMPUTE_SLICESWIZZLE_INPUT

				                                        ///< README: When tileIndex is not -1, this must be valid

				} ADDR_COMPUTE_SLICESWIZZLE_INPUT;

				/**

				****************************************************************************************************

				*   ADDR_COMPUTE_SLICESWIZZLE_OUTPUT

				@@ -1711,7 +1679,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeSliceSwizzle(

				    const ADDR_COMPUTE_SLICESWIZZLE_INPUT*  pIn,

				    ADDR_COMPUTE_SLICESWIZZLE_OUTPUT*       pOut);

				/**

				****************************************************************************************************

				*   AddrSwizzleGenOption

				@@ -1802,8 +1769,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeBaseSwizzle(

				    const ADDR_COMPUTE_BASE_SWIZZLE_INPUT*  pIn,

				    ADDR_COMPUTE_BASE_SWIZZLE_OUTPUT*       pOut);

				/**

				****************************************************************************************************

				*   ELEM_GETEXPORTNORM_INPUT

				@@ -1844,8 +1809,6 @@ BOOL_32 ADDR_API ElemGetExportNorm(

				    ADDR_HANDLE                     hLib,

				    const ELEM_GETEXPORTNORM_INPUT* pIn);

				/**

				****************************************************************************************************

				*   ELEM_FLT32TODEPTHPIXEL_INPUT

				@@ -1901,8 +1864,6 @@ ADDR_E_RETURNCODE ADDR_API ElemFlt32ToDepthPixel(

				    const ELEM_FLT32TODEPTHPIXEL_INPUT* pIn,

				    ELEM_FLT32TODEPTHPIXEL_OUTPUT*      pOut);

				/**

				****************************************************************************************************

				*   ELEM_FLT32TOCOLORPIXEL_INPUT

				@@ -1956,6 +1917,21 @@ ADDR_E_RETURNCODE ADDR_API ElemFlt32ToColorPixel(

				    const ELEM_FLT32TOCOLORPIXEL_INPUT* pIn,

				    ELEM_FLT32TOCOLORPIXEL_OUTPUT*      pOut);

				/**

				****************************************************************************************************

				*   ElemSize

				*

				*   @brief

				*       Get bits-per-element for specified format

				*

				*   @return

				*       Bits-per-element of specified format

				*

				****************************************************************************************************

				*/

				UINT_32 ADDR_API ElemSize(

				    ADDR_HANDLE hLib,

				    AddrFormat  format);

				/**

				****************************************************************************************************

				@@ -2014,8 +1990,6 @@ ADDR_E_RETURNCODE ADDR_API AddrConvertTileInfoToHW(

				    const ADDR_CONVERT_TILEINFOTOHW_INPUT*  pIn,

				    ADDR_CONVERT_TILEINFOTOHW_OUTPUT*       pOut);

				/**

				****************************************************************************************************

				*   ADDR_CONVERT_TILEINDEX_INPUT

				@@ -2140,8 +2114,6 @@ ADDR_E_RETURNCODE ADDR_API AddrConvertTileIndex1(

				    const ADDR_CONVERT_TILEINDEX1_INPUT*    pIn,

				    ADDR_CONVERT_TILEINDEX_OUTPUT*          pOut);

				/**

				****************************************************************************************************

				*   ADDR_GET_TILEINDEX_INPUT

				@@ -2187,8 +2159,6 @@ ADDR_E_RETURNCODE ADDR_API AddrGetTileIndex(

				    const ADDR_GET_TILEINDEX_INPUT* pIn,

				    ADDR_GET_TILEINDEX_OUTPUT*      pOut);

				/**

				****************************************************************************************************

				*   ADDR_PRT_INFO_INPUT

				@@ -2233,8 +2203,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputePrtInfo(

				    const ADDR_PRT_INFO_INPUT*  pIn,

				    ADDR_PRT_INFO_OUTPUT*       pOut);

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     DCC key functions

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -2295,8 +2263,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeDccInfo(

				    const ADDR_COMPUTE_DCCINFO_INPUT*       pIn,

				    ADDR_COMPUTE_DCCINFO_OUTPUT*            pOut);

				/**

				****************************************************************************************************

				*   ADDR_GET_MAX_ALINGMENTS_OUTPUT

				@@ -2360,7 +2326,6 @@ ADDR_E_RETURNCODE ADDR_API AddrGetMaxMetaAlignments(

				*

				**/

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                    Surface functions for Gfx9

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -2395,7 +2360,8 @@ typedef union _ADDR2_SURFACE_FLAGS

				        UINT_32 noMetadata        :  1; ///< This resource has no metadata

				        UINT_32 metaRbUnaligned   :  1; ///< This resource has rb unaligned metadata

				        UINT_32 metaPipeUnaligned :  1; ///< This resource has pipe unaligned metadata

				        UINT_32 reserved          : 14; ///< Reserved bits

				        UINT_32 view3dAs2dArray   :  1; ///< This resource is a 3D resource viewed as 2D array

				        UINT_32 reserved          : 13; ///< Reserved bits

				    };

				    UINT_32 value;

				@@ -2523,8 +2489,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeSurfaceInfo(

				    const ADDR2_COMPUTE_SURFACE_INFO_INPUT*    pIn,

				    ADDR2_COMPUTE_SURFACE_INFO_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR2_COMPUTE_SURFACE_ADDRFROMCOORD_INPUT

				@@ -2591,8 +2555,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeSurfaceAddrFromCoord(

				    const ADDR2_COMPUTE_SURFACE_ADDRFROMCOORD_INPUT*    pIn,

				    ADDR2_COMPUTE_SURFACE_ADDRFROMCOORD_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR2_COMPUTE_SURFACE_COORDFROMADDR_INPUT

				@@ -2658,8 +2620,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeSurfaceCoordFromAddr(

				    const ADDR2_COMPUTE_SURFACE_COORDFROMADDR_INPUT*    pIn,

				    ADDR2_COMPUTE_SURFACE_COORDFROMADDR_OUTPUT*         pOut);

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                   HTile functions for Gfx9

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -2710,8 +2670,10 @@ typedef struct _ADDR2_META_MIP_INFO

				        struct

				        {

				            UINT_32    offset;

				            UINT_32    sliceSize;

				            UINT_32    offset;      ///< Metadata offset within one slice,

				                                    ///  the thickness of a slice is meta block depth.

				            UINT_32    sliceSize;   ///< Metadata size within one slice,

				                                    ///  the thickness of a slice is meta block depth.

				        };

				    };

				} ADDR2_META_MIP_INFO;

				@@ -2735,7 +2697,9 @@ typedef struct _ADDR2_COMPUTE_HTILE_INFO_INPUT

				    UINT_32             unalignedHeight;    ///< Depth surface original height (of mip0)

				    UINT_32             numSlices;          ///< Number of slices of depth surface (of mip0)

				    UINT_32             numMipLevels;       ///< Total mipmap levels of color surface

				    UINT_32             firstMipIdInTail;

				    UINT_32             firstMipIdInTail;   ///  Id of the first mip in tail,

				                                            ///  if no mip is in tail, it should be set to

				                                            ///  number of mip levels

				} ADDR2_COMPUTE_HTILE_INFO_INPUT;

				/**

				@@ -2777,8 +2741,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeHtileInfo(

				    const ADDR2_COMPUTE_HTILE_INFO_INPUT*    pIn,

				    ADDR2_COMPUTE_HTILE_INFO_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR2_COMPUTE_HTILE_ADDRFROMCOORD_INPUT

				@@ -2836,8 +2798,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeHtileAddrFromCoord(

				    const ADDR2_COMPUTE_HTILE_ADDRFROMCOORD_INPUT*    pIn,

				    ADDR2_COMPUTE_HTILE_ADDRFROMCOORD_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR2_COMPUTE_HTILE_COORDFROMADDR_INPUT

				@@ -2896,8 +2856,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeHtileCoordFromAddr(

				    const ADDR2_COMPUTE_HTILE_COORDFROMADDR_INPUT*    pIn,

				    ADDR2_COMPUTE_HTILE_COORDFROMADDR_OUTPUT*         pOut);

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     C-mask functions for Gfx9

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -2963,8 +2921,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeCmaskInfo(

				    const ADDR2_COMPUTE_CMASK_INFO_INPUT*    pIn,

				    ADDR2_COMPUTE_CMASK_INFO_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR2_COMPUTE_CMASK_ADDRFROMCOORD_INPUT

				@@ -3026,8 +2982,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeCmaskAddrFromCoord(

				    const ADDR2_COMPUTE_CMASK_ADDRFROMCOORD_INPUT*   pIn,

				    ADDR2_COMPUTE_CMASK_ADDRFROMCOORD_OUTPUT*        pOut);

				/**

				****************************************************************************************************

				*   ADDR2_COMPUTE_CMASK_COORDFROMADDR_INPUT

				@@ -3086,8 +3040,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeCmaskCoordFromAddr(

				    const ADDR2_COMPUTE_CMASK_COORDFROMADDR_INPUT*    pIn,

				    ADDR2_COMPUTE_CMASK_COORDFROMADDR_OUTPUT*         pOut);

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     F-mask functions for Gfx9

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -3170,8 +3122,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeFmaskInfo(

				    const ADDR2_COMPUTE_FMASK_INFO_INPUT*    pIn,

				    ADDR2_COMPUTE_FMASK_INFO_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR2_COMPUTE_FMASK_ADDRFROMCOORD_INPUT

				@@ -3231,8 +3181,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeFmaskAddrFromCoord(

				    const ADDR2_COMPUTE_FMASK_ADDRFROMCOORD_INPUT*    pIn,

				    ADDR2_COMPUTE_FMASK_ADDRFROMCOORD_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR2_COMPUTE_FMASK_COORDFROMADDR_INPUT

				@@ -3291,8 +3239,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeFmaskCoordFromAddr(

				    const ADDR2_COMPUTE_FMASK_COORDFROMADDR_INPUT*    pIn,

				    ADDR2_COMPUTE_FMASK_COORDFROMADDR_OUTPUT*         pOut);

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     DCC key functions for Gfx9

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -3321,7 +3267,8 @@ typedef struct _ADDR2_COMPUTE_DCCINFO_INPUT

				    UINT_32             numMipLevels;       ///< Total mipmap levels of color surface

				    UINT_32             dataSurfaceSize;    ///< The padded size of all slices and mip levels

				                                            ///< useful in meta linear case

				    UINT_32             firstMipIdInTail;

				    UINT_32             firstMipIdInTail;   ///< The id of first mip in tail, if no mip is in tail,

				                                            ///  it should be number of mip levels

				} ADDR2_COMPUTE_DCCINFO_INPUT;

				/**

				@@ -3356,7 +3303,9 @@ typedef struct _ADDR2_COMPUTE_DCCINFO_OUTPUT

				    union

				    {

				        UINT_32 fastClearSizePerSlice;  ///< Size of DCC within a slice should be fast cleared

				        UINT_32 dccRamSliceSize;

				        UINT_32 dccRamSliceSize;        ///< DCC ram size per slice. For mipmap, it's

				                                        ///  the slize size of a mip chain, the thickness of a

				                                        ///  a slice is meta block depth

				    };

				    ADDR2_META_MIP_INFO* pMipInfo;      ///< DCC mip information

				@@ -3376,7 +3325,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeDccInfo(

				    const ADDR2_COMPUTE_DCCINFO_INPUT*    pIn,

				    ADDR2_COMPUTE_DCCINFO_OUTPUT*         pOut);

				/**

				****************************************************************************************************

				*   ADDR2_COMPUTE_DCC_ADDRFROMCOORD_INPUT

				@@ -3628,6 +3576,55 @@ typedef union _ADDR2_SWTYPE_SET

				    UINT_32 value;

				} ADDR2_SWTYPE_SET;

				/**

				****************************************************************************************************

				*   ADDR2_SWMODE_SET

				*

				*   @brief

				*       Bit field that defines swizzle type

				****************************************************************************************************

				*/

				typedef union _ADDR2_SWMODE_SET

				{

				    struct

				    {

				        UINT_32 swLinear   : 1;

				        UINT_32 sw256B_S   : 1;

				        UINT_32 sw256B_D   : 1;

				        UINT_32 sw256B_R   : 1;

				        UINT_32 sw4KB_Z    : 1;

				        UINT_32 sw4KB_S    : 1;

				        UINT_32 sw4KB_D    : 1;

				        UINT_32 sw4KB_R    : 1;

				        UINT_32 sw64KB_Z   : 1;

				        UINT_32 sw64KB_S   : 1;

				        UINT_32 sw64KB_D   : 1;

				        UINT_32 sw64KB_R   : 1;

				        UINT_32 swVar_Z    : 1;

				        UINT_32 swVar_S    : 1;

				        UINT_32 swVar_D    : 1;

				        UINT_32 swVar_R    : 1;

				        UINT_32 sw64KB_Z_T : 1;

				        UINT_32 sw64KB_S_T : 1;

				        UINT_32 sw64KB_D_T : 1;

				        UINT_32 sw64KB_R_T : 1;

				        UINT_32 sw4KB_Z_X  : 1;

				        UINT_32 sw4KB_S_X  : 1;

				        UINT_32 sw4KB_D_X  : 1;

				        UINT_32 sw4KB_R_X  : 1;

				        UINT_32 sw64KB_Z_X : 1;

				        UINT_32 sw64KB_S_X : 1;

				        UINT_32 sw64KB_D_X : 1;

				        UINT_32 sw64KB_R_X : 1;

				        UINT_32 swVar_Z_X  : 1;

				        UINT_32 swVar_S_X  : 1;

				        UINT_32 swVar_D_X  : 1;

				        UINT_32 swVar_R_X  : 1;

				    };

				    UINT_32 value;

				} ADDR2_SWMODE_SET;

				/**

				****************************************************************************************************

				*   ADDR2_GET_PREFERRED_SURF_SETTING_INPUT

				@@ -3681,6 +3678,7 @@ typedef struct _ADDR2_GET_PREFERRED_SURF_SETTING_OUTPUT

				                                                ///  type

				    ADDR2_SWTYPE_SET      validSwTypeSet;       ///< Valid swizzle type bit combination

				    ADDR2_SWTYPE_SET      clientPreferredSwSet; ///< Client-preferred swizzle type bit combination

				    ADDR2_SWMODE_SET      validSwModeSet;       ///< Valid swizzle mode bit combination

				} ADDR2_GET_PREFERRED_SURF_SETTING_OUTPUT;

				/**

									
										33

src/amd/addrlib/addrtypes.h → src/amd/addrlib/inc/addrtypes.h
									
												View File
												
				@@ -1,5 +1,5 @@

				/*

				 * Copyright © 2014 Advanced Micro Devices, Inc.

				 * Copyright © 2007-2018 Advanced Micro Devices, Inc.

				 * All Rights Reserved.

				 *

				 * Permission is hereby granted, free of charge, to any person obtaining

				@@ -112,7 +112,6 @@ typedef int            INT;

				    #define GC_FASTCALL  ADDR_FASTCALL

				#endif

				#if defined(__GNUC__)

				    #define ADDR_INLINE static inline   // inline needs to be static to link

				#else

				@@ -353,7 +352,7 @@ typedef enum _AddrFormat {

				    ADDR_FMT_3_3_2                                = 0x00000003,

				    ADDR_FMT_RESERVED_4                           = 0x00000004,

				    ADDR_FMT_16                                   = 0x00000005,

				    ADDR_FMT_16_FLOAT                             = 0x00000006,

				    ADDR_FMT_16_FLOAT                             = ADDR_FMT_16,

				    ADDR_FMT_8_8                                  = 0x00000007,

				    ADDR_FMT_5_6_5                                = 0x00000008,

				    ADDR_FMT_6_5_5                                = 0x00000009,

				@@ -361,28 +360,28 @@ typedef enum _AddrFormat {

				    ADDR_FMT_4_4_4_4                              = 0x0000000b,

				    ADDR_FMT_5_5_5_1                              = 0x0000000c,

				    ADDR_FMT_32                                   = 0x0000000d,

				    ADDR_FMT_32_FLOAT                             = 0x0000000e,

				    ADDR_FMT_32_FLOAT                             = ADDR_FMT_32,

				    ADDR_FMT_16_16                                = 0x0000000f,

				    ADDR_FMT_16_16_FLOAT                          = 0x00000010,

				    ADDR_FMT_16_16_FLOAT                          = ADDR_FMT_16_16,

				    ADDR_FMT_8_24                                 = 0x00000011,

				    ADDR_FMT_8_24_FLOAT                           = 0x00000012,

				    ADDR_FMT_8_24_FLOAT                           = ADDR_FMT_8_24,

				    ADDR_FMT_24_8                                 = 0x00000013,

				    ADDR_FMT_24_8_FLOAT                           = 0x00000014,

				    ADDR_FMT_24_8_FLOAT                           = ADDR_FMT_24_8,

				    ADDR_FMT_10_11_11                             = 0x00000015,

				    ADDR_FMT_10_11_11_FLOAT                       = 0x00000016,

				    ADDR_FMT_10_11_11_FLOAT                       = ADDR_FMT_10_11_11,

				    ADDR_FMT_11_11_10                             = 0x00000017,

				    ADDR_FMT_11_11_10_FLOAT                       = 0x00000018,

				    ADDR_FMT_11_11_10_FLOAT                       = ADDR_FMT_11_11_10,

				    ADDR_FMT_2_10_10_10                           = 0x00000019,

				    ADDR_FMT_8_8_8_8                              = 0x0000001a,

				    ADDR_FMT_10_10_10_2                           = 0x0000001b,

				    ADDR_FMT_X24_8_32_FLOAT                       = 0x0000001c,

				    ADDR_FMT_32_32                                = 0x0000001d,

				    ADDR_FMT_32_32_FLOAT                          = 0x0000001e,

				    ADDR_FMT_32_32_FLOAT                          = ADDR_FMT_32_32,

				    ADDR_FMT_16_16_16_16                          = 0x0000001f,

				    ADDR_FMT_16_16_16_16_FLOAT                    = 0x00000020,

				    ADDR_FMT_16_16_16_16_FLOAT                    = ADDR_FMT_16_16_16_16,

				    ADDR_FMT_RESERVED_33                          = 0x00000021,

				    ADDR_FMT_32_32_32_32                          = 0x00000022,

				    ADDR_FMT_32_32_32_32_FLOAT                    = 0x00000023,

				    ADDR_FMT_32_32_32_32_FLOAT                    = ADDR_FMT_32_32_32_32,

				    ADDR_FMT_RESERVED_36                          = 0x00000024,

				    ADDR_FMT_1                                    = 0x00000025,

				    ADDR_FMT_1_REVERSED                           = 0x00000026,

				@@ -393,9 +392,9 @@ typedef enum _AddrFormat {

				    ADDR_FMT_5_9_9_9_SHAREDEXP                    = 0x0000002b,

				    ADDR_FMT_8_8_8                                = 0x0000002c,

				    ADDR_FMT_16_16_16                             = 0x0000002d,

				    ADDR_FMT_16_16_16_FLOAT                       = 0x0000002e,

				    ADDR_FMT_16_16_16_FLOAT                       = ADDR_FMT_16_16_16,

				    ADDR_FMT_32_32_32                             = 0x0000002f,

				    ADDR_FMT_32_32_32_FLOAT                       = 0x00000030,

				    ADDR_FMT_32_32_32_FLOAT                       = ADDR_FMT_32_32_32,

				    ADDR_FMT_BC1                                  = 0x00000031,

				    ADDR_FMT_BC2                                  = 0x00000032,

				    ADDR_FMT_BC3                                  = 0x00000033,

				@@ -550,7 +549,6 @@ typedef enum _AddrHtileBlockSize

				    ADDR_HTILE_BLOCKSIZE_8 = 8,

				} AddrHtileBlockSize;

				/**

				****************************************************************************************************

				*   AddrPipeCfg

				@@ -584,7 +582,8 @@ typedef enum _AddrPipeCfg

				    ADDR_PIPECFG_P8_32x64_32x32  = 15,

				    ADDR_PIPECFG_P16_32x32_8x16  = 17, /// 16 pipes

				    ADDR_PIPECFG_P16_32x32_16x16 = 18,

				    ADDR_PIPECFG_MAX             = 19,

				    ADDR_PIPECFG_RESERVED        = 19, /// reserved for internal use

				    ADDR_PIPECFG_MAX             = 20,

				} AddrPipeCfg;

				/**

				@@ -712,7 +711,6 @@ typedef enum _AddrTileType

				#define ADDR64D "lld" OR "I64d"

				#endif

				/// @brief Union for storing a 32-bit float or 32-bit integer

				/// @ingroup type

				///

				@@ -728,7 +726,6 @@ typedef union {

				    float    f;

				} ADDR_FLT_32;

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//

				//  Macros for controlling linking and building on multiple systems

									
										60

src/amd/addrlib/meson.build
									
												View File
												
				@@ -19,35 +19,33 @@

				# SOFTWARE.

				files_addrlib = files(

				  'addrinterface.cpp',

				  'addrinterface.h',

				  'addrtypes.h',

				  'core/addrcommon.h',

				  'core/addrelemlib.cpp',

				  'core/addrelemlib.h',

				  'core/addrlib.cpp',

				  'core/addrlib.h',

				  'core/addrlib1.cpp',

				  'core/addrlib1.h',

				  'core/addrlib2.cpp',

				  'core/addrlib2.h',

				  'core/addrobject.cpp',

				  'core/addrobject.h',

				  'gfx9/chip/gfx9_enum.h',

				  'gfx9/coord.cpp',

				  'gfx9/coord.h',

				  'gfx9/gfx9addrlib.cpp',

				  'gfx9/gfx9addrlib.h',

				  'amdgpu_asic_addr.h',

				  'inc/chip/gfx9/gfx9_gb_reg.h',

				  'inc/chip/r800/si_gb_reg.h',

				  'r800/chip/si_ci_vi_merged_enum.h',

				  'r800/ciaddrlib.cpp',

				  'r800/ciaddrlib.h',

				  'r800/egbaddrlib.cpp',

				  'r800/egbaddrlib.h',

				  'r800/siaddrlib.cpp',

				  'r800/siaddrlib.h',

				  'inc/addrinterface.h',

				  'inc/addrtypes.h',

				  'src/addrinterface.cpp',

				  'src/core/addrcommon.h',

				  'src/core/addrelemlib.cpp',

				  'src/core/addrelemlib.h',

				  'src/core/addrlib.cpp',

				  'src/core/addrlib.h',

				  'src/core/addrlib1.cpp',

				  'src/core/addrlib1.h',

				  'src/core/addrlib2.cpp',

				  'src/core/addrlib2.h',

				  'src/core/addrobject.cpp',

				  'src/core/addrobject.h',

				  'src/core/coord.cpp',

				  'src/core/coord.h',

				  'src/gfx9/gfx9addrlib.cpp',

				  'src/gfx9/gfx9addrlib.h',

				  'src/amdgpu_asic_addr.h',

				  'src/chip/gfx9/gfx9_gb_reg.h',

				  'src/chip/r800/si_gb_reg.h',

				  'src/r800/ciaddrlib.cpp',

				  'src/r800/ciaddrlib.h',

				  'src/r800/egbaddrlib.cpp',

				  'src/r800/egbaddrlib.h',

				  'src/r800/siaddrlib.cpp',

				  'src/r800/siaddrlib.h',

				)

				libamdgpu_addrlib = static_library(

				@@ -55,9 +53,9 @@ libamdgpu_addrlib = static_library(

				  files_addrlib,

				  include_directories : [

				    include_directories(

				      'core', 'inc/chip/gfx9', 'inc/chip/r800', 'gfx9/chip', 'r800/chip',

				      'inc', 'src', 'src/core', 'src/chip/gfx9', 'src/chip/r800',

				    ),

				    inc_amd_common, inc_src,

				    inc_amd_common, inc_common, inc_src,

				  ],

				  cpp_args : cpp_vis_args,

				)

									
										40

src/amd/addrlib/r800/chip/si_ci_vi_merged_enum.h
									
												View File
											
				@@ -1,40 +0,0 @@

				/*

				 * Copyright © 2014 Advanced Micro Devices, Inc.

				 * All Rights Reserved.

				 *

				 * Permission is hereby granted, free of charge, to any person obtaining

				 * a copy of this software and associated documentation files (the

				 * "Software"), to deal in the Software without restriction, including

				 * without limitation the rights to use, copy, modify, merge, publish,

				 * distribute, sub license, and/or sell copies of the Software, and to

				 * permit persons to whom the Software is furnished to do so, subject to

				 * the following conditions:

				 *

				 * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,

				 * EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES

				 * OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND

				 * NON-INFRINGEMENT. IN NO EVENT SHALL THE COPYRIGHT HOLDERS, AUTHORS

				 * AND/OR ITS SUPPLIERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER

				 * LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE,

				 * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE

				 * USE OR OTHER DEALINGS IN THE SOFTWARE.

				 *

				 * The above copyright notice and this permission notice (including the

				 * next paragraph) shall be included in all copies or substantial portions

				 * of the Software.

				 */

				#if !defined (SI_CI_VI_MERGED_ENUM_HEADER)

				#define SI_CI_VI_MERGED_ENUM_HEADER

				typedef enum PipeInterleaveSize {

				ADDR_CONFIG_PIPE_INTERLEAVE_256B         = 0x00000000,

				ADDR_CONFIG_PIPE_INTERLEAVE_512B         = 0x00000001,

				} PipeInterleaveSize;

				typedef enum RowSize {

				ADDR_CONFIG_1KB_ROW                      = 0x00000000,

				ADDR_CONFIG_2KB_ROW                      = 0x00000001,

				ADDR_CONFIG_4KB_ROW                      = 0x00000002,

				} RowSize;

				#endif

									
										71

src/amd/addrlib/addrinterface.cpp → src/amd/addrlib/src/addrinterface.cpp
									
												View File
												
				@@ -1,5 +1,5 @@

				/*

				 * Copyright © 2014 Advanced Micro Devices, Inc.

				 * Copyright © 2007-2018 Advanced Micro Devices, Inc.

				 * All Rights Reserved.

				 *

				 * Permission is hereby granted, free of charge, to any person obtaining

				@@ -36,6 +36,8 @@

				#include "addrcommon.h"

				#include "util/macros.h"

				using namespace Addr;

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -59,13 +61,13 @@ ADDR_E_RETURNCODE ADDR_API AddrCreate(

				{

				    ADDR_E_RETURNCODE returnCode = ADDR_OK;

				    returnCode = Lib::Create(pAddrCreateIn, pAddrCreateOut);

				    {

				        returnCode = Lib::Create(pAddrCreateIn, pAddrCreateOut);

				    }

				    return returnCode;

				}

				/**

				****************************************************************************************************

				*   AddrDestroy

				@@ -95,8 +97,6 @@ ADDR_E_RETURNCODE ADDR_API AddrDestroy(

				    return returnCode;

				}

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                    Surface functions

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -133,8 +133,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeSurfaceInfo(

				    return returnCode;

				}

				/**

				****************************************************************************************************

				*   AddrComputeSurfaceAddrFromCoord

				@@ -199,8 +197,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeSurfaceCoordFromAddr(

				    return returnCode;

				}

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                   HTile functions

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -302,8 +298,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeHtileCoordFromAddr(

				    return returnCode;

				}

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     C-mask functions

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -406,8 +400,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeCmaskCoordFromAddr(

				    return returnCode;

				}

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     F-mask functions

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -508,8 +500,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeFmaskCoordFromAddr(

				    return returnCode;

				}

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     DCC key functions

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -544,8 +534,6 @@ ADDR_E_RETURNCODE ADDR_API AddrComputeDccInfo(

				    return returnCode;

				}

				///////////////////////////////////////////////////////////////////////////////

				// Below functions are element related or helper functions

				///////////////////////////////////////////////////////////////////////////////

				@@ -832,7 +820,7 @@ BOOL_32 ADDR_API ElemGetExportNorm(

				    Addr::Lib* pLib = Lib::GetLib(hLib);

				    BOOL_32 enabled = FALSE;

				    ADDR_E_RETURNCODE returnCode = ADDR_OK;

				    MAYBE_UNUSED ADDR_E_RETURNCODE returnCode = ADDR_OK;

				    if (pLib != NULL)

				    {

				@@ -848,6 +836,34 @@ BOOL_32 ADDR_API ElemGetExportNorm(

				    return enabled;

				}

				/**

				****************************************************************************************************

				*   ElemSize

				*

				*   @brief

				*       Get bits-per-element for specified format

				*

				*   @return

				*       Bits-per-element of specified format

				*

				****************************************************************************************************

				*/

				UINT_32 ADDR_API ElemSize(

				    ADDR_HANDLE hLib,

				    AddrFormat  format)

				{

				    UINT_32 bpe = 0;

				    Addr::Lib* pLib = Lib::GetLib(hLib);

				    if (pLib != NULL)

				    {

				        bpe = pLib->GetBpe(format);

				    }

				    return bpe;

				}

				/**

				****************************************************************************************************

				*   AddrConvertTileInfoToHW

				@@ -1103,7 +1119,6 @@ ADDR_E_RETURNCODE ADDR_API AddrGetMaxMetaAlignments(

				    return returnCode;

				}

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                    Surface functions for Addr2

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -1140,7 +1155,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeSurfaceInfo(

				    return returnCode;

				}

				/**

				****************************************************************************************************

				*   Addr2ComputeSurfaceAddrFromCoord

				@@ -1173,7 +1187,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeSurfaceAddrFromCoord(

				    return returnCode;

				}

				/**

				****************************************************************************************************

				*   Addr2ComputeSurfaceCoordFromAddr

				@@ -1206,8 +1219,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeSurfaceCoordFromAddr(

				    return returnCode;

				}

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                   HTile functions for Addr2

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -1244,7 +1255,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeHtileInfo(

				    return returnCode;

				}

				/**

				****************************************************************************************************

				*   Addr2ComputeHtileAddrFromCoord

				@@ -1277,7 +1287,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeHtileAddrFromCoord(

				    return returnCode;

				}

				/**

				****************************************************************************************************

				*   Addr2ComputeHtileCoordFromAddr

				@@ -1311,8 +1320,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeHtileCoordFromAddr(

				    return returnCode;

				}

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     C-mask functions for Addr2

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -1350,7 +1357,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeCmaskInfo(

				    return returnCode;

				}

				/**

				****************************************************************************************************

				*   Addr2ComputeCmaskAddrFromCoord

				@@ -1383,7 +1389,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeCmaskAddrFromCoord(

				    return returnCode;

				}

				/**

				****************************************************************************************************

				*   Addr2ComputeCmaskCoordFromAddr

				@@ -1417,8 +1422,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeCmaskCoordFromAddr(

				    return returnCode;

				}

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     F-mask functions for Addr2

				////////////////////////////////////////////////////////////////////////////////////////////////////

				@@ -1455,7 +1458,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeFmaskInfo(

				    return returnCode;

				}

				/**

				****************************************************************************************************

				*   Addr2ComputeFmaskAddrFromCoord

				@@ -1488,7 +1490,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeFmaskAddrFromCoord(

				    return returnCode;

				}

				/**

				****************************************************************************************************

				*   Addr2ComputeFmaskCoordFromAddr

				@@ -1521,8 +1522,6 @@ ADDR_E_RETURNCODE ADDR_API Addr2ComputeFmaskCoordFromAddr(

				    return returnCode;

				}

				////////////////////////////////////////////////////////////////////////////////////////////////////

				//                                     DCC key functions for Addr2

				////////////////////////////////////////////////////////////////////////////////////////////////////

									
										5

src/amd/addrlib/amdgpu_asic_addr.h → src/amd/addrlib/src/amdgpu_asic_addr.h
									
												View File
												
				@@ -1,5 +1,5 @@

				/*

				 * Copyright © 2017 Advanced Micro Devices, Inc.

				 * Copyright © 2017-2018 Advanced Micro Devices, Inc.

				 * All Rights Reserved.

				 *

				 * Permission is hereby granted, free of charge, to any person obtaining

				@@ -90,12 +90,12 @@

				#define AMDGPU_VEGA20_RANGE     0x28, 0xFF

				#define AMDGPU_RAVEN_RANGE      0x01, 0x81

				#define AMDGPU_RAVEN2_RANGE     0x81, 0xFF

				#define AMDGPU_EXPAND_FIX(x) x

				#define AMDGPU_RANGE_HELPER(val, min, max) ((val >= min) && (val < max))

				#define AMDGPU_IN_RANGE(val, ...)   AMDGPU_EXPAND_FIX(AMDGPU_RANGE_HELPER(val, __VA_ARGS__))

				// ASICREV_IS(eRevisionId, revisionName)

				#define ASICREV_IS(r, rn)              AMDGPU_IN_RANGE(r, AMDGPU_##rn##_RANGE)

				#define ASICREV_IS_TAHITI_P(r)         ASICREV_IS(r, TAHITI)

				@@ -132,5 +132,6 @@

				#define ASICREV_IS_VEGA20_P(r)         ASICREV_IS(r, VEGA20)

				#define ASICREV_IS_RAVEN(r)            ASICREV_IS(r, RAVEN)

				#define ASICREV_IS_RAVEN2(r)           ASICREV_IS(r, RAVEN2)

				#endif // _AMDGPU_ASIC_ADDR_H

									
										2

src/amd/addrlib/inc/chip/gfx9/gfx9_gb_reg.h → src/amd/addrlib/src/chip/gfx9/gfx9_gb_reg.h
									
												View File
												
				@@ -2,7 +2,7 @@

				#define __GFX9_GB_REG_H__

				/*

				 * Copyright © 2017 Advanced Micro Devices, Inc.

				 * Copyright © 2007-2018 Advanced Micro Devices, Inc.

				 * All Rights Reserved.

				 *

				 * Permission is hereby granted, free of charge, to any person obtaining

Compare commits

3620 Commits mesa-18.2. ... 19.0

1 .editorconfig Unescape Escape View File

5 .mailmap Unescape Escape View File

451 .travis.yml Unescape Escape View File

10 Android.common.mk Unescape Escape View File

7 Android.mk Unescape Escape View File

6 CleanSpec.mk Unescape Escape View File

3 Makefile.am Unescape Escape View File

7 REVIEWERS Unescape Escape View File

1 SConstruct Unescape Escape View File

2 VERSION Unescape Escape View File

30 appveyor.yml Unescape Escape View File

103 bin/.cherry-ignore Unescape Escape View File

8 bin/get-pick-list.sh Unescape Escape View File

29 bin/git_sha1_gen.py Executable file → Normal file Unescape Escape View File

15 bin/install_megadrivers.py Executable file → Normal file Unescape Escape View File

88 bin/meson-cmd-extract.py Executable file Unescape Escape View File

3 common.py Unescape Escape View File

98 configure.ac Unescape Escape View File

13 docs/autoconf.html Unescape Escape View File

6 docs/download.html Unescape Escape View File

7 docs/envvars.html Unescape Escape View File

68 docs/features.txt Unescape Escape View File

2 docs/helpwanted.html Unescape Escape View File

102 docs/index.html Unescape Escape View File

46 docs/install.html Unescape Escape View File

3 docs/mesa.css Unescape Escape View File

193 docs/meson.html Unescape Escape View File

137 docs/release-calendar.html Unescape Escape View File

71 docs/releasing.html Unescape Escape View File

16 docs/relnotes.html Unescape Escape View File

188 docs/relnotes/18.1.6.html Normal file Unescape Escape View File

104 docs/relnotes/18.1.7.html Normal file Unescape Escape View File

180 docs/relnotes/18.1.8.html Normal file Unescape Escape View File

178 docs/relnotes/18.1.9.html Normal file Unescape Escape View File

3 docs/relnotes/18.2.8.html Unescape Escape View File

283 docs/relnotes/18.3.0.html Normal file Unescape Escape View File

63 docs/relnotes/18.3.1.html Normal file Unescape Escape View File

265 docs/relnotes/18.3.2.html Normal file Unescape Escape View File

2475 docs/relnotes/19.0.0.html Normal file View File

159 docs/relnotes/19.0.1.html Normal file Unescape Escape View File

122 docs/relnotes/19.0.2.html Normal file Unescape Escape View File

148 docs/relnotes/19.0.3.html Normal file Unescape Escape View File

243 docs/relnotes/19.0.4.html Normal file Unescape Escape View File

137 docs/relnotes/19.0.5.html Normal file Unescape Escape View File

153 docs/relnotes/19.0.6.html Normal file Unescape Escape View File

150 docs/relnotes/19.0.7.html Normal file Unescape Escape View File

62 docs/relnotes/19.0.8.html Normal file Unescape Escape View File

39 docs/repository.html Unescape Escape View File

3 docs/shading.html Unescape Escape View File

2 docs/sourcedocs.html Unescape Escape View File

82 docs/specs/EGL_MESA_device_software.txt Normal file Unescape Escape View File

95 docs/specs/EGL_MESA_query_driver.txt Normal file Unescape Escape View File

200 docs/specs/INTEL_shader_atomic_float_minmax.txt Normal file Unescape Escape View File

43 docs/specs/MESA_query_renderer.spec Unescape Escape View File

106 docs/submittingpatches.html Unescape Escape View File

2 docs/utilities.html Unescape Escape View File

21 docs/vmware-guest.html Unescape Escape View File

8 include/EGL/egl.h Unescape Escape View File

106 include/EGL/eglext.h Unescape Escape View File

2 include/GL/gl.h Unescape Escape View File

158 include/GL/internal/dri_interface.h Unescape Escape View File

12 include/c11/threads_win32.h Unescape Escape View File

4 include/c99_compat.h Unescape Escape View File

2 include/d3dadapter/present.h Unescape Escape View File

13 include/drm-uapi/drm_fourcc.h Unescape Escape View File

39 include/drm-uapi/v3d_drm.h Unescape Escape View File

5 include/pci_ids/i965_pci_ids.h Unescape Escape View File

12 include/vulkan/vulkan.h Unescape Escape View File

2 include/vulkan/vulkan_android.h Unescape Escape View File

1236 include/vulkan/vulkan_core.h View File

58 include/vulkan/vulkan_fuchsia.h Normal file Unescape Escape View File

2 include/vulkan/vulkan_ios.h Unescape Escape View File

2 include/vulkan/vulkan_macos.h Unescape Escape View File

65 include/vulkan/vulkan_mir.h Unescape Escape View File

2 include/vulkan/vulkan_vi.h Unescape Escape View File

2 include/vulkan/vulkan_wayland.h Unescape Escape View File

2 include/vulkan/vulkan_win32.h Unescape Escape View File

2 include/vulkan/vulkan_xcb.h Unescape Escape View File

3620 Commits

mesa-18.2. ... 19.0

1

.editorconfig

View File

5

.mailmap

View File

451

.travis.yml

View File

10

Android.common.mk

View File

7

Android.mk

View File

6

CleanSpec.mk

View File

3

Makefile.am

View File

7

REVIEWERS

View File

1

SConstruct

View File

2

VERSION

View File

30

appveyor.yml

View File

103

bin/.cherry-ignore

View File

8

bin/get-pick-list.sh

View File

29

bin/git_sha1_gen.py Executable file → Normal file

View File

15

bin/install_megadrivers.py Executable file → Normal file

View File

88

bin/meson-cmd-extract.py Executable file

View File

3

common.py

View File

98

configure.ac

View File

13

docs/autoconf.html

View File

6

docs/download.html

View File

7

docs/envvars.html

View File

68

docs/features.txt

View File

2

docs/helpwanted.html

View File

102

docs/index.html

View File

46

docs/install.html

View File

3

docs/mesa.css

View File

193

docs/meson.html

View File

137

docs/release-calendar.html

View File

71

docs/releasing.html

View File

16

docs/relnotes.html

View File

188

docs/relnotes/18.1.6.html Normal file

View File

104

docs/relnotes/18.1.7.html Normal file

View File

180

docs/relnotes/18.1.8.html Normal file

View File

178

docs/relnotes/18.1.9.html Normal file

View File

3

docs/relnotes/18.2.8.html

View File

283

docs/relnotes/18.3.0.html Normal file

View File

63

docs/relnotes/18.3.1.html Normal file

View File

265

docs/relnotes/18.3.2.html Normal file

View File

2475

docs/relnotes/19.0.0.html Normal file

View File

159

docs/relnotes/19.0.1.html Normal file

View File

122

docs/relnotes/19.0.2.html Normal file

View File

148

docs/relnotes/19.0.3.html Normal file

View File

243

docs/relnotes/19.0.4.html Normal file

View File

137

docs/relnotes/19.0.5.html Normal file

View File

153

docs/relnotes/19.0.6.html Normal file

View File

150

docs/relnotes/19.0.7.html Normal file

View File

62

docs/relnotes/19.0.8.html Normal file

View File

39

docs/repository.html

View File

3

docs/shading.html

View File

2

docs/sourcedocs.html

View File

82

docs/specs/EGL_MESA_device_software.txt Normal file

View File

95

docs/specs/EGL_MESA_query_driver.txt Normal file

View File

200

docs/specs/INTEL_shader_atomic_float_minmax.txt Normal file

View File

43

docs/specs/MESA_query_renderer.spec

View File

106

docs/submittingpatches.html

View File

2

docs/utilities.html

View File

21

docs/vmware-guest.html

View File

8

include/EGL/egl.h

View File

106

include/EGL/eglext.h

View File

2

include/GL/gl.h

View File

158

include/GL/internal/dri_interface.h

View File

12

include/c11/threads_win32.h

View File

4

include/c99_compat.h

View File

2

include/d3dadapter/present.h

View File

13

include/drm-uapi/drm_fourcc.h

View File

39

include/drm-uapi/v3d_drm.h

View File

5

include/pci_ids/i965_pci_ids.h

View File

12

include/vulkan/vulkan.h

View File

2

include/vulkan/vulkan_android.h

View File

1236

include/vulkan/vulkan_core.h

View File

58

include/vulkan/vulkan_fuchsia.h Normal file

View File

2

include/vulkan/vulkan_ios.h

View File

2

include/vulkan/vulkan_macos.h

View File

65

include/vulkan/vulkan_mir.h

View File

2

include/vulkan/vulkan_vi.h

View File

2

include/vulkan/vulkan_wayland.h

View File

2

include/vulkan/vulkan_win32.h

View File

2

include/vulkan/vulkan_xcb.h

View File

2

include/vulkan/vulkan_xlib.h

View File