fran/mesa - mesa - GNLUG git store

Author	SHA1	Message	Date
Andres Gomez	3482790712	docs: add release notes for 17.2.8 Signed-off-by: Andres Gomez <agomez@igalia.com> mesa-17.2.8	2017-12-22 22:39:47 +02:00
Andres Gomez	fb618171b6	Update version to 17.2.8 Signed-off-by: Andres Gomez <agomez@igalia.com>	2017-12-22 22:34:12 +02:00
Andres Gomez	d9dd945d5c	cherry-ignore: r600: set DX10_CLAMP for compute shader too extra: The commit references a previous commit in which the changes should have been included but, as clarified by the developer, it is not needed for stable. Signed-off-by: Andres Gomez <agomez@igalia.com>	2017-12-20 19:40:38 +02:00
Andres Gomez	03538c210f	cherry-ignore: main: Clear shader program data whenever ProgramBinary is called extra: The commit just references a fix for an additional change in its v2. Signed-off-by: Andres Gomez <agomez@igalia.com>	2017-12-20 19:40:38 +02:00
Andres Gomez	dd0b6dfd91	cherry-ignore: radv: port merge tess info from anv fixes: This commit addressed earlier commit `d1c9f30d7f` which did not land in branch. Signed-off-by: Andres Gomez <agomez@igalia.com>	2017-12-20 19:40:38 +02:00
Andres Gomez	fb342ac0c8	cherry-ignore: added 17.3 nominations. stable: 17.3 nominations only. Signed-off-by: Andres Gomez <agomez@igalia.com>	2017-12-20 19:40:38 +02:00
Andres Gomez	aa8742864a	cherry-ignore: swr: Fix KNOB_MAX_WORKER_THREADS thread creation override. stable: This commit addressed earlier commit `ead0dfe31e` which did not land in branch. Signed-off-by: Andres Gomez <agomez@igalia.com>	2017-12-20 19:40:38 +02:00
Iago Toral Quiroga	5226e37717	i965/vec4: use a temp register to compute offsets for pull loads 64-bit pull loads are implemented by emitting 2 separate 32-bit pull load messages, where the second message loads from an offset at +16B. That addition of 16B to the original offset should not alter the original offset register used as source for the pull load instruction though, since the compiler might use that same offset register in other instructions (for example, for other pull loads in the shader code that take that same offset as reference). If the pull load is 32-bit then we only need to emit one message and we don't need to do offset calculations, but in that case the optimizer should be able to drop the redundant MOV. Fixes the following test on Haswell: KHR-GL45.gpu_shader_fp64.fp64.max_uniform_components Reviewed-by: Matt Turner <mattst88@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103007 (cherry picked from commit `8620f7ebbc`)	2017-12-20 19:40:38 +02:00
Bas Nieuwenhuizen	a8ee722262	radv: Fix multi-layer blits. We did not set the layer correctly for the dst, as we would keep using the base layer. Same for the source image. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102710 CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `b42e106d4d`)	2017-12-20 19:40:38 +02:00
Brian Paul	334ae3b0d2	gallium/aux: include nr_samples in util_resource_size() computation This function is only used in two places: 1. VMware driver, but only for HUD reporting 2. st/nine state tracker, used for texture memory accounting Fixes: `a69efa9482` ("util: add new util_resource_size() function in u_resource.[ch]") Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `dde8309cde`)	2017-12-20 19:40:38 +02:00
Roland Scheidegger	3730b04e81	r600: use DX10_CLAMP bit in shader setup The docs are not very concise in what this really does, however both Alex Deucher and Nicolai Hähnle suggested this only really affects instructions using the CLAMP output modifier, and I've confirmed that with the newly changed piglit isinf_and_isnan test. So, with this bit set, if an instruction has the CLAMP modifier bit (which clamps to [0,1]) set, then NaNs will be converted to zero, otherwise the result will be NaN. D3D10 would require this, glsl doesn't have modifiers (with mesa clamp(x,0,1) would get converted to such a modifier) coupled with a whatever-floats-your-boat specified NaN behavior, but the clamp behavior should probably always be used (this also matches what a decomposition into min(1.0, max(x, 0.0)) would do, if min/max also adhere to the ieee spec of picking the non-nan result). Some apps may in fact rely on this, as this prevents misrenderings in This War of Mine since using ieee muls (`ce7a045fee`), without having to use clamped rcp opcode, which would also fix this bug there. radeonsi also seems to set this bit nowadays if I see that right (albeit the llvm amdgpu code comment now says "Make clamp modifier on NaN input returns 0" instead of "Do not clamp NAN to 0" since it was changed, which also looks a bit misleading). v2: set it in all shader stages. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103544 Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `3835009796`)	2017-12-20 19:40:38 +02:00
Roland Scheidegger	5ffdc3a2b2	r600: use min_dx10/max_dx10 instead of min/max I believe this is the safe thing to do, especially ever since the driver actually generates NaNs for muls too. The ISA docs are not very helpful here, however the dx10 versions will pick a non-nan result over a NaN one (this is also the ieee754 behavior), whereas the non-dx10 ones will pick the NaN (verified by newly changed piglit isinf-and-isnan test). Other "modern" drivers will most likely do the same. This was shown to make some difference for bug 103544, albeit it is not required to fix it. Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `aab0bfc648`)	2017-12-20 19:40:38 +02:00
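The NaN behaviour discussed in the two r600 commits above can be illustrated on the CPU. This is a minimal sketch in plain C, not r600 or Mesa code: `min_ieee`, `max_ieee`, `max_nan_propagating`, `clamp01_ieee` and `clamp01_of_nan` are hypothetical helpers introduced only for illustration. It assumes that the dx10-style min/max return the non-NaN operand and that the non-dx10 variants return NaN, as the commit messages describe.

```c
#include <math.h>   /* only for the NAN macro */

/* NaN is the only float value that compares unequal to itself. */
int is_nan(float x) { return x != x; }

/* IEEE-style min/max: if exactly one operand is NaN, return the other one. */
float min_ieee(float a, float b)
{
    if (is_nan(a)) return b;
    if (is_nan(b)) return a;
    return a < b ? a : b;
}

float max_ieee(float a, float b)
{
    if (is_nan(a)) return b;
    if (is_nan(b)) return a;
    return a > b ? a : b;
}

/* Model of a max that picks the NaN operand (assumed non-dx10 behaviour). */
float max_nan_propagating(float a, float b)
{
    if (is_nan(a)) return a;
    if (is_nan(b)) return b;
    return a > b ? a : b;
}

/* clamp(x, 0, 1) decomposed into min(1.0, max(x, 0.0)), as in the commit text. */
float clamp01_ieee(float x)
{
    return min_ieee(1.0f, max_ieee(x, 0.0f));
}

/* Convenience wrapper: clamping a NaN input with the IEEE-style helpers. */
float clamp01_of_nan(void)
{
    return clamp01_ieee(NAN);
}
```

With these helpers, `max_ieee(NAN, 0.0f)` is `0.0f`, so the clamp of a NaN input becomes `0.0f`, which matches the "NaNs will be converted to zero" behaviour described for the CLAMP modifier.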
Brian Paul	9f94d6aa65	xlib: call _mesa_warning() instead of fprintf() We use _mesa_warning() everywhere else in this code. Change requested by Rick Irons of Mathworks. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `7a46063803`)	2017-12-20 19:40:38 +02:00
Bas Nieuwenhuizen	8f8b5c0ad3	spirv: Fix loading an entire block at once. There is no chain, so checking the length ends with a SEGFAULT. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103579 Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `b926da241a`) [Andres Gomez: 16 bits types and vtn_assert/fail not yet in 17.2] Signed-off-by: Andres Gomez <agomez@igalia.com> Conflicts: src/compiler/spirv/vtn_variables.c	2017-12-20 19:40:38 +02:00
Leo Liu	2e6593806c	radeon/vce: move destroy command before feedback command VCE processing IBs starts from session and task info at first level, other commands processed subsequently. The task info for destroy is embedded to destroy command, resulting that feedback command is not properly processed. This is causing kernel spin VM fault messages on Polaris and Vega10 card when running ends at encode application. The fix is also verified on VCE physical mode card. Signed-off-by: Leo Liu <leo.liu@amd.com> Cc: mesa-stable@lists.freedesktop.org Acked-by: Christian König <christian.koenig@amd.com> (cherry picked from commit `6d74cb2570`)	2017-12-20 19:40:38 +02:00
Matt Turner	e0160dd368	util: Add a SHA1 unit test program Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `513d7ffa23`) (cherry picked from commit `60ed1a07f2`) Squashed with: util: scons: wire up the sha1 test Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Andres Gomez <agomez@igalia.com> (cherry picked from commit `5d03a68640`) (cherry picked from commit `455ff75892`)	2017-12-20 19:40:22 +02:00
Matt Turner	2186a9ee6b	util: Assume little endian in the absence of platform-specific handling (cherry picked from commit `6a353479a7`) Squashed with: util: Use preprocessor correctly Fixes: `6a353479a7` ("util: Assume little endian in the absence of platform-specific handling") (cherry picked from commit `b8cbad624b`) Squashed with: util: Just give up and define PIPE_ARCH_LITTLE_ENDIAN on MSVC MSVC doesn't support #warning?! Getting really tired of this. (cherry picked from commit `676761252b`) Squashed with: util: Also include endian.h on cygwin If u_endian.h can't determine the endianness, the default behaviour in sha1.c is to build for big-endian Signed-off-by: Jon Turney <jon.turney@dronecode.org.uk> Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `2c62ccb10a`)	2017-12-20 16:16:47 +02:00
Emil Velikov	9ca5f55786	docs: add sha256 checksums for 17.2.7 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-12-14 13:49:09 +00:00
Emil Velikov	834c36fa65	docs: add release notes for 17.2.7 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> mesa-17.2.7	2017-12-14 13:27:23 +00:00
Emil Velikov	3200305737	Update version to 17.2.7 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-12-14 13:17:30 +00:00
Eduardo Lima Mitev	62eb86dd4c	glsl/linker: Check that re-declared, inter-shader built-in blocks match From GLSL 4.5 spec, section "7.1 Built-In Language Variables", page 130 of the PDF states: "If multiple shaders using members of a built-in block belonging to the same interface are linked together in the same program, they must all redeclare the built-in block in the same way, as described in section 4.3.9 “Interface Blocks” for interface-block matching, or a link-time error will result." Fixes: * GL45-CTS.CommonBugs.CommonBug_PerVertexValidation v2 (Neil Roberts): Explicitly look for gl_PerVertex in the symbol tables instead of waiting to find a variable in the interface. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102677 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Eduardo Lima Mitev <elima@igalia.com> Signed-off-by: Neil Roberts <nroberts@igalia.com> (cherry picked from commit `f9de7f5596`)	2017-12-08 18:27:01 +00:00
Eduardo Lima Mitev	2dc404f73f	glsl: Use the utility function to copy symbols between symbol tables This effectively factorizes a couple of similar routines. v2 (Neil Roberts): Non-trivial rebase on master Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Eduardo Lima Mitev <elima@igalia.com> Signed-off-by: Neil Roberts <nroberts@igalia.com> (cherry picked from commit `f5fe99ac85`)	2017-12-08 18:27:01 +00:00
Eduardo Lima Mitev	f76bea8abc	glsl_parser_extra: Add utility to copy symbols between symbol tables Some symbols gathered in the symbols table during parsing are needed later for the compile and link stages, so they are moved along the process. Currently, only functions and non-temporary variables are copied between symbol tables. However, the built-in gl_PerVertex interface blocks are also needed during the linking stage (the last step), to match re-declared blocks of inter-stage shaders. This patch adds a new utility function that will factorize current code that copies functions and variables between two symbol tables, and in addition will copy explicitly declared gl_PerVertex blocks too. The function will be used in a subsequent patch. v2 (Neil Roberts): Allow the src symbol table to be NULL and explicitly copy the gl_PerVertex symbols in case they are not referenced in the exec_list. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Eduardo Lima Mitev <elima@igalia.com> Signed-off-by: Neil Roberts <nroberts@igalia.com> (cherry picked from commit `4c62a270a9`)	2017-12-08 18:27:01 +00:00
Nicolai Hähnle	32b0691249	glsl: fix interpolateAtXxx(some_vec[idx], ...) with dynamic idx The dynamic index of a vector (not array!) is lowered to a sequence of conditional assignments. However, the interpolate_at_* expressions require that the interpolant is an l-value of a shader input. So instead of doing conditional assignments of parts of the shader input and then interpolating that (which is nonsensical), we interpolate the entire shader input and then do conditional assignments of the interpolated result. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `ca63a5ed3e`)	2017-12-08 18:27:01 +00:00
Nicolai Hähnle	267fe4f1f9	glsl: allow any l-value of an input variable as interpolant in interpolateAt* The intended rule has been clarified in GLSL 4.60, Section 8.13.2 (Interpolation Functions): "For all of the interpolation functions, interpolant must be an l-value from an in declaration; this can include a variable, a block or structure member, an array element, or some combination of these. Component selection operators (e.g., .xy) may be used when specifying interpolant." For members of interface blocks, var->data.must_be_shader_input must be determined on-the-fly after lowering interface blocks, since we don't want to disable varying packing for an entire block just because one input in it is used in interpolateAt. v2: keep setting must_be_shader_input in ast_function (Ian) v3: follow the relaxed rule of GLSL 4.60 v4: only apply the relaxed rules to desktop GL (the ES WG decided that the relaxed rules may apply in a future version but not retroactively; see also dEQP-GLES31.functional.shaders.multisample_interpolation.interpolate_at_centroid.negative.) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101378 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (v1) Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `4f42450b86`)	2017-12-08 18:27:01 +00:00
Fabian Bieler	07da9fd947	glsl: Fix gl_NormalScale. GLSL shaders can access the normal scale factor with the built-in gl_NormalScale. Mesa's modelspace lighting optimization uses a different normal scale factor than defined in the spec. We have to take care not to use this factor for gl_NormalScale. Mesa already defines two separate states: state.normalScale and state.internal.normalScale. The first is used by the glsl compiler while the latter is used by the fixed function T&L pipeline. Previously the only difference was some component swizzling. With this commit state.normalScale always uses the normal scale factor for eyespace lighting. Reviewed-by: Brian Paul <brianp@vmware.com> (cherry picked from commit `c3ee464d7a`)	2017-12-08 18:27:01 +00:00
Fabian Bieler	fe15da8245	glsl: Match order of gl_LightSourceParameters elements. spotExponent and spotCosCutoff were swapped in the gl_builtin_uniform_element struct. Now the order matches across gl_builtin_uniform_element, glsl_struct_field and the spec. Reviewed-by: Brian Paul <brianp@vmware.com> (cherry picked from commit `9bdb5457f4`)	2017-12-08 18:27:01 +00:00
Eric Anholt	511cfe1d70	broadcom/vc4: Fix handling of GFXH-515 workaround with a start vertex count. We failed to take the start into account for how many vertices to draw in this round, so we would end up decrementing count below 0, which as an unsigned number meant we would loop until the CLs soon ran out of space. When I wrote the code I was thinking about how to use the previously emitted shader state (no index bias baked into the elements) by emitting up to 65535 and then only re-emitting with bias for the second round, but that doesn't work if the start is over 65535. Instead, just delay emitting shader state until we get into the drawarrays GFXH-515 loop and always bake the bias in when we're doing the workaround. (cherry picked from commit `84ab48c15c`)	2017-12-08 18:27:01 +00:00
Julien Isorce	d95c2edcde	st/va: change frame_idx from array to hash table The picture_id was assumed to be a frame number so in 0-31. But the vaapi client gstreamer-vaapi uses the surfaces handles as identifier which are unsigned int. This bug can happen when using a lot of vaapi surfaces within the same process. Indeed Mesa/st/va increments a counter for the surface ID: mesa/util/u_handle_table.c::handle_table_add which starts from 0 and incremented by 1 at each call. So creating more than 32 surfaces was a problem. The following bug contains a test that reproduces the problem by running a couple of vaapih264enc in the same process. The above also explains why there was no pb when running them in separated processes. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102006 Signed-off-by: Julien Isorce <jisorce@oblong.com> Tested-by: Tomas Rataj <rataj28@gmail.com> Acked-by: Christian König <christian.koenig@amd.com> Reviewed-and-tested-by: Boyuan Zhang <Boyuan.Zhang@amd.com> (cherry picked from commit `91d93aa621`)	2017-12-08 18:27:01 +00:00
Denis Pauk	3f1c8733d2	gallium/{r600, radeonsi}: Fix segfault with color format (v2) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102552 v2: Patch cleanup proposed by Nicolai Hähnle. * deleted changes in si_translate_texformat. Cc: Nicolai Hähnle <nhaehnle@gmail.com> Cc: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `74d2456491`)	2017-12-08 18:27:01 +00:00
Vinson Lee	ac78f7c76a	anv: Check if memfd_create is already defined. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103909 Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> (cherry picked from commit `8c1e4b1afc`) [Emil Velikov: resolve trivial conflicts] Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Conflicts: configure.ac meson.build src/intel/vulkan/anv_allocator.c	2017-12-08 18:27:01 +00:00
Matt Turner	2a266cb675	i965/fs: Unpack count argument to 64-bit shift ops on Atom 64-bit operations on Atom parts have additional restrictions over their big-core counterparts (validated by later patches). Specifically, the restriction that "Source and Destination horizontal stride must be aligned to the same qword" is violated by most shift operations since NIR uses a 32-bit value as the shift count argument, and this causes instructions like shl(8) g19<1>Q g5<4,4,1>Q g23<4,4,1>UD where src1 has a 32-bit stride, but the dest and src0 have a 64-bit stride. This caused ~4 pixels in the ARB_shader_ballot piglit test fs-readInvocation-uint.shader_test to be incorrect. Unfortunately no ARB_gpu_shader_int64 test hit this case because they operate on uniforms, and their scalar regions are an exception to the restriction. We work around this by effectively unpacking the shift count, so that we can read it with a 64-bit stride in the shift instruction. Unfortunately the unpack (a MOV with a dst stride of 2) is a partial write, and cannot be copy-propagated or CSE'd. Bugzilla: https://bugs.freedesktop.org/101984 (cherry picked from commit `b541945c20`)	2017-12-08 18:27:01 +00:00
Frank Richter	e53135725f	gallium/wgl: fix default pixel format issue When creating a context without SetPixelFormat() don't blindly take the pixel format reported by GDI. Instead, look for our own closest pixel format. Minor clean-ups added by Brian Paul. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103412 Reviewed-by: Brian Paul <brianp@vmware.com> Tested-by: Brian Paul <brianp@vmware.com> (cherry picked from commit `bf41b2b262`)	2017-12-08 18:27:01 +00:00
Juan A. Suarez Romero	7d19ff0d67	glsl: add varying resources for arrays of complex types This patch is mostly a patch done by Ilia Mirkin. It fixes KHR-GL45.enhanced_layouts.varying_structure_locations. v2: fix locations for TCS/TES/GS inputs and outputs (Ilia) CC: Ilia Mirkin <imirkin@alum.mit.edu> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103098 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `d5a641106b`)	2017-12-08 18:27:01 +00:00
George Kyriazis	df137dff8f	swr: Handle resource across context changes Swr caches fb contents in tiles. Those tiles are stored on a per-context basis. When switching contexts that share resources we need to make sure that the tiles of the old context are being stored and the tiles of the new context are being invalidated (marked as invalid, hence contents need to be reloaded). The context does not get any dirty bits to identify this case. This has to be, then, coordinated by the resources that are being shared between the contexts. Add a "curr_pipe" hook in swr_resource that will allow us to identify a MakeCurrent of the above form during swr_update_derived(). At that time, we invalidate the tiles of the new context. The old context, will need to have already store its tiles by that time, which happens during glFlush(). glFlush() is being called at the beginning of MakeCurrent. So, the sequence of operations is: - At the beginning of glXMakeCurrent(), glFlush() will store the tiles of all bound surfaces of the old context. - After the store, a fence will guarantee that the all tile store make it to the surface - During swr_update_derived(), when we validate the new context, we check all resources to see what changed, and if so, we invalidate the current tiles. Fixes rendering problems with CEI/Ensight. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com> (cherry picked from commit `b9aa0fa7d6`) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103732	2017-12-08 18:27:01 +00:00
Timothy Arceri	edd1f519f4	glsl: get correct member type when processing xfb ifc arrays This fixes a crash in: KHR-GL45.enhanced_layouts.xfb_block_stride Fixes: `0822517936` "glsl: add helper to process xfb qualifiers during linking" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `9d53ccccb2`)	2017-12-08 18:27:00 +00:00
Nicolai Hähnle	f51950c413	radeonsi: fix the R600_RESOURCE_FLAG_UNMAPPABLE check The flag is on the pipe_resource, not the r600_resource. I don't see an obvious bug related to this, but it could potentially lead to suboptimal placement of some resources. Fixes: `a41587433c` ("gallium/radeon: add R600_RESOURCE_FLAG_UNMAPPABLE") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> (cherry picked from commit `5e2962c949`)	2017-12-08 18:27:00 +00:00
Pierre Moreau	9c2f665863	nvc0/ir: Properly lower 64-bit shifts when the shift value is >32 Fixes: `61d7676df7` "nvc0/ir: add support for 64-bit shift lowering on SM20/SM30" Fixes fs-shift-scalar-by-scalar.shader_test from piglit for the current set-up: uniform int64_t ival -0x7dfcfefbdf6536ff # bit pattern: 0x82030104209ac901 uniform uint64_t uval 0x1400000085010203 uniform int shl 36 uniform int shr 36 uniform int64_t iexpected_shl 0x09ac901000000000 uniform int64_t iexpected_shr -0x7dfcff0 # bit pattern: 0xfffffffff8203010 uniform uint64_t uexpected_shl 0x5010203000000000 uniform uint64_t uexpected_shr 0x0000000001400000 draw rect ortho 12 0 4 4 Signed-off-by: Pierre Moreau <pierre.morrow@free.fr> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> (cherry picked from commit `9bee12160b`)	2017-12-08 18:27:00 +00:00
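The shift values in the nvc0 commit above exercise the case where a 64-bit shift is built from 32-bit operations and the shift amount is 32 or more. The following is a minimal sketch in plain C of that kind of lowering, not the nvc0 compiler code: `shl64_via_32` and `shr64_via_32` are hypothetical names, and the split into three cases (zero, below 32, 32 and above) is one common way to write it.

```c
#include <stdint.h>

/* Left shift of a 64-bit value using only 32-bit shifts.
 * shift must be in the range [0, 63]. */
uint64_t shl64_via_32(uint64_t value, unsigned shift)
{
    uint32_t lo = (uint32_t)value;
    uint32_t hi = (uint32_t)(value >> 32);
    uint32_t out_lo, out_hi;

    if (shift == 0) {
        out_lo = lo;
        out_hi = hi;
    } else if (shift < 32) {
        /* Bits leaving the low half enter the high half. */
        out_lo = lo << shift;
        out_hi = (hi << shift) | (lo >> (32 - shift));
    } else {
        /* The whole low half moves into the high half. */
        out_lo = 0;
        out_hi = lo << (shift - 32);
    }
    return ((uint64_t)out_hi << 32) | out_lo;
}

/* Right shift of a 64-bit value using only 32-bit shifts.
 * arithmetic != 0 replicates the sign bit, otherwise zeros are shifted in.
 * shift must be in the range [0, 63]. */
uint64_t shr64_via_32(uint64_t value, unsigned shift, int arithmetic)
{
    uint32_t lo = (uint32_t)value;
    uint32_t hi = (uint32_t)(value >> 32);
    /* Fill pattern for the vacated high bits. */
    uint32_t fill = (arithmetic && (hi & 0x80000000u)) ? 0xffffffffu : 0u;
    uint32_t out_lo, out_hi;

    if (shift == 0) {
        out_lo = lo;
        out_hi = hi;
    } else if (shift < 32) {
        out_lo = (lo >> shift) | (hi << (32 - shift));
        out_hi = (hi >> shift) | (fill << (32 - shift));
    } else if (shift == 32) {
        out_lo = hi;
        out_hi = fill;
    } else {
        out_lo = (hi >> (shift - 32)) | (fill << (64 - shift));
        out_hi = fill;
    }
    return ((uint64_t)out_hi << 32) | out_lo;
}
```

Feeding in the uniforms from the commit message (shift count 36) gives the expected values listed there, for example `shl64_via_32(0x1400000085010203, 36)` yields `0x5010203000000000`.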
Vadym Shovkoplias	a552480249	glx/dri3: Remove unused deviceName variable deviceName string is declared, assigned and freed but actually never used in dri3_create_screen() function. Fixes: `2d94601582` ("Add DRI3+Present loader") Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> (cherry picked from commit `d555929239`)	2017-12-08 18:27:00 +00:00
Vadym Shovkoplias	ecf8e4a64c	intel/blorp: Fix possible NULL pointer dereferencing Fix incomplete check of input params in blorp_surf_convert_to_uncompressed() which can lead to NULL pointer dereferencing. Fixes: `5ae8043fed` ("intel/blorp: Add an entrypoint for doing bit-for-bit copies") Fixes: `f395d0abc8` ("intel/blorp: Internally expose surf_convert_to_uncompressed") Reviewed-by: Emil Velikov <emli.velikov@collabora.com> Reviewed-by: Andres Gomez <agomez@igalia.com> (cherry picked from commit `cdb3eb7174`) [Emil Velikov: drop non-applicable x/y hunk] Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Conflicts: src/intel/blorp/blorp_blit.c	2017-12-08 18:27:00 +00:00
Kenneth Graunke	14df2f2d27	i965: Fix Smooth Point Enables. We want to program the 3DSTATE_RASTER field to the gl_context value, not the other way around. Fixes: `13ac46557a` (i965: Port Gen8+ 3DSTATE_RASTER state to genxml.) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `760e0156df`)	2017-12-08 18:27:00 +00:00
Eric Engestrom	83c85b28a4	compiler: use NDEBUG to guard asserts nir_validate.c's #endif already had the correct NDEBUG comment Fixes: `dcb1acdea0` "nir/validate: Only build in debug mode" Fixes: `9ff71b649b` "i965/nir: Validate that NIR passes call nir_metadata_preserve()" Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `7b85b9b877`) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Conflicts: src/compiler/nir/nir.h	2017-12-08 18:27:00 +00:00
Matt Turner	6a698f7f92	util: Fix disk_cache index calculation on big endian The cache-test test program attempts to create a collision (using key_a and key_a_collide) by making the first two bytes identical. The idea is fine -- the shader cache wants to use the first four characters of a SHA1 hex digest as the index. The following program unsigned char array[4] = {1, 2, 3, 4}; int *ptr = (int *)array; for (int i = 0; i < 4; i++) { printf("%02x", array[i]); } printf("\n"); printf("%08x\n", *ptr); prints 01020304 04030201 on little endian, and 01020304 01020304 on big endian. On big endian platforms reading the character array back as an int (as is done in disk_cache.c) does not yield the same results as reading the byte array. To get the first four characters of the SHA1 hex digest when we mask with CACHE_INDEX_KEY_MASK, we need to byte swap the int on big endian platforms. Bugzilla: https://bugs.freedesktop.org/103668 Bugzilla: https://bugs.gentoo.org/637060 Bugzilla: https://bugs.gentoo.org/636326 Fixes: `87ab26b2ab` ("glsl: Add initial functions to implement an on-disk cache") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `c690a7a8cd`)	2017-12-08 18:27:00 +00:00
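The byte-order dependence described in the commit above can be avoided by assembling the integer from individual bytes. This is a minimal sketch of that alternative technique, not the fix that was committed (which, per the message, byte swaps the int on big endian platforms): `first_four_bytes_be` and `index_from_key` are hypothetical helpers, and taking the top 16 bits as the index is an assumption based on the "first four characters of a SHA1 hex digest" wording.

```c
#include <stdint.h>

/* Combine the first four bytes of a key so that key[0] always lands in the
 * most significant byte, independent of the host byte order. */
uint32_t first_four_bytes_be(const unsigned char *key)
{
    return ((uint32_t)key[0] << 24) |
           ((uint32_t)key[1] << 16) |
           ((uint32_t)key[2] << 8)  |
            (uint32_t)key[3];
}

/* Hypothetical index: the first two bytes of the key, which correspond to
 * the first four characters of its hex representation. */
uint32_t index_from_key(const unsigned char *key)
{
    return first_four_bytes_be(key) >> 16;
}
```

For the array `{1, 2, 3, 4}` from the commit message, `first_four_bytes_be` returns `0x01020304` on both little endian and big endian hosts, so the derived index is `0x0102` everywhere.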
Matt Turner	1cb18fc98b	util: Fix SHA1 implementation on big endian The code defines a macro blk0(i) based on the preprocessor condition BYTE_ORDER == LITTLE_ENDIAN. If true, blk0(i) is defined as a byte swap operation. Unfortunately, if the preprocessor macros used in the test are not defined, then the comparison becomes 0 == 0 and it evaluates as true. Fixes: `d1efa09d34` ("util: import sha1 implementation from OpenBSD") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `532674303a`)	2017-12-08 18:27:00 +00:00
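The preprocessor pitfall described in the commit above is easy to reproduce. In an `#if` expression, identifiers that are not defined as macros are replaced by 0, so comparing two undefined names yields `0 == 0`. The sketch below uses deliberately undefined, hypothetical macro names (`HYPOTHETICAL_BYTE_ORDER`, `HYPOTHETICAL_LITTLE_ENDIAN`) rather than the real `BYTE_ORDER` and `LITTLE_ENDIAN`, and shows one possible guard using `defined()`; it is not the change made in the commit.

```c
/* Neither HYPOTHETICAL_BYTE_ORDER nor HYPOTHETICAL_LITTLE_ENDIAN is defined
 * anywhere, so both are replaced by 0 inside the #if expression below. */
#if HYPOTHETICAL_BYTE_ORDER == HYPOTHETICAL_LITTLE_ENDIAN
#define UNGUARDED_TEST_RESULT 1   /* taken: 0 == 0 is true */
#else
#define UNGUARDED_TEST_RESULT 0
#endif

/* Guarded variant: only compare when both macros actually exist. */
#if defined(HYPOTHETICAL_BYTE_ORDER) && defined(HYPOTHETICAL_LITTLE_ENDIAN) && \
    HYPOTHETICAL_BYTE_ORDER == HYPOTHETICAL_LITTLE_ENDIAN
#define GUARDED_TEST_RESULT 1
#else
#define GUARDED_TEST_RESULT 0     /* taken: the macros are not defined */
#endif

int unguarded_test_result(void) { return UNGUARDED_TEST_RESULT; }
int guarded_test_result(void)   { return GUARDED_TEST_RESULT; }
```

Compilers can flag the unguarded form: GCC and Clang report it when `-Wundef` is enabled.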
Matt Turner	4473097d92	i965/fs: Handle negating immediates on MADs when propagating saturates MADs don't take immediate sources, but we allow them in the IR since it simplifies a lot of things. I neglected to consider that case. Fixes: `4009a9ead4` ("i965/fs: Allow saturate propagation to propagate negations into MADs.") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103616 Reported-and-Tested-by: Ruslan Kabatsayev <b7.10110111@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `a05af1f7b8`)	2017-12-08 18:27:00 +00:00
Ben Crocker	63f44f2e0f	docs/llvmpipe: document ppc64le as alternative architecture to x86. Power8, Power8NV, and Power9 are supported on an equal footing with X86. Cc: "17.2" "17.3" <mesa-stable@lists.freedesktop.org> Signed-off-by: Ben Crocker <bcrocker@redhat.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> [Eric: changed formatting, reworded a bit (with Ben's ack)] Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> (cherry picked from commit `060eb314eb`)	2017-12-08 18:27:00 +00:00
James Legg	773282c1f8	nir/opcodes: Fix constant-folding of bitfield_insert Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104119 CC: <mesa-stable@lists.freedesktop.org> CC: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `947470d10b`)	2017-12-08 18:27:00 +00:00
Alex Smith	6c9c07b0f9	radv: Add LLVM version to the device name string Allows apps to determine the LLVM version so that they can decide whether or not to enable workarounds for LLVM issues. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Cc: "17.2 17.3" <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> (cherry picked from commit `8fda98c4f1`) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Conflicts: src/amd/vulkan/radv_device.c src/amd/vulkan/radv_private.h	2017-12-08 18:27:00 +00:00
Kenneth Graunke	85724a9a69	meta: Fix ClearTexture with GL_DEPTH_COMPONENT. We only handled unpacking for GL_DEPTH_STENCIL formats. Cemu was hitting _mesa_problem() for an unsupported format in _mesa_unpack_float_32_uint_24_8_depth_stencil_row(), because the format was depth-only, rather than depth-stencil. Cc: "13.0 12.0" <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94739 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103966 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (cherry picked from commit `8705ed13e3`)	2017-12-08 18:27:00 +00:00
Kenneth Graunke	47482090ea	meta: Initialize depth/clear values on declaration. This helps avoid compiler warnings in the next commit - everything was initialized, but it wasn't obvious to static analysis. Suggested-by: Tapani Pälli <tapani.palli@intel.com> (cherry picked from commit `d6d16c0218`)	2017-12-08 18:27:00 +00:00

94852 Commits