Comparing 7b6d81833a..b0efa68301 - mesa

Author	SHA1	Message	Date
Ian Romanick	2d2f1fd164	docs: Add some missing features to 9.0 release notes and GL3.txt Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-30 18:23:29 -07:00
Ian Romanick	0791484c42	mesa: Bump version to 9.0 Now that OpenGL 3.1 is supported by at least one driver, follow tradition and bump the major version number. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-30 18:23:28 -07:00
Marek Olšák	0e470533ad	r600g: enable transform feedback on Cayman There doesn't seem to be anything wrong with it.	2012-08-31 01:19:03 +02:00
Marek Olšák	64db3cc6ad	r600g: implement MSAA for Cayman Everything works except for blitting MSAA colorbuffers, which isn't so trivial on Cayman. It's a rarely-used feature anyway.	2012-08-31 01:19:03 +02:00
Anuj Phogat	f8a8f069ee	i965/msaa: flag _NEW_MULTISAMPLE in the brw_tracked_state This is required to get the program recompiled when SampleAlphaToCoverage is enabled. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2012-08-30 11:10:50 -07:00
Marek Olšák	c2e9dd0276	r600g: enable MSAA on r6xx by default DRM 2.22.0 is required though. Also require the new DRM for r700, as there are some important fixes for that generation too.	2012-08-30 19:43:56 +02:00
Marek Olšák	2f6eb3afb7	r600g: disable MSAA depth decompression on r6xx	2012-08-30 19:43:56 +02:00
Marek Olšák	78354011f9	r600g: implement color resolve for r600 The blend state is different and the resolve single-sample buffer must have FMASK and CMASK enabled. I decided to have one CMASK and one FMASK per context instead of per resource. There are new FMASK and CMASK allocation helpers and a new buffer_create helper for that.	2012-08-30 19:43:56 +02:00
Marek Olšák	863e2c85b9	r600g: fix CB_SHADER_MASK and CB_TARGET_MASK for r6xx	2012-08-30 19:43:56 +02:00
Marek Olšák	187d7fb2fe	r600g: implement draw_rectangle callback The color resolve on r6xx needs PT_RECTLIST. Using conventional primitive types (triangles and quads) produces an ugly line between two diagonally opposite corners. I guess a rectangular point sprite would work too.	2012-08-30 19:43:55 +02:00
Marek Olšák	8698a3b85d	r600g: implement MSAA for r700 Reviewed-by: Jerome Glisse <jglisse@redhat.com>	2012-08-30 19:43:55 +02:00
Marek Olšák	edf22a5c6d	r600g: change programming of CB_SHADER_MASK on r600-r700 This one actually makes more sense and gives the expected value for MSAA resolve. Reviewed-by: Jerome Glisse <jglisse@redhat.com>	2012-08-30 19:43:55 +02:00
Marek Olšák	1ff5f08823	configure.ac: require libdrm_radeon 2.6.39 for MSAA	2012-08-30 19:43:55 +02:00
Brian Paul	055093e33f	meta: remove call to _meta_in_progress(), fix multisample enable/disable This partially reverts `d638da23d2`. With gallium the meta code is not always built so the call to _meta_in_progress() was unresolved. Simply special-case the GL_MULTISAMPLE case in the meta code. There might be other special cases in the future given all the differences between legacy GL, core GL, GLES, etc. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=54234 and https://bugs.freedesktop.org/show_bug.cgi?id=54239 v2 (Paul Berry <stereotype441@gmail.com>): keep _meta_in_progress function, since it's needed by the i965 driver, but don't call it from core mesa. Signed-off-by: Brian Paul <brianp@vmware.com>	2012-08-30 08:28:19 -07:00
Brian Paul	aad7ccd261	meta: add parenthesis to silence compiler warnings Reviewed-by: Paul Berry <stereotype441@gmail.com>	2012-08-30 09:26:51 -06:00
Tapani Pälli	9121460f13	scons : add HAVE_DLOPEN to build environment fixes dlopen issue caused by `57c57df7b4` Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=54140 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-08-30 12:02:03 +01:00
Christian König	f1fd94f355	radeonsi: fix stupid bug added in commit `07838603b9` Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-30 10:23:32 +02:00
Eric Anholt	8393360659	i965/fs: Remove a dead member from live variables analysis. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-29 20:46:51 -07:00
Kenneth Graunke	6928bea7ca	i965/fs: Initialize output_components[] by filling it with zeros. Prior to commit `2f1869822`, emit_fb_writes() looped from 0 to 3, writing all four components of a vec4 color output. However, that broke for smaller output types (float, vec2, or vec3). To fix that, I introduced a new variable (output_components[]) containing the size of the output type for each render target. Unfortunately, I forgot to actually initialize it in the constructor, which meant that unless a shader wrote to gl_FragColor, or the specific output for each render target, output_components would contain a garbage value, and we'd loop for a completely non-deterministic amount of time. Not actually emitting any color writes seems like the right approach. We may still need to emit a render target write (to terminate the thread), but don't have to put in any sensible values (the shader didn't write anything, after all). Fixes a regression since `2f18698220`. NOTE: This is a candidate for stable release branches. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=54193 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> Tested-by: Ian Romanick <idr@freedesktop.org>	2012-08-29 15:10:57 -07:00
Ian Romanick	42723d88d3	mesa: Do something sensible when on-line compression is requested but not possible It is possible to force S3TC extensions to be enabled. This is generally done to support applications that will only supply pre-compressed textures. This accounts for the vast majority of applications. However, there is still the possibility of an application asking for on-line compression. In that case, generate a warning and substitute a generic compressed format. The driver will either pick an uncompressed format or a compressed format that Mesa can handle on-line (e.g., FXT1). This should only cause problems for applications that request on-line compression and read the compressed texture back. This is likely an infinitesimal subset of an already infinitesimal subset. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-29 15:09:38 -07:00
Ian Romanick	0e0d664461	i965: Allow creation of OpenGL 3.1 contexts v2: Fix API_OPENGL_CORE handling when TEXTURE_FLOAT_ENABLED is not defined. Based on review feedback from Eric Anholt. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-29 15:09:38 -07:00
Ian Romanick	2a33a99737	i965: Advertise GLSL 1.40 and TexBOs in core contexts Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:38 -07:00
Ian Romanick	91473485fc	intel: Clean up bits of cruft in intelCreateContext This and the previous three commits should probably be squashed together... Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:37 -07:00
Ian Romanick	bf8644e64d	i965: Set context flags Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:37 -07:00
Ian Romanick	ca2b1fcb30	mesa/dri: Allow creation of forward-compatible contexts This is done by changing the API to API_OPENGL_CORE. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:37 -07:00
Ian Romanick	36ceabfb74	mesa/es: Enable GL_OES_vertex_array_object Functionally the same as GL_ARB_vertex_array_object. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-29 15:09:37 -07:00
Ian Romanick	35cf6aeb8c	mesa: Enable GL_{ARB,APPLE}_vertex_array_object in all drivers This is a purely software extension. The drivers don't need to do any work to support it. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-29 15:09:37 -07:00
Ian Romanick	d1cf5c77b7	meta: Don't use deprecated keyword in 1.30 shader Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:37 -07:00
Ian Romanick	ae88281b7b	mesa: Disallow alpha, luminance, and LA textures in core context Also disallow the 1, 2, 3, and 4 formats. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:37 -07:00
Ian Romanick	04d6ffa06d	mesa: Disallow more deprecated functions in core context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:37 -07:00
Ian Romanick	91107b4ccf	mesa: Require names from Gen in core context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:37 -07:00
Ian Romanick	843b876ba3	mesa: Allow NULL vertex pointer without a VBO There is text in the OpenGL 3.x specs to explicitly allow this case. Weird. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:37 -07:00
Ian Romanick	792214e8d4	mesa: Disallow VertexAttribPointer without a VAO in a core context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:37 -07:00
Ian Romanick	29512df635	mesa: Disallow wide lines in forward compatible context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:37 -07:00
Ian Romanick	7e1cab09a1	mesa: Only FRONT_AND_BACK is allowed for PolygonMode in core context Page 407 (page 423 of the PDF) of the OpenGL 3.0 spec says (in the list of deprecated functionality): "Separate polygon draw mode - PolygonMode face values of FRONT and BACK; polygons are always drawn in the same mode, no matter which face is being rasterized." Also modify meta to not use FRONT or BACK in a core context. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:36 -07:00
Paul Berry	d638da23d2	meta: Don't stray outside the confines of the API specified in the context Signed-off-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:36 -07:00
Ian Romanick	8e7b6a69e9	mesa: Don't allow display lists or evaluators in core context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:36 -07:00
Ian Romanick	2bcf555490	mesa: Don't allow GL_EXTENSIONS query in core context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:36 -07:00
Ian Romanick	c85a9a9996	mesa: Non-sprite points are deprecated Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:36 -07:00
Eric Anholt	7d8d1c7819	mesa: Fix VAO deletion on GL 3.1 core. We were calling through a dispatch table entry that was NULL, since the apple variant is only on legacy desktop. Just call the function we mean instead of indirecting through the dispatch.	2012-08-29 15:09:36 -07:00
Eric Anholt	8a4d560796	mesa: Enable a bunch of missing getters on 3.1 core. NOTE: maybe I enabled too many?	2012-08-29 15:09:36 -07:00
Eric Anholt	bb4a39ec95	mesa: Expose texture buffer objects when the context is GL 3.1 core. v2: Use API_OPENGL_CORE. v3: Only require desktop GL. If a driver can't support TexBOs in a non-core context, it should not enable them. Signed-off-by: Eric Anholt <eric@anholt.net> Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-29 15:09:36 -07:00
Ian Romanick	1b86a91c64	mesa: Allow PACK / UNPACK queries for ES2 These are part of the GL_EXT_unpack_subimage extension and ES 3.0. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:36 -07:00
Ian Romanick	a010215463	mesa: Kill ES2 wrapper functions v2: Fix completely broken condition around ClearColorIiEXT and ClearColorIuiEXT. v3: Add special VertexAttrib handling for ES2. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:36 -07:00
Ian Romanick	fc2219e448	mesa: glGetVertexAttribPointerv is part of core profile and ES2 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:36 -07:00
Ian Romanick	917f68071b	mesa/es: Validate glPointParameter pname in Mesa code rather than the ES wrapper v2: Add proper core-profile filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-29 15:09:36 -07:00
Ian Romanick	f778174ea1	mesa: Require OpenGL 2.0 for GL_POINT_SPRITE_COORD_ORIGIN The comment in the code even says this is the right thing to do. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-29 15:09:36 -07:00
Ian Romanick	25ffb86893	mesa: Require that drivers supporting point sprites support point parameters All drivers in Mesa do. This allows a lot of extension checking code to be gutted from the function. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-29 15:09:35 -07:00
Ian Romanick	33e01d93ca	mesa/es: Validate glGetTexEnv parameters in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	8a263b6efd	mesa/es: Validate glTexEnv parameters in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	d2b03f6e99	mesa/es: Validate glGetTexGen parameters in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	f329adfa49	mesa/es: Validate glTexGen parameters in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	0fa4ed05cf	mesa/es: Validate glLightModel pname in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	fb4f2d3425	mesa/es: Validate glMaterial face and pname in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	8df3f9bd5f	mesa/es: Validate glGetMaterial pname in Mesa code rather than the ES wrapper Fixes a bug that glGetMaterial[fx]v in ES1 contexts would (try to) allow queries of GL_AMBIENT_AND_DIFFUSE. This enum can only be used in glMaterial, not in the get. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	9555d7bdc1	mesa/es: Validate glGetPointerv pname in Mesa code rather than the ES wrapper v2: Add proper core-profile, GLES1, and GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	d6c8913bc6	mesa/es: Validate glMatrixMode mode in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	10e7db1ccf	mesa/es: Validate glFog pname in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	b7c7e5e45a	mesa/es: Validate glReadPixels format and type in Mesa code rather than the ES wrapper v2: Add proper GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	4114dee99e	mesa/es: Validate glPixelStore pname in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:35 -07:00
Ian Romanick	08be1d288f	mesa/es: Validate glEnable cap in Mesa code rather than the ES wrapper Also handle glDisable, glIsEnabled, glEnableClientState, and glDisableClientState. v2: Add proper core-profile and GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Ian Romanick	bca2cece02	mesa/es: Validate glHint target in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Ian Romanick	2c87030a00	mesa/es: Validate glGetVertexAttribf pname in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. v3: Allow glGetVertexAttribfv(0, GL_CURRENT_VERTEX_ATTRIB_ARB, param) in OpenGL 3.1, just like OpenGL ES 2.0. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Ian Romanick	c13f36ce4e	mesa/es: Validate glGetString pname in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Ian Romanick	6a9b8f897a	mesa/es: Validate primitive modes in Mesa code rather than the ES wrapper v2: Add proper core-profile filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Ian Romanick	72e076cb17	mesa: Refactor _mesa_valid_prim_mode to use a switch-statement This makes the next change a bit easier. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Ian Romanick	01497a3560	mesa/es: Validate blend function enums in Mesa code rather than the ES wrapper v2: Add proper core-profile filtering. v3: Allow GL_SRC_ALPHA_SATURATE as a destination factor in GLES3. Based on review feedback from Eric Anholt. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Ian Romanick	e58c19a204	mesa/es: Validate glClear mask in Mesa code rather than the ES wrapper	2012-08-29 15:09:34 -07:00
Ian Romanick	f0c99d0a6a	mesa/es: Validate glRenderbufferStorage internalFormat in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. v3: Allow GL_RGB10_A2UI in GLES3 based on review feedback from Eric Anholt. v4: Arg. Reject unsized RED and RG enums on GLES. More feedback from Eric. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Ian Romanick	ae86ebfcc9	mesa/es: Validate glGetRenderbufferParameter pname in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-29 15:09:34 -07:00
Ian Romanick	0cdaa471ec	mesa/es: Validate glGetFramebufferAttachmentParameter pname in Mesa code rather than the ES wrapper v2: Add proper core-profile, GLES1, and GLES3 filtering. v3: Fix the GL_FRAMEBUFFER_ATTACHMENT_OBJECT_NAME query when the attachment type is GL_NONE on GLES3. Other cleanups. Based on review feedback from Eric Anholt. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Ian Romanick	5b44a77428	mesa/es: Validate glGenerateMipmap target in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. v3: Fix a typo in GL_TEXTURE_2D_ARRAY checking. v4: Change !_mesa_is_desktop_gl tests to _mesa_is_gles test. The test around GL_TEXTURE_2D_ARRAY got some other changes because that enum is also available with GLES3 (which uses API_OPENGLES2). Based on review feedback from Eric Anholt. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Ian Romanick	7f991d26ad	mesa/es: Validate glFramebufferTexture2D textarget in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. v3: Change !_mesa_is_desktop_gl tests to _mesa_is_gles test. The test around GL_TEXTURE_2D_ARRAY got some other changes because that enum is also available with GLES3 (which uses API_OPENGLES2). Based on review feedback from Eric Anholt. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-29 15:09:34 -07:00
Tom Stellard	2809ae3d44	radeon/llvm: Fix encoding of FP immediates on SI	2012-08-29 15:52:10 -04:00
Tom Stellard	05113fd266	radeon/llvm: Create a register class for the M0 register The Common Subexpression Elimination pass will not operate on instructions with physical register defs, so we end up with several redundant copies to M0 when using interpolation. Adding a register class that only contains the M0 register allows us to use a virtual register to represent M0, and makes it possible for the Common Subexpression Elimination pass to remove the extra copies.	2012-08-29 15:52:10 -04:00
Tom Stellard	733c28a0d9	radeon/llvm: Set the neverHasSideEffects bit on more instructions This flag makes these instructions candidates for the dead code elimination and common subexpression elimination.	2012-08-29 15:52:10 -04:00
Tom Stellard	cf4ac69928	radeon/llvm: Declare the interpolation intrinsics as ReadOnly This signals to the Dead Code Elimination pass that it is safe to remove these instructions when they are dead.	2012-08-29 15:52:10 -04:00
Tom Stellard	73a2c4b9db	radeon/llvm: Mark M0 as a def when lowering interpolation instructions	2012-08-29 15:52:10 -04:00
Anuj Phogat	0fc11a24c8	meta: Add GLSL variant of _mesa_meta_GenerateMipmap() function This reduces the overhead of using the fixed function internally in the driver. V2: Use setup_glsl_generate_mipmap() and setup_ff_generate_mipmap() functions to avoid code duplication. Use glsl version when ARB_{vertex, fragment}_shader are present. Remove redundant code. V3: Remove redundant border related code leaving the assertion. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Ian Romanick <idr@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-29 11:43:52 -07:00
Brian Paul	c824804c6f	glsl: s/class/struct/ for ast_type_qualifier To silence an MSVC compiler warning about class vs. struct. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-29 12:08:46 -06:00
Brian Paul	ec6478fd32	mesa: convert a few more macros to inline functions	2012-08-29 08:20:58 -06:00
Brian Paul	cf41d7c63a	mesa: remove COPY_4V_CAST() macro Only used in one place, and not really needed.	2012-08-29 08:20:58 -06:00
Brian Paul	fd9afb87d8	mesa: convert a bunch of math macros to inline functions	2012-08-29 08:20:58 -06:00
Brian Paul	454e23776d	tnl: use INTERP_4F() instead of four INTERP_F() calls	2012-08-29 08:20:58 -06:00
Brian Paul	ba6f47132d	swrast: fix wrong assignments in _swrast_add_spec_terms_line()	2012-08-29 08:20:58 -06:00
Brian Paul	1aee8803f8	mesa: test for GL_EXT_framebuffer_sRGB in glPopAttrib() To avoid spurious GL_INVALID_ENUM errors if the extension isn't supported.	2012-08-29 08:20:57 -06:00
Martin Pieuchot	c4c4d4ad1e	mesa: Define CPU_TO_LE32 to work on OpenBSD Signed-off-by: Brian Paul <brianp@vmware.com>	2012-08-29 08:05:17 -06:00
Brian Paul	4aede0018a	docs: remove mention of old driver maintenance People who need old drivers can use older versions of Mesa.	2012-08-28 13:09:02 -06:00
Andreas Boll	6eaccbfeeb	docs/utilities: add/update some useful utilities. The progs/util directory is now in mesa demos; replace glean with piglit; add ApiTrace. Markup: replace the unordered list <ul> with a definition list <dl>. Signed-off-by: Brian Paul <brianp@vmware.com>	2012-08-28 13:08:56 -06:00
Eric Anholt	67e9ae8563	i965: Disable the swrast context setup on GL 3.1 core. I've reviewed the code, and the swrast callsites remaining are all in drawpixels/copypixels/bitmap/accum, or _swrast_BlitFramebuffer that shouldn't be hit. A piglit run with the context setup disabled on legacy GL and GLES2 showed regressions only in the copypixels and drawpixels tests. If the context type is forced, this reduces the shader_runner maximum heap size for glsl-algebraic-add-add-1.shader_test from 15,137,496b to 4,165,376b. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-28 11:43:04 -07:00
Eric Anholt	993c52d0be	i965: Replace general sw fallback support with a manual check for rendermode. There were no other cases that set it any more. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-28 11:43:04 -07:00
Eric Anholt	b0d23b66cf	intel: Move RenderMode fallback func to i915 driver. The Fallback field of the context struct doesn't work that way on i965, and it's the only caller of FALLBACK() in the driver. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-28 11:43:04 -07:00
Eric Anholt	628dfe9511	i965: Drop the old sw fallback for position array being disabled. This code has been in the driver since the first commit. I think it was trying to stop rendering from happening with a disabled position array. Core mesa has since had changes to deal with disabled position arrays correctly. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-28 11:43:04 -07:00
Eric Anholt	5e3c093ff8	i965: Drop support for forcing drawing through sw fallbacks. It turns out it hasn't worked since at least 8.0. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-28 11:43:04 -07:00
Eric Anholt	bfae8650ec	i965: Move depth resolve for span fallbacks to a simpler place. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-28 11:43:04 -07:00
Eric Anholt	707f242c4b	i965: Drop manual hiz resolves in span rendering. swrast uses MapRenderbuffer, which leads to intel_miptree_map, which does the depth resolve. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-28 11:43:04 -07:00
Michel Dänzer	70f9dbe298	radeon/llvm: Handle TGSI KIL opcode for SI. Fixes piglit fp-kil and glBitmap() with radeonsi. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-28 20:27:23 +02:00
Michel Dänzer	16e42a5dd0	radeon/llvm: Basic support for SI EXEC register. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-28 20:26:50 +02:00
Michel Dänzer	6ca64393c9	radeonsi: Don't write to the PA_SC_RASTER_CONFIG register. It should be initialized by the kernel as necessary. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2012-08-28 20:24:52 +02:00
Marek Olšák	999b7f6665	r600g: fix relative addressing on RS780 and RS880 They should be treated like RV670. Tested-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-28 18:27:03 +02:00
Andreas Boll	3e20605c16	docs/helpwanted: add radeonsi todo list Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-28 17:36:07 +02:00
Andreas Boll	17f09b664b	configure.ac: add radeonsi to --with-gallium-drivers help string the help string is used by ./configure --help Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-28 17:35:36 +02:00
José Fonseca	bc8509b43b	llvmpipe: Bump the maximum texture size (in pixels). But cap the size in bytes to avoid depleting the whole system memory with humongous textures. Tested with max-texture-size piglit test. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-28 15:18:43 +01:00
Vadim Girlin	6463eb013f	u_vbuf: avoid unnecessary update of the vertex elements Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-08-28 18:01:13 +04:00
Matt Turner	971750e1cd	egl: fix invalid flag detection for EGL_KHR_create_context We want to check whether there are bits set outside of the valid flags. Fixes piglit test egl-create-context-invalid-flag-gl Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-27 15:11:11 -07:00
Kenneth Graunke	77d675926a	i965: Make VS programs obey the shader_precompile driconf option. Now that it's on by default, we may as well make it obey the flag, for consistency's sake if nothing else. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-27 14:23:40 -07:00
Kenneth Graunke	9ef710575b	i965: Reenable the fragment shader precompile. Precompiling the shader at link time often allows us to avoid compiling it at the first use. This moves the expensive compilation and optimization process to game or level load time, rather than at draw time, where we really can't avoid any cycles and don't want to risk stalling the GPU. The downside is that we have to guess the non-orthogonal state the program will have set when it draws with the shader. Previously, we guessed wrong for nearly every shader, so it wasn't useful. With the recent SamplerUnits rework and this series, we've either eliminated state or made smarter guesses, and usually get it right now. In the L4D2 time demo, I now have 39 fragment shader recompiles and no vertex shader recompiles. Before this series and the SamplerUnits rework, I had 206 fragment shader recompiles and 192 vertex shader recompiles. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-27 14:23:40 -07:00
Kenneth Graunke	88b3850c27	i965: Set swizzle fields in the VS precompile program key. This fixes a regression since `76d1301e8e`: I began setting SWIZZLE_XYZW for unused sampler units in the actual program keys, since this matched the FS precompile behavior. However, the VS precompile was expecting zero, so that commit made essentially every vertex shader (even those not using texturing) mismatch and need to be recompiled. Setting them in the VS precompile key solves the issue. It also is an improvement over our old behavior: previously we guessed that vertex shaders didn't use any textures at all. Now we actually look to see if the VS had any sampler uniforms and guess based on that. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-27 14:23:40 -07:00
Kenneth Graunke	c20cb8d1f6	i965/vs: Add VS program key dumping to INTEL_DEBUG=perf. Eric added support for WM key debugging. This adds it for the VS. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-27 14:23:40 -07:00
Kenneth Graunke	85b24b0751	i965/fs: Assume shadow sampler swizzling is <X, X, X, 1>. Our previous assumption, SWIZZLE_XYZW, was completely bogus for depth textures. There are no Y, Z, or W components. DEPTH_TEXTURE_MODE has three options: - GL_LUMINANCE: <X, X, X, 1> - GL_INTENSITY: <X, X, X, X> - GL_ALPHA: <0, 0, 0, X> The default value is GL_LUMINANCE, and most applications don't seem to alter DEPTH_TEXTURE_MODE. Make that our precompile guess. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-27 14:23:40 -07:00
Kenneth Graunke	f3d0daf7ea	i965: Index sampler program key data by linker-assigned index. Now that most things are based on the linker-assigned index, it makes sense to convert the arrays in the VS/WM program key as well. It seems silly to leave them indexed by texture unit. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-27 14:23:40 -07:00
Kenneth Graunke	ab17762c70	i965: Only set proj_attrib_mask for fixed function. brw_wm_prog_key's proj_attrib_mask field is designed to enable an optimization for fixed-function programs, letting us avoid projecting attributes where the divisor is 1.0. However, for shaders, this is not useful, and is pretty much impossible to guess when building the FS precompile key. Turning it off for shaders should allow the precompile to work and not lose much. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Suggested-by: Eric Anholt <eric@anholt.net> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-27 14:23:40 -07:00
Kenneth Graunke	6cc14c2493	i965: Don't set stats_wm in the WM program key on Gen6+. It's only needed for Gen4/5 IZ lookup workarounds. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-27 14:23:40 -07:00
Kenneth Graunke	b6b1fc1261	i965: Don't set vp_outputs_written in the WM program key on Gen6+. It's only used on pre-Sandybridge hardware. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-27 14:23:39 -07:00
Kenneth Graunke	87cdefed40	i965: Double the size of the state cache. We probably want to do something more sophisticated here, but this at least makes it through L4D2 without dumping the program cache. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-27 14:23:39 -07:00
Julien Cristau	ac889b2410	glapi/glx: call __glEmptyImage if USE_XCB, not memcpy directly We were stomping on the caller's buffer by ignoring their alignment requests and other pixel store modes. This patch makes the USE_XCB path match the older one more closely. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=52059 Signed-off-by: Julien Cristau <julien.cristau@logilab.fr> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-08-27 13:32:53 -06:00
Brian Paul	f308c80490	gallium/util: implement tile code for PIPE_FORMAT_Z32_FLOAT Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-27 13:32:53 -06:00
Brian Paul	a971476cc7	st/mesa: use fallback path for glCopyTexSubImage(GL_TEXTURE_1D_ARRAY) Fixes many failing cases in piglit copyteximage test. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-27 13:32:53 -06:00
Chad Versace	88edbdf9f0	i965: Move hiz resolve to after renderbuffer resizing (v2) Do all pre-draw hiz resolves after the renderbuffers are resized by intel_prepare_render. Otherwise, we may resolve buffers that are immediately discarded afterwards. Fixes the assertion failure below when resizing windows in KDE and under some unknown circumstance in Chrome OS: intel_resolve_map.c:46: intel_resolve_map_set: Assertion `(*tail)->need == need' failed. Also, remove the comment that "resolves must occur [...] before setting up any hardware state". That was true when resolves were implemented with meta-ops, but no longer with blorp. v2: - Keep brw_predraw_resolve_buffers in its current position, which is before any brw_context bits are modified. Instead, move the call to intel_prepare_render. Note: This is a candidate for the 8.0 branch. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=52252 Reported-by: Lu Hua <huax.lu@intel.com> Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-27 07:48:28 -07:00
Chad Versace	a2a7e640a4	i965: Remove redundant null check intel_renderbuffer_resolve_hiz checks if rb->mt is null, so there is no need for the caller to do so. Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-27 07:47:09 -07:00
Marek Olšák	7f0fcf17c3	r300g: implement TRUNC correctly This fixes some integer division tests.	2012-08-27 14:35:18 +02:00
Michel Dänzer	f402acdbe2	radeonsi: Use FP16 shader export format when necessary / possible. Fixes piglit fbo-blending-formats. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-27 11:51:56 +02:00
Michel Dänzer	26c7139d2c	radeonsi: Refactor initialization of shader export intrinsic arguments. In preparation for extending this code, which would make it rather unwieldy in its current place. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-27 11:51:49 +02:00
Michel Dänzer	d1e40b3d40	radeonsi: Maintain cache of pixel shader variants according to context state. Mostly inspired by r600g commit `4acf71f01e` ('r600g: cache shader variants instead of rebuilding v3'). Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-27 11:51:41 +02:00
Michel Dänzer	84fdda280f	radeonsi: Drop extraneous semicolons from pm4 state macro definitions. Could cause build failures if trying to use the macros in certain constructs. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-27 11:50:38 +02:00
Marek Olšák	a3d9d7ec79	r600g: implement compression for MSAA colorbuffers for evergreen This adds the FMASK and CMASK buffers. They share the same resource with color data. COMPRESSION and FAST_CLEAR are always enabled if both FMASK and CMASK are allocated. We initialize the CMASK to a "compressed" state (not "fast cleared"), so that we can keep FAST_CLEAR enabled all the time. Both FMASK and CMASK must be present at the moment. If either one is missing, the other one is not used. v2: add cayman regs in the list Reviewed-by: Jerome Glisse <jglisse@redhat.com>	2012-08-27 04:31:00 +02:00
Marek Olšák	48edfe0505	r600g: cleanup names around depth decompression for consistency with the upcoming color decompression naming Reviewed-by: Jerome Glisse <jglisse@redhat.com>	2012-08-27 04:31:00 +02:00
Marek Olšák	3ac54ac2c8	r600g: fix evergreen 8x MSAA sample positions The original samples positions took samples outside of the pixel boundary, leading to dark pixels on the edge of the colorbuffer, among other things. Reviewed-by: Jerome Glisse <jglisse@redhat.com>	2012-08-27 04:31:00 +02:00
Marek Olšák	1cfec6e2c8	r600g: set CB_TARGET_MASK to 0xf and not 0xff for resolve on evergreen independent_blend_enable must be true, so that the colormask isn't replicated in all colorbuffers. Reviewed-by: Jerome Glisse <jglisse@redhat.com>	2012-08-27 04:30:59 +02:00
Marek Olšák	1516a4f353	gallium/u_blitter: initialize sample mask in resolve Reviewed-by: Jerome Glisse <jglisse@redhat.com>	2012-08-27 04:30:59 +02:00
Tom Stellard	07c71d6ede	r300/compiler: Use variable lists in the rename_regs pass	2012-08-26 20:39:49 -04:00
Eric Anholt	7540f25a34	i965: Rewrite the comment describing the query object support. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-26 10:40:33 -07:00
Eric Anholt	f0159018d7	i965/gen6+: Add support for GL_ARB_timer_query. Needs updated libdrm. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-26 10:40:33 -07:00
Eric Anholt	9a2943ddf2	i965: Add support for GL_ARB_occlusion_query2. This extension is just a bit of core code on top of the GL_ARB_occlusion_query support. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-26 10:40:33 -07:00
Eric Anholt	b765119c5d	mesa: Add constants for the GL_QUERY_COUNTER_BITS per target. Drivers need to be able to communicate their actual number of bits populated in the field in order for applications to be able to properly handle rollover. There's a small behavior change here: Instead of reporting the GL_SAMPLES_PASSED bits for GL_ANY_SAMPLES_PASSED (which would also be valid), just return 1, because more bits don't make any sense. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-26 10:40:28 -07:00
Eric Anholt	6754ec831e	i965: Fix accumulator_contains() test to also reject swizzles of the dst. When faced with this sequence: MOV R1, c[1]; MAD R0, R2, R1.x, R1.y; we were concluding that the MOV of R1 set up our accumulator and so we could just use the previous result. Only, it's got R1.xyzw in it instead of the R1.y we're looking for. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=46784 NOTE: This is a candidate for the 8.0 branch.	2012-08-26 09:58:40 -07:00
Jakob Bornecrantz	33ee019422	st/dri: Support width and height getters Tested-by: Scott Moreau <oreaus@gmail.com> Signed-off-by: Jakob Bornecrantz <jakob@vmware.com>	2012-08-26 15:40:18 +02:00
Jakob Bornecrantz	15effe1fab	st/dri: Claim to support validate_usage Support version 3 as well as 2, since that is only the new format query, which Jesse added support for to st/dri when he added it to dri_interface.h. Tested-by: Scott Moreau <oreaus@gmail.com> Signed-off-by: Jakob Bornecrantz <jakob@vmware.com>	2012-08-26 15:40:10 +02:00
Jakob Bornecrantz	93ebec87ed	dri: Make query image WIDTH and HEIGHT be version 4 Tested-by: Scott Moreau <oreaus@gmail.com> Signed-off-by: Jakob Bornecrantz <jakob@vmware.com>	2012-08-26 15:39:50 +02:00
Jakob Bornecrantz	6bb71b8cbe	dri: Remove image write function Since it's not used by anything anymore and no release has gone out where it was being used. Tested-by: Scott Moreau <oreaus@gmail.com> Signed-off-by: Jakob Bornecrantz <jakob@vmware.com>	2012-08-26 15:39:41 +02:00
Jakob Bornecrantz	a669a5055e	gbm: Use libkms to replace DRI cursor images Uses libkms instead of the DRI image cursor. Since this is the only user of the DRI cursor and write interface, we can remove cursor surfaces entirely from the DRI interface and, as a consequence, from the Gallium interface as well. To make everybody happy with this we should probably add a kms_bo_write function, but that is probably wise anyway. The only downside is that it adds a dependency on libkms; this could, however, be replaced with the dumb_bo drm ioctl interface. Tested-by: Scott Moreau <oreaus@gmail.com> Signed-off-by: Jakob Bornecrantz <jakob@vmware.com>	2012-08-26 15:39:23 +02:00
Kenneth Graunke	a3685544e1	i965: Don't set iz_lookup in the FS precompile's program key on Gen6+. We already changed the actual program key builder to only set these bits on gen < 6; this patch just brings the precompile state back in line so it doesn't mismatch every time. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 23:05:35 -07:00
Kenneth Graunke	98211d5af7	i965/fs: Fix INTEL_DEBUG=perf program key printing. When dumping differences in program keys, it printed messages of the format: [Name of thing that changed] [new]->[old] This was terribly confusing: the right arrow implies "the value changed from this to that", when in fact the message conveyed the opposite. Except that some of the time, it didn't, since we accidentally swapped the arguments to brw_debug_recompile_sampler_key. With two swaps, it would often come out in the expected format. This patch fixes it to properly print: [Name of thing that changed] [old]->[new] Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-25 23:01:50 -07:00
Kenneth Graunke	174d44a9c4	mesa: Use a new, more specific hook for shader uniform changes. Gallium drivers and i965 don't require special notification when sampler uniforms change. They simply see the _NEW_TEXTURE and adjust their indirection tables. These drivers don't want ProgramStringNotify: it simply causes pointless recompiles. Unfortunately, i915 still requires shader recompiles and needs ProgramStringNotify. Rather than trying to fix that, simply change the hook to a new, more specific one: ShaderUniformChange. On i915, this translates to ProgramStringNotify; others simply ignore it. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 12:01:10 -07:00
Kenneth Graunke	85e8e9e000	i965: Use linker-assigned sampler IDs in instruction encoding. When assigning uniform locations, the linker assigns each sampler uniform a sequential numerical ID. gl_shader_program::SamplerUnits maps these sampler variable IDs to the actual texture units they reference (specified via glUniform1i). Previously, we encoded this mapping in the SEND instruction encoding: the "sampler" was the texture unit number, and the binding table index was SURF_INDEX_TEXTURE(the texture unit number). This unfortunately meant that whenever the application changed the value of a sampler uniform, we had to recompile the shader to change the SEND instructions. This was horrible for the game Cogs, which repeatedly switches between using texture unit 0 and 1. It also made fragment shader precompiles useless: we'd do the precompile at glLinkShader() time, before the application called glUniform1i to set the sampler values. As soon as it did that, we'd have to recompile, wasting time and space in the program cache. This patch encodes the SamplerUnits indirection in the binding table, sampler state, and sampler default color tables. Instead of baking the texture unit number into the shader, we bake in the sampler variable ID assigned by the linker. Since those never change, we don't need to recompile programs on uniform changes. This does mean that the tables now depend on the linked shader program being used for rendering, rather than simply representing all available texture units. This could cause an increase in state emission. Another plus is that the sampler state and sampler default color tables are now compact: we only emit as many entries as there are sampler uniforms, with no holes in the table since the new sampler IDs are sequential. Previously we had to emit a full 16 entries every time, since the tables tracked the state of all active texture units. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 12:01:10 -07:00
Kenneth Graunke	2faa592e7f	i965: Add a "sampler state index" parameter to update_sampler_state(). This represents the index into the sampler state table or sampler default color table (the two are identical). Right now, this is still the texture unit, but that will change shortly. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 12:01:10 -07:00
Kenneth Graunke	28fab4295e	i965: Un-hardcode WM binding table from update_texture_surface. Currently, we mirror the VS and WM binding tables' texture entries. That may not continue to be true, so in preparation, pass in the binding table and surface index as arguments. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 12:01:10 -07:00
Kenneth Graunke	96a22f3583	i965/vs: Rename "sampler" to "texunit" in texturing code. The number we're passing around is actually the ID of the texture unit, as opposed to the numerical value of our sampler uniforms. Calling it "texunit" clarifies this slightly. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 12:01:09 -07:00
Kenneth Graunke	0ad2dce24a	i965/fs: Rename "sampler" to "texunit" in texturing code. The number we're passing around is actually the ID of the texture unit, as opposed to the numerical value of our sampler uniforms. Calling it "texunit" clarifies this slightly. Don't bother renaming fs_instruction::sampler. Although it's currently the texture unit, this series will change that. No need for the churn. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 12:01:09 -07:00
Kenneth Graunke	bf0308d8d6	i965/fs: Remove unused 'sampler' parameter in emit_texture_genX(). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 12:01:09 -07:00
Kenneth Graunke	76d1301e8e	i965: Set SWIZZLE_NOOP for unused texture units in the program keys. Previously, we left the swizzle key field as zero for unused texture units. The precompile sets all of them to SWIZZLE_NOOP, which meant that we mismatched almost every time. Since either works equally well, change it to SWIZZLE_NOOP to match the precompiles. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 12:01:09 -07:00
Kenneth Graunke	f510dd5d60	i965: Remove four and a half year old TODO comments about samplers. I can't actually understand what these mean, and they seem to essentially say "we should simplify things", which is a nice goal but not very specific. Presumably things got cleaned up at some point. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 12:01:09 -07:00
Kenneth Graunke	d1447f5bc9	i965: Fix brw_link_shader to return false rather than NULL. Fixes brw_shader.cpp:101:9: warning: converting to non-pointer type 'GLboolean {aka unsigned char}' from NULL [-Wconversion-null] Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-with-great-enthusiasm-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-25 12:01:09 -07:00
Ian Romanick	f9767dac9a	mesa/es: Validate glGetBufferParameteriv pname in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-24 19:15:20 -07:00
Ian Romanick	93d109645a	mesa/es: Validate glMapBuffer access in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. v3: Really add proper core-profile and GLES3 filtering based on review feedback from Eric Anholt. It looks like previously there was some rebase / merge fail. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-24 19:13:18 -07:00
Ian Romanick	bd4e5dd355	mesa/es: Validate glBufferData usage in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering based on review feedback from Eric Anholt. It looks like previously there was some rebase / merge fail. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-24 19:13:18 -07:00
Ian Romanick	b0b6b76d52	mesa/es: Validate buffer object targets in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-24 19:13:18 -07:00
Ian Romanick	e2cf14d7b2	mesa/es: Validate VertexPointer types in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 19:13:18 -07:00
Ian Romanick	ef723ecce4	mesa/es: Remove redundant vertex pointer size validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 19:13:18 -07:00
Ian Romanick	a8f475d8f6	mesa/es: Validate TexCoordPointer size in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 19:13:18 -07:00
Ian Romanick	c3e9a207d0	mesa/es: Validate TexCoordPointer types in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 19:13:18 -07:00
Ian Romanick	e5ef0cbe0e	mesa/es: Validate NormalPointer types in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 19:13:18 -07:00
Ian Romanick	fb8218508a	mesa/es: Validate ColorPointer size in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 19:13:17 -07:00
Ian Romanick	07ccfef8d1	mesa/es: Validate ColorPointer types in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 19:13:17 -07:00
Ian Romanick	28ee443d7b	mesa/es: Remove redundant vertex attrib pointer type validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 19:13:17 -07:00
Ian Romanick	ae633d0b2e	mesa/es: Remove redundant vertex attrib pointer size validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 19:13:17 -07:00
Ian Romanick	946ddec163	mesa/es: Disallow BGRA vertex arrays in ES or ES2 contexts Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 19:13:17 -07:00
Ian Romanick	bbceed268e	mesa: Rearrange array type checking, filter more types in ES v2: Fix handling of GL_INT and GL_UNSIGNED_INT types pre-ES3.0, and fix handling of GL_INT_2_10_10_10_REV and GL_UNSIGNED_INT_2_10_10_10_REV in ES3.0. Based on review comments by Ken Graunke. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-24 19:13:17 -07:00
Ian Romanick	a33f360e8f	mesa: Refactor element type checking into its own function This consolidates the tests and makes the emitted error message consistent. v2: Rename _mesa_valid_element_type to valid_elements_type. Log the enum string instead of the hex value in error messages. Based on review comments from Brian Paul and Ken Graunke. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-24 19:13:12 -07:00
Brian Paul	229868edf7	wgl: update some comments	2012-08-24 14:09:03 -06:00
Brian Paul	4b7c0938e4	st/mesa: don't do (generic) compression of 1D or 1D_ARRAY textures As with the previous commit for core Mesa. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-08-24 14:09:03 -06:00
Brian Paul	a3af27e993	mesa: add generic compressed -> uncompressed format helper _mesa_generic_compressed_format_to_uncompressed_format() probably wins the prize for longest function name in Mesa. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-08-24 14:09:03 -06:00
Brian Paul	13d0bb21a9	mesa: don't try (generic) compression of 1D and 1D_ARRAY textures See comments in the code for details. Note: we only need to special-case the generic compressed formats since specific texture formats are error-checked earlier to see if the compression format is compatible with the texture type. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-08-24 14:09:03 -06:00
Brian Paul	d47a6ada9c	mesa: add texture target field to ChooseTextureFormat() driver hook This will let us choose the actual hardware format depending on the type of texture. v2: fixup radeon, nouveau, intel and swrast drivers too Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-24 14:08:57 -06:00
Brian Paul	ba7218061b	xlib: remove texture compression hackery I think this was left-over debug code from long ago. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 13:15:27 -06:00
Brian Paul	09fafd3b85	st/mesa: clean up use of 'target' variable in st_context_teximage() 'target' was used both as a parameter of type st_texture_type and then re-used for GL_TEXTURE_x targets. Rename the function parameter and add a new local 'GLenum target'. And remove an extraneous break statement.	2012-08-24 13:15:27 -06:00
Matt Turner	261719b21c	automake: convert vgapi	2012-08-24 11:08:19 -07:00
Matt Turner	ba4a36d8cd	build: Check for bison-generated file before bailing because of no bison .y/.c was a typo. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 11:08:19 -07:00
Matt Turner	179d8aa331	Move _mesa_dl* functions into dlopen.h and inline them No point in having an extra function call for inlinable functions. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2012-08-24 11:08:19 -07:00
Tapani Pälli	57c57df7b4	mesa/dlopen: use HAVE_DLOPEN instead of _GNU_SOURCE This patch changes Mesa to use 'HAVE_DLOPEN' defined by configure and Android.mk instead of _GNU_SOURCE for detecting dlopen capability. This makes dlopen work on Android as well, where _GNU_SOURCE is not defined. [mattst88] v2: HAVE_DLOPEN is sufficient for including dlfcn.h, remove mingw/blrts checks around dlfcn.h inclusion. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Tapani Pälli <tapani.palli@intel.com>	2012-08-24 11:08:19 -07:00
Matt Turner	df4dccc7a9	build: Only add links to .so files if we're building them Xlib-GLX and OSMesa support static building. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=53962	2012-08-24 11:08:19 -07:00
Matt Turner	c56b57f4a1	build: Add libOSMesa.so.$(VERSION) link to libdir	2012-08-24 11:08:19 -07:00
Matt Turner	a8fd8cb9e7	build: Replace OSMESA_VERSION with generic VERSION_NUMBER Can be used by other modules.	2012-08-24 11:08:19 -07:00
Matt Turner	383a70bf9a	build: Order AC_CONFIG_FILES list Makefiles before .pc files before directories. Alphabetize files of the same type.	2012-08-24 11:08:19 -07:00
Matt Turner	8cdce6c136	build: Only build libmesa.la when needed Namely, for Xlib-GLX, OSMesa, or test programs.	2012-08-24 11:08:19 -07:00
Matt Turner	00f3d9b11a	build: Remove duplicate DRI automake conditionals	2012-08-24 11:08:19 -07:00
Matt Turner	d23b1b7977	build: Remove GLU_DIRS	2012-08-24 11:08:19 -07:00
Matt Turner	0abb26ebff	build: Only generate dispatch assembly code that will be built	2012-08-24 11:08:19 -07:00
Paul Berry	5133bd6585	i965: don't clear resolve map when doing fast depth clears. Previously, when performing a fast depth clear, we would also clear the miptree's resolve map. This destroyed important information, since the resolve map contains information about needed resolves for all levels and layers of the miptree, whereas a depth clear only applies to a single level/layer combination at a time. As a result, resolves would sometimes fail to occur, leading to incorrect rendering. Fixes rendering artifacts with shadow maps in Unigine Heaven and Unigine Sanctuary. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50270 Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-24 09:59:27 -07:00
Paul Berry	4b8b6f385e	i965/HiZ: remove assertion from intel_resolve_map_set(). There are three possible resolve map states for each (level, layer) of a depth miptree: "needs HiZ resolve", "needs depth resolve", and "needs neither". When HiZ was first implemented on i965, any attempt to directly transition between "needs HiZ resolve" and "needs depth resolve" without passing through the "needs neither" state would have been a bug indicating that a necessary resolve hadn't been performed. Accordingly, intel_resolve_map_set() contained an assertion to verify that no such direct transition happened. However, now that we support fast depth clears, there is a valid transition from the "needs HiZ resolve" to the "needs depth resolve" state. When doing a fast depth clear, the old state of the buffer is irrelevant, since we are completely replacing it with the clear value, so it is not necessary to do any resolves before clearing--we can transition, if necessary, directly from the "needs HiZ resolve" state to the "needs depth resolve" state. To avoid spurious assertions in this valid case, this patch just removes the assertion. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-24 09:59:27 -07:00
Christian König	9aacd5cc67	radeonsi: remove old tiling handling Just use the functionality provided by the surface manager instead. This fixes just another bunch of piglit tests. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-24 18:11:31 +02:00
Ian Romanick	86f29cf7d0	mesa/es: Validate glCreateShader targets in Mesa code rather than the ES wrapper v2: Add proper core-profile filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-24 09:06:31 -07:00
Ian Romanick	b042f7a1ff	mesa/es: Validate glGetProgramiv pnames in Mesa code rather than the ES wrapper v2: Add proper core-profile and GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-24 09:06:31 -07:00
Ian Romanick	1a200b68cd	mesa: Filter glGetProgramiv pnames based on available extensions Previously you could always glGetProgramiv one of the transform feedback or geometry shader enums even if the extension wasn't supported. In addition, this reverts part of `bda6ad27`. I think the hunks involving GL_PROGRAM_BINARY_LENGTH_OES were spurious. Mesa has no support for any other part of GL_OES_get_program_binary. v2: Remove redundant return in get_programiv based on review feedback from Matt Turner. v3: Correctly handle UBO related enums. v4: Emit the bad enum in the _mesa_error call based on review feedback from Brian Paul. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-24 09:06:31 -07:00
Brian Paul	9282ebbaa5	swrast: implement cubical depth texture sampling Fixes a few more failures in the piglit copyteximage test. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-24 09:38:44 -06:00
Blaž Tomažič	87280d56a3	clover: Accept CL_MEM_READ_WRITE flag Fix API functions for memory objects to accept CL_MEM_READ_WRITE flag. Signed-off-by: Blaž Tomažič <blaz.tomazic@gmail.com> [ Francisco Jerez: Drop incorrect change in clCreateSubBuffer. ]	2012-08-24 17:10:14 +02:00
Tom Stellard	167ecf5ba3	radeon/llvm: Cleanup R600Instructions.td	2012-08-24 14:14:55 +00:00
Brian Paul	388af5b6f4	main: fix ES compile breakage	2012-08-24 06:40:06 -06:00
Brian Paul	4fec5e9154	mesa/swrast: fix GL_TEXTURE_2D_ARRAY texture fetches for dxt formats As with the previous commit. This fixes the last crash in the piglit copyteximage test but there are still some failures.	2012-08-24 06:18:42 -06:00
Brian Paul	d78b44c265	mesa/swrast: fix GL_TEXTURE_2D_ARRAY texture fetches for latc/rgtc formats Fix-up the texel fetch functions so that they handle 3D coords (as used for array textures) and remove the "f_2d" part from their names. Helps fix swrast crashes in piglit's copyteximage test. More to come.	2012-08-24 06:18:41 -06:00
Brian Paul	fe2cc65fbb	mesa: code movement in teximage.c To get rid of a forward declaration.	2012-08-24 06:18:41 -06:00
Brian Paul	bdff1dfb39	mesa: consolidate glTexImage and glCompressedTexImage code There was a lot of similar or duplicated code before. To minimize this patch's size, use a forward declaration for compressed_texture_error_check(). Move the function in the next patch.	2012-08-24 06:18:41 -06:00
Brian Paul	e93cb4b34f	mesa: make glTexImage, glCompressedTexImage proxy code more alike Next up, we can combine the teximage() and compressed_teximage() functions.	2012-08-24 06:18:41 -06:00
Brian Paul	c1a9e6010b	mesa: rename texpal.[ch] to texcompress_cpal.[ch] To be consistent with other files related to texture compression.	2012-08-24 06:18:41 -06:00
Brian Paul	aab06dc0f0	mesa: s/GLuint/gl_format/ in _mesa_compressed_format_to_glenum() No real change here, just use the right type.	2012-08-24 06:18:41 -06:00
Brian Paul	46751edca9	mesa: new _mesa_num_tex_faces() helper Not a real big help now, but will be useful for the GL_ARB_texture_cube_map_array extension in the future.	2012-08-24 06:18:41 -06:00
Brian Paul	8a935d71ff	mesa: make _mesa_get_proxy_tex_image() static It's not used by any other file.	2012-08-24 06:18:41 -06:00
Brian Paul	637a79aa23	mesa: don't clear proxy image fields when regular GL error is generated If a proxy texture call generates a regular GL error, we should not clear the proxy image's width/height/depth/format fields. Use a new PROXY_ERROR token to distinguish proxy errors from regular GL errors. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-08-24 06:18:41 -06:00
Brian Paul	1f5b1f9846	mesa: fix glTexImage proxy texture error generation When calling glTexImage() with a proxy target most error conditions should generate a GL error. We were erroneously doing the proxy-error behaviour (where we zeroed-out the image's width/height/depth/format fields) in too many places. There's another issue with proxy textures, but that'll be fixed in the next patch. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-08-24 06:18:41 -06:00
José Fonseca	3e3f99277d	draw: Fix regression in draw_set_sampler(_views). draw->samplers(_views) now has PIPE_SHADER_TYPES elements, instead of PIPE_MAX_SAMPLERS as before. Also, shader_stage must be less than PIPE_SHADER_TYPES to prevent buffer overflow. Trivial.	2012-08-24 11:28:00 +01:00
Vadim Girlin	e84d45fdb7	build: don't leave git_sha1.h.tmp after build/install Fixes "`main/git_sha1.h.tmp': Permission denied" build error. See https://bugs.freedesktop.org/show_bug.cgi?id=52064 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-24 11:16:14 +04:00
Tom Stellard	1434a86f50	radeon/llvm: Set End of Program bit on RAT instructions This code was accidentally dropped during the MCCodeEmitter conversion.	2012-08-23 21:54:32 +00:00
Tom Stellard	1bd7b29a66	radeon/llvm: Use correct instruction for moving immediates This should fix an assertion failure that was happening in some compute shaders.	2012-08-23 21:54:32 +00:00
Tom Stellard	2ad8608cb3	radeon/llvm: Fix some coding style issues	2012-08-23 21:54:32 +00:00
Tom Stellard	228a6641cc	radeon/llvm: Pull changes from external version of the backend	2012-08-23 21:54:32 +00:00
Tom Stellard	5a1edb8655	radeon/llvm: Simplify the convert to ISA pass	2012-08-23 21:54:32 +00:00
Tom Stellard	cb5227b403	radeon/llvm: Make sure to use the Text section in the AsmPrinter	2012-08-23 21:54:31 +00:00
Matt Turner	68a2c510a6	build: Fix installation of GLES2 headers Reported-by: U. Artie Eoff <ullysses.a.eoff@intel.com> Tested-by: U. Artie Eoff <ullysses.a.eoff@intel.com>	2012-08-23 14:07:35 -07:00
Matt Turner	fc9ea7c74d	build: Fix GLES linkage with libglapi Reported-by: Ian Romanick <idr@freedesktop.org>	2012-08-23 14:07:35 -07:00
Anuj Phogat	e592f7df03	i965/msaa: Add sample-alpha-to-coverage support for multiple render targets Render Target Write message should include source zero alpha value when sample-alpha-to-coverage is enabled for an FBO with multiple render targets. Source zero alpha value is used as fragment coverage for all the render targets. This patch makes piglit tests draw-buffers-alpha-to-coverage and alpha-to-coverage-no-draw-buffer-zero to pass on Sandybridge. No regressions are observed with piglit all.tests. V2: Revert all the changes made in emit_color_write() function to include src0 alpha for targets > 0. Now handling this case in a if block. V3: Correctly calculate the instruction length for buffer zero. Properly handle the case of dual_src_blend when alpha-to-coverage is enabled. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Paul Berry <stereotype441@gmail.com>	2012-08-23 13:30:54 -07:00
Stéphane Marchesin	ff996cafce	glsl/linker: Avoid buffer over-run in parcel_out_uniform_storage::visit_field When too many uniforms are used, the error will be caught in check_resources (src/glsl/linker.cpp). NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Stéphane Marchesin <marcheu@chromium.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Tested-by: Benoit Jacob <bjacob@mozilla.com>	2012-08-23 11:42:19 -07:00
Ian Romanick	9b028faeaa	mesa/es: Validate glCompressedTexSubImage internalFormat in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:31 -07:00
Ian Romanick	dd0eb00487	mesa/es: Validate glCompressedTexImage internalFormat in Mesa code rather than the ES wrapper v2: Add proper core-profile filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:31 -07:00
Ian Romanick	c11096e94a	mesa/es: Validate glCopyTexImage internalFormat in Mesa code rather than the ES wrapper v2: Add GLES3 filtering. I'm not 100% sure this is correct. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:31 -07:00
Ian Romanick	9848e86af0	mesa/es: Validate glTexSubImage format and type in Mesa code rather than the ES wrapper v2: Add proper GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:31 -07:00
Ian Romanick	409620e477	mesa/es: Validate glTexImage format, type, and internalFormat in Mesa code rather than the ES wrapper v2: Add proper GLES3 filtering. v3: Collapse ALPHA, LUMINANCE, and LUMINANCE_ALPHA cases per review comment from Ken. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:31 -07:00
Ian Romanick	0686ccac95	mesa/es: Validate glTexImage border in Mesa code rather than the ES wrapper Also validate glCopyTexImage border. This fixes a bug in the APIspec. Previously glTexImage3DOES could be passed a non-zero border without error. NOTE: This is a candidate for stable release branches. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:31 -07:00
Ian Romanick	59d965333c	mesa: Generate an error when glCopyTexImage border is invalid NOTE: This is a candidate for stable release branches. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:30 -07:00
Ian Romanick	2dcb40bb44	mesa/es: Add support for GL_APPLE_texture_max_level This is desktop OpenGL functionality that has always existed. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:30 -07:00
Ian Romanick	c9689e3e55	mesa/es: Validate glGetTexParameter pnames in Mesa code rather than the ES wrapper This also adds a missing extension (and API) check around GL_TEXTURE_CROP_RECT_OES. v2: Add proper core-profile and GLES3 filtering. GL_TEXTURE_MAX_LEVEL is (incorrectly) accepted in ES contexts. A future patch will add GL_APPLE_texture_max_level, and meta really needs this. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:30 -07:00
Ian Romanick	b3dd524a10	mesa/es: Validate glTexParameter pnames in Mesa code rather than the ES wrapper This also adds a missing extension (and API) check around GL_TEXTURE_CROP_RECT_OES. v2: Add proper core-profile, GLES1, and GLES3 filtering. GL_TEXTURE_MAX_LEVEL is (incorrectly) accepted in ES contexts. A future patch will add GL_APPLE_texture_max_level, and meta really needs this. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:30 -07:00
Ian Romanick	4269cace79	mesa/es: Remove redundant glBindTexture target validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:30 -07:00
Ian Romanick	3f7c8364cf	mesa: Filter glBindTexture targets based on supported features. Fixed the piglit test arb_texture_buffer_object-negative-unsupported. NOTE: This is a candidate for stable release branches. v2: Add proper core-profile and GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:30 -07:00
Ian Romanick	530c9d764b	mesa/es: Validate tex image targets in Mesa code rather than the ES wrapper This should take care of all the TexImage, TexSubImage, CopyTexImage, CompressedTexImage3DOES, and CopyTexSubImage type paths. v2: Add proper core-profile and GLES3 filtering. v3: Squash the CompressedTexImage3DOES patch per review comment from Ken. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:30 -07:00
Ian Romanick	ea9b212fca	mesa/es: Validate EGLImageTargetTexture2DOES target in Mesa code rather than the ES wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:30 -07:00
Ian Romanick	a0595cb450	mesa/es: Validate glTexParameter targets in Mesa code rather than the ES wrapper Ditto for glGetTexParameter targets. v2: Add proper core-profile and GLES3 filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:30 -07:00
Ian Romanick	842efb9447	mesa/es: Validate GL_TEXTURE_WRAP param in Mesa code rather than the ES wrapper v2: Add proper core-profile filtering. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:29 -07:00
Ian Romanick	d53101a9f3	mesa: Refactor validate_texture_wrap_mode to use a switch-statement This makes the next couple changes a little easier. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-23 10:15:29 -07:00
Ian Romanick	2abf555496	meta: Don't modify GL_GENERATE_MIPMAP state when it doesn't exist This is a bit of a hack. _mesa_meta_GenerateMipmap shouldn't even be used in contexts where GL_GENERATE_MIPMAP doesn't exist (i.e., core profile and ES2) because it uses fixed-function, and fixed-function doesn't exist there either! A GLSL-based _mesa_meta_GenerateMipmap should be available soon. When that is available, this patch will be irrelevant and should be reverted. v2: Change (ctx->API != API_OPENGLES2 && ctx->API != API_OPENGL_CORE) to (ctx->API == API_OPENGL || ctx->API == API_OPENGLES) based on review comment from Brian Paul. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-23 10:15:29 -07:00
Tapani Pälli	2ddfca9837	build/glsl: fix android build v2 Commit `77a3efc6b9` broke android build that sets its own value for GLSL_SRCDIR before including Makefile.sources. Patch moves overriding the value after include, this works as GLSL_SRCDIR variable gets expanded only later. Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Tapani Pälli <tapani.palli@intel.com>	2012-08-23 10:13:38 -07:00
Matt Turner	a6b8b709cd	automake: convert es1api	2012-08-23 09:40:06 -07:00
Matt Turner	0f8110cb0c	automake: convert es2api	2012-08-23 09:38:32 -07:00
Vadim Girlin	68d6441930	st/dri: pass config options to the state tracker Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-08-23 19:57:51 +04:00
Vadim Girlin	a6457c0692	st/mesa: accept and handle configuration options from st/dri Currently there is a single option - force_glsl_extensions_warn. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-08-23 19:57:51 +04:00
Vadim Girlin	44f69fc825	st/dri: add force_glsl_extensions_warn option to dri options Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-08-23 19:57:51 +04:00
Vadim Girlin	e7c177ec9e	st/dri: use driver name for driconf section lookup The name is taken from the driver_descriptor, so it will be the same as expected by driconf utility. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-08-23 19:57:51 +04:00
Vadim Girlin	6547733593	swrast: add DRM_DRIVER_DESCRIPTOR to store driver name	2012-08-23 19:57:50 +04:00
Paulo Alcantara	b41f36bde7	egl_dri2: Fix segmentation fault The segmentation fault occurs when DRI2 is not loaded up and dri2_setup_screen() function dereferences dri2_dpy->dri2 (since it's NULL at this point). This patch fixes the segmentation fault by checking if dri2 pointer is not NULL before dereferencing it. Signed-off-by: Paulo Alcantara <pcacjr@profusion.mobi> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-08-23 09:17:23 -06:00
Tom Stellard	90bd1d52bb	radeon/llvm: Use the MCCodeEmitter for R600	2012-08-23 15:00:48 +00:00
Tom Stellard	235318a578	radeon/llvm: Use the MCCodeEmitter for SI	2012-08-23 15:00:48 +00:00
Tom Stellard	2de24024c1	radeon/llvm: Set 64BitPtr feature bit for SI	2012-08-23 15:00:48 +00:00
Tom Stellard	3f9b6aa0f4	radeon/llvm: Lower RETFLAG DAG Node to S_ENDPGM on SI	2012-08-23 15:00:48 +00:00
Tom Stellard	e30b4644b6	radeon/llvm: Add AsmPrinter	2012-08-23 15:00:48 +00:00
Tom Stellard	e61c54cb6b	radeon/llvm: Mark JUMP as a pseudo instruction	2012-08-23 15:00:48 +00:00
Tom Stellard	ead72204f1	radeon/llvm: Remove the last uses of MachineOperand flags	2012-08-23 15:00:47 +00:00
Tom Stellard	67a47a445b	radeon/llvm: Add flag operand to some instructions This new operand replaces the MachineOperand flags in LLVM, which will be deprecated soon. Eventually all instructions should have a flag operand, but for now this operand has only been added to instructions that need it.	2012-08-23 15:00:47 +00:00
Tom Stellard	3a7a56e7aa	radeon/llvm: Encapsulate setting of MachineOperand flags MachineOperand flags will be removed soon, so it is convenient to have only one function that modifies them.	2012-08-23 15:00:47 +00:00
Matt Turner	bee2edbf3d	build: Link DRI drivers with dricore in case of no direct rendering Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-22 11:08:06 -07:00
Matt Turner	bfd7d6f58b	build: Only build libmesagallium.la if building Gallium Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-22 11:08:06 -07:00
Matt Turner	f9786394e5	build: Clean glx Makefile.am mapi/glapi is already built when make is run in src/glx. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-22 11:08:06 -07:00
Matt Turner	d9b109892d	build: Put mapi/shared-glapi in CORE_DIRS SRC_DIRS was overwritten (visible in the second hunk). Also don't require mapi/shared-glapi to be built for GLES. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-22 11:08:06 -07:00
Matt Turner	8c9b78aad1	build: Only allow shared-glapi with DRI Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-22 11:08:06 -07:00
Matt Turner	32e8ce6d24	build: Set sensible DRI/X11/OSMesa defaults Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-22 11:08:06 -07:00
Matt Turner	53248e5f95	build: Print whether shared-glapi is enabled Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-22 11:08:06 -07:00
Matt Turner	625651cf81	build/x11: Force usage of C++ linker Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-22 11:08:06 -07:00
Matt Turner	9049b7f0fa	build/x11: Don't link against shared-glapi Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-22 11:08:06 -07:00
Matt Turner	be5fe7b320	build: Remove deprecated --with-driver= flag Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-22 11:08:06 -07:00
Christian König	302c66ff81	radeonsi: rework vertex format handling Preventing piglit's draw-vertices test from hanging the GPU. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-22 15:33:54 +02:00
Christian König	07838603b9	radeonsi: fix SPI_PS_INPUT_ENA handling We need to enable at least one interpolation mode, otherwise the GPU will hang. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-22 15:33:49 +02:00
Vadim Girlin	8d1a9a984f	r600g: fix lockups with dual_src_blend v2 Disable blending when dual_src_blend is enabled and number of color exports in the current fragment shader is less than 2. Fixes lockups with ext_framebuffer_multisample-alpha-to-coverage-dual-src-blend piglit test. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-08-22 12:12:22 +04:00
Jakob Bornecrantz	c4610e9f92	st/dri: Add shared usage on buffers created Tested-by: Scott Moreau <oreaus@gmail.com> Signed-off-by: Jakob Bornecrantz <jakob@vmware.com>	2012-08-22 00:01:28 +02:00
Jakob Bornecrantz	61e95b8a5f	gbm: Add shared usage on images created Tested-by: Scott Moreau <oreaus@gmail.com> Signed-off-by: Jakob Bornecrantz <jakob@vmware.com>	2012-08-22 00:01:28 +02:00
Anuj Phogat	df2c4cbced	mesa: Fix generic compressed texture formats' handling in glTexImage/glCopyTexImage The generic texture formats should be accepted by the <internalformat> parameter of TexImage1D, TexImage2D, TexImage3D, CopyTexImage1D, and CopyTexImage2D functions. When the application specifies a generic format, the driver is free to pick an uncompressed format. This patch reverts the changes due to following commit: commit `a36581ccc0` mesa: do more teximage error checking for generic compressed formats This patch fixes compressed texture format failures in intel oglconform pxconv-gettex test case: https://bugs.freedesktop.org/show_bug.cgi?id=47220 Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-21 15:00:06 -07:00
Tom Stellard	1cb07bd3b8	radeon/llvm: ExpandSpecialInstrs - Add support for cube instructions	2012-08-21 15:42:44 +00:00
Tom Stellard	6c99f2101f	radeon/llvm: ExpandSpecialInstrs - Add support for vector instructions	2012-08-21 15:42:44 +00:00
Tom Stellard	82a5d0c641	radeon/llvm: Add R600ExpandSpecialInstrs pass This pass expands reduction instructions into a MachineInstrBundle that contains 4 instructions, one for each instruction slot.	2012-08-21 15:42:44 +00:00
Tom Stellard	0588298575	radeon/llvm: Add helper function for getting sub reg indices	2012-08-21 15:42:44 +00:00
Michel Dänzer	1a25ebe3ce	radeonsi: Handle NULL sampler views getting passed in by the state tracker. Don't dereference NULL pointers, and if all views are NULL, don't generate an invalid PM4 packet which locks up the GPU. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2012-08-21 15:42:25 +02:00
Ian Romanick	c1114c619a	APIspec: Remove cruft about AMD_compressed_???_texture Mesa doesn't support these extensions, and it seems unlikely that it ever will Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:34 -07:00
Ian Romanick	4c32ee5bca	mesa/es: Remove redundant glFramebufferTexture3D textarget validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:34 -07:00
Ian Romanick	7c9afe50fd	mesa/es: Remove redundant glGetShaderiv pname validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:34 -07:00
Ian Romanick	aaef441638	mesa/es: Remove redundant glCompressedTexImage border validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:33 -07:00
Ian Romanick	d39cb8e9ef	mesa/es: Remove redundant glPointSizePointer type validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:33 -07:00
Ian Romanick	d54004c352	mesa/es: Remove redundant glGetBufferPointer pname validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:33 -07:00
Ian Romanick	68d7ce3e9e	mesa/es: Remove redundant glGetVertexAttribPointer pname validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:33 -07:00
Ian Romanick	5be5cf6934	mesa/es: Remove redundant element type validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:33 -07:00
Ian Romanick	b99a8caff1	mesa/es: Remove redundant glGetShaderPrecisionFormat shader type validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:33 -07:00
Ian Romanick	c914ac239e	mesa/es: Remove redundant depth func validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:33 -07:00
Ian Romanick	af276d9d4b	mesa/es: Remove redundant stencil op fail/zfail/zpass validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:33 -07:00
Ian Romanick	f3f993153c	mesa/es: Remove redundant shade model mode validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:33 -07:00
Ian Romanick	5a193557d1	mesa/es: Remove redundant light pname and light validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:32 -07:00
Ian Romanick	0234410791	mesa/es: Remove redundant hint mode validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:32 -07:00
Ian Romanick	a4251da3b2	mesa/es: Remove redundant separate stencil face validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:32 -07:00
Ian Romanick	9113d0e686	mesa/es: Remove redundant stencil function validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:32 -07:00
Ian Romanick	1087745afe	mesa/es: Remove redundant logic op operand validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:32 -07:00
Ian Romanick	bf03589882	mesa/es: Remove redundant alpha function validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:32 -07:00
Ian Romanick	8f55d83569	mesa/es: Remove redundant separate stencil mask face validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:32 -07:00
Ian Romanick	52d57985c6	mesa/es: Remove redundant front-face mode validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:32 -07:00
Ian Romanick	e1dbf56a10	mesa/es: Remove redundant face culling mode validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:32 -07:00
Ian Romanick	66404557db	mesa/es: Remove redundant blend equation mode validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:06:31 -07:00
Ian Romanick	e39ea674d0	mesa/es: Remove redundant texture target validation Mesa doesn't check the parameter passed to glMultiTexCoord*. It does, however, mask the texture value to prevent out-of-bounds writes. This patch will promote this non-conformant behavior to OpenGL ES 1. I don't think anyone will care, and this gets some silly code out of a hot path. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 16:05:11 -07:00
Ian Romanick	386e2f3289	mesa/es: Rearrange placement of GL_TEXTURE_MAX_ANISOTROPY_EXT in APIspec Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 09:52:45 -07:00
Ian Romanick	27e55805fb	mesa/es: Remove redundant min/mag filter validation Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-20 09:52:27 -07:00
Mathias Fröhlich	926a4a922f	radeon-llvm: Start multithreaded before using llvm. This is required to make some of llvm's api calls thread safe. In particular the PassRegistry, which is implicitly accessed while compiling shader programs. The PassRegistry uses a mutex that is only active if the llvm_is_multithreaded() returns true. Calling llvm_start_multithreading() makes this happen and by calling this function we try to make sure that we can safely compile shaders in parallel. Since there is also a call llvm_stop_multithreading() in the llvm api, we cannot guarantee that this does not get switched off while we are relying on this being set, but for the easier use cases this fixes a race with the radeon llvm compiler we have as of today. Signed-off-by: Mathias Froehlich <Mathias.Froehlich@web.de> Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-20 16:27:23 +00:00
archibald	59361d76a5	r600g: Move common compute/3D register init to its own function Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-20 15:35:09 +00:00
Christoph Bumiller	c51f8e2790	nv50/ir/tgsi: handle DP2 in tgsi Instruction srcMask Solved by Tiziano Bacocco on IRC.	2012-08-18 17:38:56 +02:00
Christoph Bumiller	f3a7be740d	nv50/ir/emit: don't forget saturation bit on f32 add immediate Solved by Maxim Levitsky on IRC.	2012-08-18 17:38:45 +02:00
Tilman Sauerbeck	d0ace4e949	mesa: use #if over #ifdef in the FEATURE_ES1 check to fix a build failure. mfeatures.h will define FEATURE_ES1 to 0 if it's not defined yet. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=53664 Signed-off-by: Brian Paul <brianp@vmware.com>	2012-08-18 07:53:54 -06:00
Brian Paul	5b542681dc	st/mesa: fix sampler view counting In the past, when we called pipe::set_sampler_views(n) the drivers set samplers [n..MAX] to NULL. We no longer do that. The state tracker code was already trying to set unused sampler views to NULL to cover that case, but the logic was broken and unnoticed until now. This patch fixes it. Strictly speaking, this patch shouldn't be necessary. Drivers should simply ignore unused samplers and sampler views. But some drivers like llvmpipe (and others?) count those things and they figure into state validation. That could be fixed in the future. Fixes http://bugs.freedesktop.org/show_bug.cgi?id=53617 Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-08-18 07:40:10 -06:00
Brian Paul	d65eb02537	util: update and fix u_upload_mgr.h comments	2012-08-18 07:39:52 -06:00
Brian Paul	84e5cb37d3	st/mesa: use Elements() instead of hard-coded number And add a comment about the velems_util_draw[] array.	2012-08-18 07:39:52 -06:00
Brian Paul	1a9e4d5113	mesa: remove unused params, add const qualifiers Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-18 07:39:52 -06:00
Brian Paul	a6af24ee14	mesa: querying GL_TEXTURE_COMPRESSED_IMAGE_SIZE for a buffer obj is illegal GL_INVALID_OPERATION is to be raised when querying a non-compressed image/buffer. Since a buffer object can't have a compressed format this query always generates an error. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-18 07:39:51 -06:00
Ian Romanick	34472a0d87	mesa/es: Don't generate ES1 type conversion wrappers These are gradually going to get whittled away and eventually folded into the source files with the native type functions. v2: Add (speculative) SConscript changes. These may be broken. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-17 18:12:20 -07:00
Eric Anholt	d707e337f5	i965: Fix bug in the old FS backend's projtex() calculation. In the old backend, we looked at any FS attribute's proj_attrib_mask bits, not just texcoords. Now that we have _mesa_vert_result_to_frag_attrib(), we can fill in the other FS inputs with correct proj_attrib_mask info. NOTE: This is a candidate for stable branches. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=46644 Signed-off-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-17 10:05:34 -07:00
Kenneth Graunke	3df13b32e5	mesa: Support GL_TEXTURE_BUFFER in GetTexLevelParameter[if]v in GL 3.1+. The OpenGL 3.1 specification explicitly allows this. Oddly, the ARB_texture_buffer_object spec's issues section claims this isn't allowed, but proceeds to explain that the extension simply doesn't edit the underlying spec to allow it, and thus it didn't appear in the list of legal texture targets. Thus, this patch legalizes it only in 3.1+ contexts, but still returns INVALID_ENUM in earlier contexts that expose ARB_texture_buffer_object. Unfortunately, the behavior of the call is horrendously undefined. Fixes oglconform's tbo/negative.textureParams test. v2: Require desktop OpenGL. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Eric Anholt <eric@anholt.net> Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-17 09:14:36 -07:00
Kenneth Graunke	8c37fc1e92	mesa: Split out part of glGetTexLevelParameter into a helper function. Move the _mesa_GetTexLevelParameter[iv] functions below the helper function so the prototype is available. This will be useful in the next commit. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-17 09:14:36 -07:00
Kenneth Graunke	58d11524da	mesa: Add GL_TEXTURE_CUBE_MAP to _mesa_max_texture_levels(). [v2] For cube maps, _mesa_generate_mipmap() calls this with GL_TEXTURE_CUBE_MAP (the gl_texture_object's Target) rather than one of the faces. This caused _mesa_max_texture_levels() to return 0, which resulted in maxLevels == -1 and the next line's assertion to fail. This function is called from seven places: - fbobject.c: framebuffer_texture() - mipmap.c: _mesa_generate_mipmap() - texgetimage.c: - getteximage_error_check() - getcompressedteximage_error_check() - texparam.c: _mesa_GetTexLevelParameteriv() - texstorage.c: tex_storage_error_check() All of these (or their callers) now explicitly check for invalid targets already, so this shouldn't cause invalid targets to slip through. (Technically _mesa_generate_mipmap() doesn't check for invalid targets, but the API-facing _mesa_GenerateMipmapEXT() function does.) +2 oglconforms (float-texture/mipmap.automatic and mipmap.manual) In addition to fixing the mipmap bug, it should also cause glTexStorage to accept GL_TEXTURE_CUBE_MAP, which is explicitly allowed by the spec. v2: Drop alterations to callers; this is now in a patch series that adds explicit checking to API functions. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-17 09:14:36 -07:00
Kenneth Graunke	9e4fde85e4	mesa: Add explicit target checking to GetTexLevelParameter[if]v(). Previously, it relied on _mesa_max_texture_levels() for texture target error checking. This was somewhat dodgy, as _mesa_max_texture_levels() is called in seven different places, not all of which necessarily accept the same list of targets. I copied the list of legal targets from _mesa_max_texture_levels(), so this patch should not introduce any change in behavior. Future patches will cause the two to diverge. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-17 09:14:36 -07:00
Kenneth Graunke	63396ce4c0	mesa: Add explicit target checking to Get[Compressed]TexImage(). Previously, they relied on _mesa_max_texture_levels() for texture target error checking. This was somewhat dodgy, as _mesa_max_texture_levels() is called in seven different places, not all of which necessarily accept the same list of targets. I copied the list of legal targets from _mesa_max_texture_levels() but removed the proxy targets, as both functions explicitly rejected those targets. This changes the order in which we check errors, which could change whether we return INVALID_VALUE or INVALID_ENUM. However, it shouldn't change the list of accepted targets. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-17 09:14:36 -07:00
Brian Paul	f69273f952	llvmpipe: remove polygon stipple assertion It's possible for us to have an unused sampler bound when the fragment shader itself doesn't use any samplers. So the assertion isn't valid. Fixes http://bugs.freedesktop.org/show_bug.cgi?id=53616	2012-08-17 09:07:49 -06:00
Brian Paul	553a08d314	svga: minor code reformatting To be consistent with other functions.	2012-08-16 17:03:43 -06:00
Matt Turner	81ba2c53b6	build: Remove -shared from OSMesa's LDFLAGS Would break the static build.	2012-08-16 15:04:54 -07:00
Matt Turner	d12b07eb1a	build: Remove EXTRA_LIB_PATH You can add extra library paths to LDFLAGS directly.	2012-08-16 15:04:54 -07:00
Matt Turner	e273ed37ea	build: Require X11 pkg-config files	2012-08-16 15:04:53 -07:00
Marek Olšák	f36c404f90	r600g: disable tiling for 422 formats again	2012-08-16 20:44:54 +02:00
Marek Olšák	795834432b	r600g: fix blits of subsampled formats	2012-08-16 20:44:54 +02:00
Marek Olšák	6fd9218bb4	r600g: fix copying between NPOT mipmapped compressed textures We aligned the dimensions to the blocksize, then divided by it (in r600_blit.c), then minified, which was wrong. The minification must be done first, not last. This fixes piglit/fbo-generatemipmap-formats with S3TC and maybe a bunch of other tests too. Tested on RV730.	2012-08-16 20:44:54 +02:00
Marek Olšák	b8e9cf5d96	r600g: make F2U trans-only on r600-r700 This fixes a failing assertion in r600_asm.c.	2012-08-16 20:44:53 +02:00
Marek Olšák	0d7e002815	r600g: set CB_COLOR_INFO to INVALID for disabled colorbuffers on r600-r700 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-16 20:44:53 +02:00
Marek Olšák	951ac46a6a	r600g: rename r600_resource_texture to r600_texture	2012-08-16 20:44:53 +02:00
Marek Olšák	952c905767	r600g: always put tiled textures in VRAM	2012-08-16 20:44:53 +02:00
Marek Olšák	773ff5705f	r600g: cleanup r600_resource_texture in favor of radeon_surface	2012-08-16 20:44:53 +02:00
Marek Olšák	362a25aac5	r600g: remove unused parameter in r600_texture_create_object	2012-08-16 20:44:53 +02:00
Marek Olšák	c4993d15eb	r600g: fixup the usage flag for the flushed depth texture	2012-08-16 20:44:53 +02:00
Philipp Brüschweiler	0efd564a09	wayland-drm: close fd after the display is uninitialized This fixes a "kernel rejected pushbuf: Bad file descriptor" error on wl_drm display destruction. Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2012-08-16 13:17:06 -04:00
José Fonseca	50dec63790	scons: Fix MinGW cross compilation. Compensate for the recent changes and assumptions added to Makefiles.sources	2012-08-16 17:21:52 +01:00
Tom Stellard	5f82d19248	radeon/llvm: Lower implicit parameters before ISel	2012-08-16 16:04:51 +00:00
Brian Paul	0d308ef8fe	gallium/draw: move misplaced brace	2012-08-16 09:16:42 -06:00
Brian Paul	f6b7157550	mesa: raise GL_INVALID_OPERATION in glGenerateMipmap for missing base image This seems to be expected by the WebGL texture-mips test. The error makes sense, but I haven't found (yet) any OpenGL documentation specifying this error condition. See http://bugs.freedesktop.org/show_bug.cgi?id=44912 Note: This is a candidate for the 8.0 branch. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-16 09:11:14 -06:00
Brian Paul	d663a557fd	r600: update sampler, sampler_view code for the future For when we have pipe->set_sampler_states(pipe, shader, start, num, samplers), etc. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-08-16 09:01:31 -06:00
Brian Paul	10e552d056	rbug: update data structures, functions for future changes To support geom/compute/etc shaders, samplers, sampler views, etc. To support pipe->bind_sampler_states() w/ start_slot.	2012-08-16 09:01:31 -06:00
Brian Paul	109e87dc6a	gallium/trace: add 'start' parameter to bind_sampler_states/views()	2012-08-16 09:01:31 -06:00
Brian Paul	d4ab8bd095	gallium/identity: add 'start' parameter to bind_sampler_states/views()	2012-08-16 09:01:31 -06:00
Brian Paul	f3cc4990a0	galahad: add 'start' parameter to bind_sampler_states/views()	2012-08-16 09:01:31 -06:00
Brian Paul	bd3733c0be	svga: add 'start' parameter to bind_sampler_states/views()	2012-08-16 09:01:31 -06:00
Brian Paul	c969cb1447	llvmpipe: add 'start' parameter to bind_sampler_states/views()	2012-08-16 09:01:31 -06:00
Brian Paul	25a42f39e3	softpipe: add 'start' parameter to bind_sampler_states/views() To support updating a sub-range of sampler states/views in the future. Note that we always pass start=0 at this time.	2012-08-16 09:01:31 -06:00
Brian Paul	348ac08bfd	gallium/trace: consolidate sampler, sampler_view code	2012-08-16 09:01:31 -06:00
Brian Paul	0ad95b923a	gallium/identity: consolidate sampler, sampler_view code This will simplify things when the pipe_context functions are consolidated.	2012-08-16 09:01:31 -06:00
Brian Paul	f3c3aff6ef	st/mesa: add support for GS textures and samplers	2012-08-16 09:01:31 -06:00
Brian Paul	6c8a132158	st/mesa: combine vertex/fragment sampler state in arrays As with other recent changes, put the vertex and fragment sampler state into arrays indexed by the shader type. This will let us easily add support for other types of shaders in the future.	2012-08-16 09:01:31 -06:00
Brian Paul	cab2fed135	gallium: remove PIPE_MAX_VERTEX/GEOMETRY_SAMPLERS #define PIPE_MAX_SAMPLERS, PIPE_MAX_VERTEX_SAMPLERS and PIPE_MAX_GEOMETRY_SAMPLERS were all defined to the same value (16). In various places we're creating arrays such as sampler_views[PIPE_SHADER_TYPES][PIPE_MAX_SAMPLERS] so we were assuming the same number of max samplers for all shader stages anyway. Of course, drivers are still free to advertise different numbers of max samplers for different shaders.	2012-08-16 09:01:31 -06:00
Brian Paul	a2c1df4c9a	draw: index samplers and sampler_view state by shader type So that we can handle GS state and other types of shaders in the future.	2012-08-16 09:01:31 -06:00
Brian Paul	bef196c792	draw: move tgsi-related state into a tgsi sub-struct To better organize things a bit.	2012-08-16 09:01:31 -06:00
Brian Paul	df87fb5913	gallium: add a shader stage/type param to some draw functions To prepare for geometry shader texture support in the draw module. Note: we still only handle the vertex shader case.	2012-08-16 09:01:31 -06:00
Brian Paul	a8ed00d5f1	st/mesa: silence signed/unsigned comparison warning	2012-08-16 09:00:08 -06:00
Brian Paul	d733e5da9c	svga: move result->key expression after result != NULL check	2012-08-16 08:58:55 -06:00
Brian Paul	50188adf7d	svga: fix result==NULL logic in emit_fs_consts() The previous test for result != NULL was kind of bogus since we dereferenced the pointer earlier in the code. Now, check for result != NULL first, then get the result->key info. Also, remove the useless "offset +=" code at the end.	2012-08-16 08:58:55 -06:00
Brian Paul	d55e0f1ba0	svga: update comment (s/SVGA_NEW_VS_RESULT/SVGA_NEW_VS_PRESCALE/)	2012-08-16 08:58:55 -06:00
Brian Paul	2a5eeeaebe	svga: rename svga_hw_vs_parameters -> svga_hw_vs_constants and similarly for svga_hw_fs_parameters	2012-08-16 08:58:55 -06:00
Niels Ole Salscheider	8cc1860d4a	st/mesa: index can be negative in the PROGRAM_CONSTANT case NOTE: This is a candidate for the 8.0 branch. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-16 08:56:09 -06:00
Brian Paul	fd41cbc557	mesa: add cast to silence warning in _mesa_pack_rgba_span_from_ints()	2012-08-16 08:55:48 -06:00
Brian Paul	658044cde1	meta: remove unused variable	2012-08-16 08:53:55 -06:00
Michel Dänzer	1b11395a36	radeonsi: Fix symbol conflicts with r600g. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50389 Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-16 12:01:16 +02:00
Michel Dänzer	51d9f37a72	radeonsi: Fix memory leaks if returning early from some state functions. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-16 11:58:24 +02:00
Michel Dänzer	4b64fa2ff1	radeonsi: Fix LLVM context leak. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-16 11:58:24 +02:00
Michel Dänzer	18abc270c5	gallium/radeon: Don't assign virtual address space for BO that already has one. We'd end up re-using the old one and throwing away the new one anyway, but only after a roundtrip to the kernel. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-16 11:58:24 +02:00
Michel Dänzer	a60be05284	gallium/radeon: Create hole for waste when allocating from va_offset. Otherwise, the wasted area could never be used for an allocation again. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-16 11:58:24 +02:00
Michel Dänzer	1f455ef5bc	gallium/radeon: Fix potential address space loss in radeon_bomgr_force_va(). Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-16 11:58:23 +02:00
Michel Dänzer	6d59b7f6dc	gallium/radeon: Delete uppermost virtual address space hole if it's at the top. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-16 11:58:23 +02:00
Michel Dänzer	f5fe81daea	gallium/radeon: Fix losing holes when allocating virtual address space. If a hole exactly matches the allocated size plus alignment, we would fail to preserve the alignment as a hole. This would result in never being able to use the alignment area for an allocation again. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-16 11:58:23 +02:00
Michel Dänzer	206d07625c	gallium/radeon: Merge holes when freeing virtual address space. Otherwise we'll likely end up with an ever increasing amount of ever smaller holes. Requires keeping the list ordered wrt offsets. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-16 09:39:36 +02:00
Michel Dänzer	c25968f3e2	gallium/radeon: Make va_offset 64 bits wide. Otherwise we'd wrap around after 32 bits. The kernel currently limits GPU virtual address space to 4GB anyway, but that will probably change sooner or later, and this would result in confusing error messages when running out of virtual address space even now. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-16 09:37:33 +02:00
Vinson Lee	1597176f70	llvmpipe: Silence Coverity incorrect sizeof expression defect. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-15 22:15:49 -07:00
Vinson Lee	3d6892c479	scons: Add option to enable floating-point textures. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-15 22:04:24 -07:00
Dave Airlie	6a3ac03f2b	glx/dri2: add dri2 prime support. This adds support for having libGL pick a different driver for prime support. DRI_PRIME env var is set to the value retrieved from the server randr provider calls, by the calling process. (generally DRI_PRIME=1 will be the right answer). Signed-off-by: Dave Airlie <airlied@redhat.com>	2012-08-16 10:02:10 +10:00
Vincent Lejeune	565a4e2a86	radeon/llvm: Enable if-cvt Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-15 21:07:14 +00:00
Vincent Lejeune	a614979286	radeon/llvm: Add callbacks needed by if-cvt Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-15 21:07:14 +00:00
Vincent Lejeune	0eca5fd919	radeon/llvm: Lower branch/branch_cond into predicated jump Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-15 21:07:14 +00:00
Vincent Lejeune	6db2e9fdb0	radeon/llvm: Add a predicated JUMP instruction Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-15 21:07:13 +00:00
Vincent Lejeune	8263408a91	radeon/llvm: Support for predicate bit Tom Stellard: - A few changes to predicate register defs Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-15 21:07:13 +00:00
Vincent Lejeune	8f597d57e9	r600g: Glue to handle predicate aware output from llvm Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-15 21:07:13 +00:00
Vincent Lejeune	72f7632c6b	r600g: Fix instruction group merge when there are predicated insts. Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-15 21:07:13 +00:00
Vincent Lejeune	56227f875b	radeon/llvm: Do not use PV/PS if PRED_SEL does not match Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-15 21:07:13 +00:00
Vincent Lejeune	da676eab93	r600g: Add support for predicates Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-15 21:07:13 +00:00
Christian König	cf76edd300	radeonsi: move ps sampler state into PM4 stream Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-15 22:13:19 +02:00
Christian König	ec5b698525	radeonsi: move ps sampler views into PM4 stream Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-15 22:13:19 +02:00
Christian König	54de6f452c	radeonsi: move vertex state descriptors into PM4 stream Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-15 22:13:19 +02:00
Christian König	f2c95d93db	radeonsi: add shader data infrastructure With this we can embed data for the shaders (like resource descriptors) into the PM4 stream. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-15 22:13:19 +02:00
Christian König	4444b9d1ec	radeon/llvm: add support to fetch temps as vectors Necessary for texture fetches with temp regs as source on SI. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-15 22:13:19 +02:00
Tom Stellard	b6051bc785	radeon/llvm: Remove AMDGPUUtil.cpp	2012-08-15 18:35:26 +00:00
Apostolos Bartziokas	040c2e0456	radeon/llvm: Cleanup AMDGPUUtil.cpp	2012-08-15 18:35:25 +00:00
Tom Stellard	3aaa209293	radeon/llvm: Lower loads from USE_SGPR address space during DAG lowering	2012-08-15 18:35:25 +00:00
Tom Stellard	40c41fe890	radeon/llvm: Add live-in registers during DAG lowering Pseudo instructions emulating live-in registers have been removed and their corresponding intrinsics are now being lowered during DAG lowering.	2012-08-15 18:35:25 +00:00
Tom Stellard	f3480f9234	radeon/llvm: Lower store_output intrinsic during DAG lowering	2012-08-15 18:35:25 +00:00
Tom Stellard	a76a0f7422	radeon/llvm: Force VTX_READ instructions to use same reg for src and dst I was seeing some GPU hangs that seemed to be caused by ALU instructions writing to the same register used as the source for VTX_READ. Adding this constraint to the VTX_READ instructions avoids this situation.	2012-08-15 18:35:25 +00:00
Marek Olšák	97b4b97b2f	radeonsi: fix build breakage after u_blitter changes	2012-08-15 20:03:37 +02:00
Marek Olšák	e0cc61bd91	gallium/u_blitter: document custom meta helpers	2012-08-15 19:20:58 +02:00
Marek Olšák	b3b5bb9ddb	r600g: disable handling of DISCARD_RANGE https://bugs.freedesktop.org/show_bug.cgi?id=53130	2012-08-15 19:20:58 +02:00
Marek Olšák	44f14ebd7b	r600g: implement timestamp query and get_timestamp hook Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-15 19:20:58 +02:00
Marek Olšák	1932bc8aae	r600g: enable MSAA on evergreen by default v2: add the DRM version check	2012-08-15 19:20:58 +02:00
Marek Olšák	870af19d70	r600g: implement copying between MSAA textures	2012-08-15 19:20:58 +02:00
Marek Olšák	0f86915c53	r600g: implement MSAA color resolve	2012-08-15 19:20:58 +02:00
Marek Olšák	94b634eca0	r600g: implement MSAA depth-stencil decompression and resolve and integer textures, which are resolved the same as depth, I think.	2012-08-15 19:20:58 +02:00
Marek Olšák	6d3ad2dd2b	r600g: implement TXQ_LZ opcode	2012-08-15 19:20:57 +02:00
Marek Olšák	4b78df9c81	r600g: implement MSAA rendering and texturing for evergreen and cayman	2012-08-15 19:20:57 +02:00
Marek Olšák	a01791add0	r600g: implement set_sample_mask	2012-08-15 19:20:57 +02:00
Marek Olšák	6517225078	r600g: implement alpha-to-coverage	2012-08-15 19:20:57 +02:00
Marek Olšák	26cb887ea2	r600g: implement alpha-to-one	2012-08-15 19:20:57 +02:00
Marek Olšák	4f21595276	r600g: remove support for 3-channel colorbuffers We have no sampler support for them.	2012-08-15 19:20:57 +02:00
Marek Olšák	2f14202f52	configure.ac: bump libdrm_radeon requirement to 2.6.38	2012-08-15 19:20:57 +02:00
Marek Olšák	a7f4d3b740	winsys/radeon: print error if CS is overflowed and don't submit the CS to the kernel.	2012-08-15 19:20:57 +02:00
Marek Olšák	dc5e61d884	gallium/u_blitter: implement X and Y texture flipping	2012-08-15 19:20:57 +02:00
Marek Olšák	825b45366d	gallium/u_blitter: implement blitting multisample resources It can blit only one sample at a time (it should be called in a loop).	2012-08-15 19:20:57 +02:00
Marek Olšák	dacf5dc9ac	gallium: add TGSI support for multisample textures The only allowed instructions are TXQ_LZ and TXF. TXQ_LZ is like TXQ, but without the LOD parameter (which is always zero with MSAA textures) The 3rd or the 4th texcoord component in TXF should contain the sample index for a 2D_MSAA or 2D_ARRAY_MSAA texture, respectively.	2012-08-15 19:20:57 +02:00
Marek Olšák	ba53573a8b	gallium/tgsi: fix TGSI text parser The problem was that the string matching succeeded e.g. for "2D" when there was actually "2D_MSAA" and then failed parsing "_MSAA". To prevent similar failures in the future, let's fix this kind of error everywhere.	2012-08-15 19:20:57 +02:00
Marek Olšák	b7c4ee21c5	gallium/u_blit: set dst format from pipe_resource, not pipe_surface We use it to decide whether we can use resource_copy_region. NOTE: This is a candidate for the 8.0 branch.	2012-08-15 19:20:57 +02:00
Marek Olšák	1a17c42344	gallium: make pipe_box signed in order to represent flipped blits This will be used by u_blitter.	2012-08-15 19:20:57 +02:00
Marek Olšák	03b78ceb50	st/mesa: don't clamp fragment color with integer colorbuffer	2012-08-15 19:20:57 +02:00
Marek Olšák	e06d6168cb	mesa: flush vertices in test_framebuffer_completeness	2012-08-15 19:20:57 +02:00
Michel Dänzer	538085c5d4	st/egl: Fix up for ClientVersion -> ClientMajorVersion rename. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=53513 Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-15 10:49:39 +02:00
Jordan Justen	b3900ed5ad	i965: add ARB_texture_rgb10_a2ui support Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 17:07:42 -07:00
Jordan Justen	091eb15b69	meta: allow CopyTexSubImage on integer formats Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 17:07:42 -07:00
Jordan Justen	6671d0dad3	mesa ReadPixels: handle signed/unsigned integer clamping Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 17:07:42 -07:00
Jordan Justen	f7333b6345	mesa pack: handle packed integer formats with clamping Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 17:07:42 -07:00
Jordan Justen	1a814217c3	mesa unpack: call _mesa_problem when unpack function is not available Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 17:07:42 -07:00
Jordan Justen	b3dd048cbb	mesa texstore: handle signed/unsigned integer clamping Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 17:07:42 -07:00
Jordan Justen	7208505d30	mesa GetTexImage: handle signed/unsigned integer clamping Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 17:07:42 -07:00
Jordan Justen	7ef270867c	mesa pack: handle uint and int clamping properly Rename _mesa_pack_rgba_span_int to _mesa_pack_rgba_span_from_uints. Add _mesa_pack_rgba_span_from_ints. These separate routines allow the integer clamping to be handled properly for signed versus unsigned integers. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 17:07:42 -07:00
Chad Versace	1938501fbf	intel: Fix rendering to a multisample front buffer We need to downsample before flushing BUFFER_FAKE_FRONT_LEFT to BUFFER_FRONT_LEFT in intel_flush_front. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-14 16:19:25 -07:00
Chad Versace	a43599d1d1	intel: Clean up intel_flush_front Stop repeating ourselves. Replace the 4 instances of `driContext->driDrawablePriv` with `driDrawable`. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-14 16:19:25 -07:00
Chad Versace	38b748ce29	intel: Refactor intel_downsample_for_dri2_flush Move it from intel_screen.c to intel_context.c. Redeclare as non-static. A future commit will use it in multiple files. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-14 16:19:25 -07:00
Ian Romanick	cde2b7e55d	docs: Add EGL extensions to release notes Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-14 15:45:17 -07:00
Ian Romanick	dbecb41300	egl: Allow OpenGL ES 3.0 as a version In the DRI2 back-end this will get the same API as GLES 2.0. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 15:41:03 -07:00
Ian Romanick	a2ce2eba26	dri2: Note that __DRI_API_GLES2 is also used for OpenGL ES 3.0 Unlike 1.x to 2.0, OpenGL ES 3.0 is backwards compatible with 2.0. Use the same API flag for both. Applications that specifically want 3.0 will specify this using the major / minor version attributes. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 15:41:03 -07:00
Ian Romanick	7b4b4f8e68	egl_dri2: Add support for EGL_KHR_create_context and EGL_EXT_create_context_robustness Just like in GLX, EGL_KHR_create_context requires DRI2 version >= 3, and EGL_EXT_create_context_robustness requires both DRI2 version >= 3 and the __DRI2_ROBUSTNESS extension. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 15:41:03 -07:00
Ian Romanick	f171571bfc	egl: Implement front-end support for EGL_EXT_create_context_robustness Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 15:41:03 -07:00
Ian Romanick	63beb3df98	egl: Implement front-end support for EGL_KHR_create_context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 15:41:03 -07:00
Ian Romanick	9d76ad2fac	egl_dri2: Silence warnings about missing initializers egl_dri2.c: At top level: egl_dri2.c:325:4: warning: missing initializer [-Wmissing-field-initializers] egl_dri2.c:325:4: warning: (near initialization for 'swrast_driver_extensions[2].version') [-Wmissing-field-initializers] egl_dri2.c:330:4: warning: missing initializer [-Wmissing-field-initializers] egl_dri2.c:330:4: warning: (near initialization for 'swrast_core_extensions[1].version') [-Wmissing-field-initializers] Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 15:41:03 -07:00
Ian Romanick	3fd79dd988	egl: Rename ClientVersion to ClientMajorVersion, add ClientMinorVersion Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 15:41:03 -07:00
Ian Romanick	ce55741cbc	egl_dri2: Use createContextAttribs if DRI2 version >= 3 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 15:41:02 -07:00
Ian Romanick	38f91f2b08	egl_dri2: Require DRI2 version 2 The extra block in dri2_create_context is to prevent extra white space noise in the next patch. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 15:41:02 -07:00
Ian Romanick	0c445bb618	dri_util: Compare against the correct API enums Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 15:41:02 -07:00
Ian Romanick	258771882d	mesa: Enable GL_ARB_invalidate_subdata v2: Add GL_ARB_invalidate_subdata to release notes at Brian's suggestion. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 14:39:33 -07:00
Ian Romanick	07e12c4917	mesa: Add skeleton implementations of glInvalidateTex{Sub,}Image These are part of GL_ARB_invalidate_subdata (but not OpenGL ES 3.0). v2: Add comment explaining why minimum dimensions are set to 1 for some texture targets. Add default case to switch statement to silence compiler warnings and detect new texture targets. Both changes suggested by Brian. Also use _mesa_is_desktop_gl as suggested by Ken. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 14:39:33 -07:00
Ian Romanick	f241ffd48c	mesa: Add skeleton implementations of glInvalidateBuffer{Sub,}Data These are part of GL_ARB_invalidate_subdata (but not OpenGL ES 3.0). v2: Use _mesa_bufferobj_mapped instead of testing gl_buffer_object::Pointer as suggested by Brian. Also use _mesa_is_desktop_gl as suggested by Ken. v3: Add a comment by the map subrange / discard range overlap test and fix an off-by-one error noticed by Ken. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 14:39:33 -07:00
Ian Romanick	e2370bcc1d	mesa/es: Pass context to _mesa_init_bufferobj_dispatch With this change _mesa_init_bufferobj_dispatch won't set function pointers that don't exist in OpenGL ES. v2: Use _mesa_is_desktop_gl and _mesa_is_gles3 as suggested by Ken. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 14:39:33 -07:00
Ian Romanick	342be8aa88	mesa: Add skeleton implementations of glInvalidate{Sub,}Framebuffer These are part of GL_ARB_invalidate_subdata and OpenGL ES 3.0. v2: Reject aux buffers in core context, and use _mesa_is_desktop_gl and _mesa_is_gles3. Both suggested by Ken. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 14:39:33 -07:00
Ian Romanick	12249b9c96	glapi: Add GL_ARB_invalidate_subdata Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 14:39:33 -07:00
Ian Romanick	2a1ca4ff73	mesa/es3: Add _mesa_is_gles3 predicate Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 14:39:29 -07:00
Ian Romanick	9bcb9fad65	intel: Implement ARB_texture_storage This is basically cut-and-paste from the swrast implementation, and it could probably be (slightly) more optimal. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 14:39:19 -07:00
Ian Romanick	92b614172f	mesa: update glext.h to version 83 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-14 12:19:24 -07:00
Matt Turner	79e9e1b32f	build: Use MKDIR_P in src/mesa/Makefile.am Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 10:54:39 -07:00
Matt Turner	02f52e8df5	build: Use AM_V_GEN in src/mesa/Makefile.am Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 10:54:39 -07:00
Matt Turner	1b200d9001	build: Fix autogen.sh to allow out-of-tree builds Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 10:54:39 -07:00
Matt Turner	85d355f122	build: Fix out-of-tree generation of builtin_function.cpp Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 10:54:39 -07:00
Matt Turner	2191a79b4e	build: Fix gtest out-of-tree build Introduced by `3d000e7dd`. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 10:54:39 -07:00
Matt Turner	e939250b63	build: Fix out-of-tree generation of api_exec_es{1,2}.c Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 10:54:39 -07:00
Matt Turner	5c2a6b74ed	build/sources.mak: Add src/glsl/glcpp to INCLUDE_DIRS Fixes problem where libdricore's out-of-tree build couldn't find glcpp.h. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 10:54:38 -07:00
Matt Turner	fa74175210	build/sources.mak: Remove unused GLSL_LIBS Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-14 10:54:38 -07:00
Ian Romanick	707f067915	mesa: Kill GL_ARB_shadow_ambient with fire No driver supports this extension, and it seems unlikely that any driver ever will. I think r300c may have supported it at one time, but that driver has already been removed. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-08-14 10:40:04 -07:00
Tom Stellard	b49771970b	radeon/llvm: Inline immediate offset when lowering implicit parameters	2012-08-14 14:06:20 +00:00
Tom Stellard	2fae8227ad	radeon/llvm: Use correct opcode for BREAK_LOGICALNZ_i32	2012-08-14 13:26:30 +00:00
José Fonseca	ea8dcfc90d	scons: Populate top_srcdir and top_builddir variables when reading Makefiles.sources. This is not entirely correct, as scons doesn't put binaries in a "src" subdirectory, but doesn't seem to be a problem for now.	2012-08-14 12:19:56 +01:00
Kenneth Graunke	605f964d5c	mesa: Use GLdouble for depthMax in final unpack conversions. The final step of _mesa_unpack_depth_span is to take the temporary GLfloat depth values and convert them to the desired format. When converting to GL_UNSIGNED_INTEGER with depthMax > 0xffffff, we use double-precision math to avoid overflow and precision problems. Or at least that's the idea. Unfortunately GLdouble z = depthValues[i] * (GLfloat) depthMax; actually causes single-precision multiplication, since both operands are GLfloats. Casting depthMax to GLdouble causes the scaling to be done with double-precision math. Fixes a regression in oglconform's depth-stencil basic.read.ds test since `c60ac7b179`, where the expected and actual values differed slightly. For example, 0xcfa7a6 vs. 0xcfa7a4. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=49772 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-13 19:16:38 -07:00
Eric Anholt	43e3a7533d	i965: Fix the scaling of seconds to ms in perf debug. headdesk	2012-08-13 17:50:25 -07:00
Ian Romanick	d606926013	i965: Validate API and version in brwCreateContext v2: Use base-10 for versions like gl_context::Version. Suggested by Ken. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-13 17:38:55 -07:00
Ian Romanick	db273724c9	i915: Validate API and version in i915CreateContext v2: Use base-10 for versions like gl_context::Version. Suggested by Ken. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-13 17:36:50 -07:00
Ian Romanick	a81e4b3e92	i830: Validate API and version before calling i830CreateContext Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-13 17:23:48 -07:00
Ian Romanick	2b63624326	intel: In the i915 driver, the chipset cannot be i965 In the i965 driver, the chipset must be i965. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-13 17:23:24 -07:00
Ian Romanick	70f47505a2	dri: Pass API_OPENGL_CORE through to the drivers This forces the drivers to do at least some validation of context API and version before creating the context. In r100 and r200 drivers, this means that they don't do any post-hoc validation. v2: Actually reject compatibility profile 3.2+ contexts. Thanks Ken. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-13 17:17:12 -07:00
Ian Romanick	7e81f553bc	mesa: Filter a bunch more functions based on API Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-13 17:17:00 -07:00
Ian Romanick	0fef911ce4	mesa: Don't advertise extensions that are part of GL 1.5 in a core context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-13 16:19:36 -07:00
Ian Romanick	aa0b1e902b	mesa: Don't advertise extensions that are part of GL 1.4 in a core context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-13 16:19:36 -07:00
Ian Romanick	213945385a	mesa: Don't advertise extensions that are part of GL 1.3 in a core context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-13 16:19:36 -07:00
Ian Romanick	7ef1869d69	mesa: Don't advertise extensions that are part of GL 1.2 in a core context Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-13 16:19:36 -07:00
Ian Romanick	4d39b86315	mesa: Don't advertise deprecated extensions in a core context It may be possible to trim the list of extensions further. These are just the obvious extensions that add functionality that the core context explicitly forbids. Apple's core-context extension list is just the extensions on top of the core GL version. I'm not sure we want to go that far, but removing some things that have been in core since 2.1 may be okay. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-13 16:19:36 -07:00
Christopher James Halse Rogers	cd4a61100d	build: Fix libdricore out-of-tree builds (v2) v2: Add both top_srcdir and top_builddir to mesa asm include dirs. These require both in-tree and build-time-generated files. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Christopher James Halse Rogers <christopher.halse.rogers@canonical.com>	2012-08-13 12:24:54 -07:00
Christopher James Halse Rogers	73fef0178a	build/mapi: More killing of TOP in favour of top_srcdir Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Christopher James Halse Rogers <christopher.halse.rogers@canonical.com>	2012-08-13 12:24:47 -07:00
Christopher James Halse Rogers	77a3efc6b9	build/glsl: fix location of generated files. Like in src/mesa, use GLSL_BUILDDIR/GLSL_SRCDIR to unambiguously distinguish between in-tree and generated files. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Christopher James Halse Rogers <christopher.halse.rogers@canonical.com>	2012-08-13 12:24:39 -07:00
Christopher James Halse Rogers	37a1b8083e	build/glapi: fix includes for generated files Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Christopher James Halse Rogers <christopher.halse.rogers@canonical.com>	2012-08-13 12:24:31 -07:00
Christopher James Halse Rogers	3fe69bac49	build: fix out of tree generation of glapi_mapi_tmp.h Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Christopher James Halse Rogers <christopher.halse.rogers@canonical.com>	2012-08-13 12:24:25 -07:00
Christopher James Halse Rogers	726f534bbb	build/glx: fix include paths for out-of-tree builds Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Christopher James Halse Rogers <christopher.halse.rogers@canonical.com>	2012-08-13 12:24:17 -07:00
Christopher James Halse Rogers	b2ecaab7ad	build: fix location of generated files in src/mesa (v4) Also fix include paths for the generated headers. v2: Switch to using self-explanatory BUILDDIR/SRCDIR defined from top_builddir/top_srcdir rather than the ambiguous TOP. v3: Add both top_builddir and top_srcdir to include flags for mesa asm. These rely on both in-tree and build-time-generated includes. v4: Rebased on top of `948c8f502a`. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Christopher James Halse Rogers <christopher.halse.rogers@canonical.com> Signed-off-by: Matt Turner <mattst88@gmail.com>	2012-08-13 12:24:04 -07:00
Kenneth Graunke	4e087de51a	intel: Reserve enough space to finish occlusion queries on Gen6. After realizing that brw_finish_batch emitted some final PIPE_CONTROLs to record occlusion queries, Chris noted that we probably hadn't reserved enough space to actually emit them. Reserving a full 60 bytes seems a bit harsh, since we only need that much if occlusion queries are actually active. Plus, 28 bytes would be sufficient for Gen7, and 24 for Gen4-5. We could optimize this in the future, but it doesn't seem too critical. NOTE: This is a candidate for stable release branches. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=53311 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-12 20:12:28 -07:00
Kenneth Graunke	9da50667f4	intel: Move finish_batch() call before MI_BATCH_BUFFER_END and padding. On Gen4+, brw_finish_batch() calls brw_emit_query_end(), which emits some extra PIPE_CONTROLs to capture the current occlusion query data. Unfortunately, it was being called after _intel_batchbuffer_flush added the MI_BATCH_BUFFER_END, meaning those PIPE_CONTROLs didn't get inside the batch. Not only does this likely cause bogus occlusion query values, it can also cause crashes: with the recent change to use 64-bit depth count writes on Gen6+, we started emitting an odd-length PIPE_CONTROL, which happened after the MI_NOOP padding. This resulted in an odd-length batch buffer, which resulted in execbuf2 returning -EINVAL and the application dying with an intel_do_flush_locked failure. On older generations, finish_batch() doesn't emit any state, so this change shouldn't have any effect. Huge thanks to Chris Wilson for helping me figure this out. NOTE: This is a candidate for stable release branches. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=53311 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-12 20:12:13 -07:00
Eric Anholt	006c1a3c65	i965: Add perf debug for stalls during shader compiles. v2: fix bad comment from before I gave up and decided to just use doubles. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 19:08:25 -07:00
Eric Anholt	97a5f0ff2e	i965: Add performance debug for when the state cache gets nuked. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 19:08:25 -07:00
Eric Anholt	fc3b7c9b56	i965: Add performance debug for shader recompiles. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 19:08:25 -07:00
Eric Anholt	b4da272a6e	i965: Add performance debug for fast clear fallbacks. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 19:08:25 -07:00
Eric Anholt	0e723b135b	intel: Add performance debug for some common GPU stalls. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 19:08:25 -07:00
Eric Anholt	4cfb9e3000	i965: Add performance debug for register spilling. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 19:08:25 -07:00
Eric Anholt	d72ff03e69	i965: Add INTEL_DEBUG=perf for failure to compile 16-wide shaders. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 19:08:25 -07:00
Eric Anholt	79198063b8	intel: Rename INTEL_DEBUG=fall to INTEL_DEBUG=perf. I want to introduce some more debug output for performance surprises that includes fallbacks, but aren't necessarily software rasterization. Leave INTEL_DEBUG=fall in place for those that have used that flag before. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 19:08:24 -07:00
Pauli Nieminen	bf6c1b7470	meta: texture rectangle textures may not have mipmaps Avoid an INVALID_OPERATION error when decompressing a rectangle texture. Setting mipmap level limits for those textures is an error, and meta code must not trigger it, as that would mislead the user. [v3/Kayden]: Resolve conflicts due to Eric picking a subset of Pauli's original changes. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 16:18:46 -07:00
Pauli Nieminen	b9daa83463	meta: Use sampler object for mipmap generation Sampler objects are perfect for meta operations. A sampler object is a separate state object that shadows the sampling state in the texture object. With a sampler object, mipmap generation can maintain the same sampling state for all subsequent generation requests. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 16:18:43 -07:00
Pauli Nieminen	ac4dc5e931	mesa/samplerobj: Avoid crash in sampler query if texture unit is disabled Sampler queries are so far made only for enabled texture units. But if any code queried the sampler before checking the texture unit state, that would result in a NULL dereference. Making the inline helper easier to use with a NULL check makes a lot of sense because the compiler is likely to combine the checks for the current texture. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 16:18:41 -07:00
Pauli Nieminen	5606bd574e	mesa: Remove unnecessary parameters from CompressedTexImage In tune with previous patches. Again there is duplication of information in function parameters that is good to remove. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 15:49:30 -07:00
Pauli Nieminen	c9a7dfcf92	mesa: Remove unnecessary parameters from AllocTextureImageBuffer Size and format information is always stored in gl_texture_image structure. That makes it preferable to remove duplicate information from parameters to make interface easier to understand. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 15:49:28 -07:00
Pauli Nieminen	c5af889180	mesa: Remove unnecessary parameters from TexImage The gl_texture_image structure always holds the size and internal format before the TexImage driver hook is called. Passing the same information in function parameters only duplicates it, making the interface harder to understand. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 15:49:13 -07:00
Tom Stellard	e98ace934e	configure: Check xcb version when X11 pkgconfig exists Commit `6882381a2e` added a dependency on a newer version of xcb, but the version check wasn't added in all the necessary places. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-12 15:42:43 -07:00
Chí-Thanh Christopher Nguyễn	4c73282d2b	gbm: Fix build without gallium_drm_loader pipe_loader_drm_probe_fd only exists if HAVE_PIPE_LOADER_DRM is defined. Patch improved as suggested by Vadim A. Misbakh-Soloviov. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=52962	2012-08-12 14:38:32 -07:00
Christian König	9f5ff5981c	radeonsi: move drawing into new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-08-11 09:58:26 +02:00
Christian König	583c212115	radeonsi: move sync handling into new state handler So we can remove all the old atom handling. Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-08-11 09:58:26 +02:00
Christian König	303f4b7dcd	radeonsi: separate and disable streamout for now I have my doubts that this code still works on SI. Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-08-11 09:58:26 +02:00
Christian König	696b6cf466	radeonsi: remove ps_partial_flush Not needed any more. Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-08-11 09:58:26 +02:00
Christian König	7acb194a7b	radeonsi: remove r6xx_flush_and_inv atom It is not used any more. Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-08-11 09:58:25 +02:00
Christian König	708337e62e	radeonsi: move init state to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-08-11 09:58:25 +02:00
Christian König	862df0885a	radeonsi: add support for PKT3 cmds to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-08-11 09:58:25 +02:00
Christian König	ce40e4726c	radeonsi: cleanup shader headers Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-08-11 09:58:25 +02:00
Chad Versace	996ff1c9bf	Revert "mesa: Remove C++11 narrowing warnings" This reverts commit `9f5a5d541d`. Fixes the following build error on GCC 4.2.3: cc1plus: error: unrecognized command line option "-Wno-narrowing" The GCC Manual incorrectly stated that commit `9f5a5d54` would be safe for old versions of GCC. Reported-by: Andy Furniss <andyqos@ukfsn.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-10 14:05:14 -07:00
Brian Paul	16c702ef3b	softpipe: fix softpipe_delete_fs_state() failed assertion The var!=softpipe->fs_variant assertion was failing because we weren't nulling the softpipe->fs_variant pointer when binding a new shader. Since softpipe->fs_variant depends on the current fs, it's of no use when a new FS is bound. Fixes http://bugs.freedesktop.org/show_bug.cgi?id=53318 Note: This is a candidate for the 8.0 branch. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-10 13:27:04 -06:00
Brian Paul	3487b93cc4	cso: rearrange some structure fields for consistency	2012-08-10 12:14:17 -06:00
Brian Paul	cf77c29e60	st/mesa: fix renderbuffer validation bug After we attach a new renderbuffer in this function we need to make sure Mesa's update_framebuffer() gets called. Fixes crash in WebGL conformance/textures/texture-attachment-formats.html, but the test still fails for other reasons. Fixes http://bugs.freedesktop.org/show_bug.cgi?id=53316 Note: This is a candidate for the 8.0 branch. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-10 11:49:36 -06:00
Chad Versace	9f5a5d541d	mesa: Remove C++11 narrowing warnings Add -Wno-narrowing to CXXFLAGS for gcc. It is safe to add this flag even for versions of gcc that don't recognize it. From the GCC Manual [1]: "[GCC] allows the use of new -Wno- options with old compilers". This removes warnings of the form warning: narrowing conversion of X from 'int' to 'float' inside { } is ill-formed in C++11 [-Wnarrowing] in ff_fragment_shader.cpp and gen6_blorp.cpp. When building i965, I observed no other difference in the build output. [1] http://gcc.gnu.org/onlinedocs/gcc/Warning-Options.html Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-10 09:59:41 -07:00
Brian Paul	f7af4beae5	gallivm: fix crash in lp_sampler_static_state() Fixes WebGL conformance/uniforms/uniform-default-values.html crash. We need to check for the null view pointer before accessing view->texture. Fixes http://bugs.freedesktop.org/show_bug.cgi?id=53317 Note: This is a candidate for the 8.0 branch. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-08-10 09:45:25 -06:00
Brian Paul	9b04abe368	st/mesa: fix glCopyTexSubImage crash Fixes a WebGL crash. The dest texture image is at level 2 and is of size 1x1 texel. The st texture image is a stand-alone resource, not a pointer into a complete mipmap. So the resource has one level and trying to write to level 2 blows up. Fixes http://bugs.freedesktop.org/show_bug.cgi?id=53314 and http://bugs.freedesktop.org/show_bug.cgi?id=53319 Note: This is a candidate for the 8.0 branch. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-08-10 09:45:17 -06:00
Chad Versace	6cb9e99a75	intel: Always downsample in intel_miptree_map_multisample Always downsample before mapping, even if the map mode contains GL_MAP_INVALIDATE_RANGE_BIT. If we neglect to downsample when only a subrect is mapped then the upsample in intel_miptree_unmap_multisample may write garbage to the region outside the subrect. (Eric gave my patch `e88cfbb` a conditional reviewed-by with the condition that it always downsample before mapping. I forgot to make that change before pushing the patch.) Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-09 15:21:02 -07:00
Eric Anholt	04a11b5f5e	i965/gen6+: Add support for edge flags. Fixes the 3 new piglit edgeflag tests. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=40707 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-09 09:07:50 -07:00
Eric Anholt	b3367f56d8	i965/vs: Convert EdgeFlagPointer values appropriately for the VS on gen4. Fixes piglit gl-2.0/edgeflag. NOTE: This is a candidate for the 8.0 branch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-09 09:07:49 -07:00
Eric Anholt	3eb8d71225	i965/vs: Add comment noting copy_edgeflag state dependency. It's already in the state struct. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-09 09:07:49 -07:00
Eric Anholt	e119f98472	i965/vs: Add support for copying user edge flags. Fixes the glsl skinning demo regression since changing to the new GLSL compiler, and is part of fixing piglit gl-2.0-edgeflag. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50079 NOTE: This is a candidate for the 8.0 branch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-09 09:07:49 -07:00
Olivier Galibert	7426d9d769	i965/fs: Fix the FS inputs setup when some SF outputs aren't used in the FS. If there was an edge flag or a two-side-color pair present, we'd end up mismatched and read values from earlier in the VUE for later FS inputs. v2: Fix regression in gles2conform shaders generating point size. (change by anholt) Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> NOTE: This is a candidate for the 8.0 branch.	2012-08-09 09:07:49 -07:00
Vinson Lee	3466538171	st/mesa: Initialize tgsi_texture_offset Padding field. Fixes uninitialized scalar variable defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-08 22:36:27 -07:00
Kenneth Graunke	68bccc40f5	glx/dri: Initialize reset to __DRI_CTX_RESET_NO_NOTIFICATION. If the application has requested reset notification, then dri2_convert_glx_attribs will initialize this to the correct value. Otherwise, it's supposed to initialize this to NO_NOTIFICATION, but doesn't when num_attribs == 0. (The consensus seems to be that we should make it do so, but that's more invasive, so I'm pushing this for now.) Fixes a regression since `a8724d85f8` where trying to run OilRush_x86 or apitrace heaven_x64 would result in: dri_util.c:221: dri2CreateContextAttribs: Assertion `!"Should not get here."' failed. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=53076 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2012-08-08 17:15:21 -07:00
Tapani Pälli	94f22fbe78	intel: use _mesa_meta_Clear with OpenGL ES 1.1 v2 Patch changes i915 and i965 drivers to use fixed function version of meta clear when running on ES 1.1. This fixes rendering errors seen with Google Maps, Angry Birds and Gallery3D on Android platform. Change `88128516d4` exposes all extensions internally to be available independent of GL flavour, therefore check against ARB_fragment_shader does not work. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50333 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-08 17:15:21 -07:00
Kenneth Graunke	5deb1d1a1f	i965: Rework the extra flushes surrounding occlusion queries. This removes the CS stall on Ivybridge. On Sandybridge, the depth stall needs to be preceded by a non-zero post-sync op, which requires a CS stall, which needs a stall at scoreboard. Emit the full workaround. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Cc: Eric Anholt <eric@anholt.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-08 17:15:21 -07:00
Eric Anholt	b0adbda75a	i965/vs: Protect pow(x,y) MOV of y on gen4 from other instruction flags. I don't know if it was possible to trigger this bug -- we don't merge saturates into the math instruction because we're bad at coalescing currently, and there's nothing generating these with predicates. Still, let's avoid future bugs when we do smarter codegen. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-08 16:21:31 -07:00
Eric Anholt	9b4053cabd	i965: Drop the confusing saturate argument to math instruction setup. This was ridiculous. We were ignoring the inst->header.saturate flag in the case of math and only math. On gen4, we would leave inst->header.saturate in place if it happened to be set, which would end up being applied to the implicit mov and thus trash the first argument. On gen6, we would overwrite inst->header.saturate with the saturate flag from the argument, which was not set appropriately in brw_vec4_emit.cpp, and was only not a bug due to our incompetence at coalescing saturate moves. By ripping the argument out and making saturate work just like all the other brw_eu_emit.c code generation, we can avoid both these classes of bugs. Fixes piglit fog-modes, and the new specific fs-saturate-exp2 case. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=48628 NOTE: This is a candidate for the 8.0 branch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-08 16:21:30 -07:00
Eric Anholt	33dfdc735e	i965: Make brw_set_saturate() use stdbool. There was a chance for brw_wm_emit.c to screw up and pass (1 << 4) instead of 1, which would get converted to 0 when stored. Instead, use stdbool which converts nonzero to true/1 like we want.	2012-08-08 16:21:30 -07:00
Eric Anholt	1b148e660e	mesa: In conditional rendering fallback, check the query status. Otherwise, conditional rendering always takes the fallthrough "render it anyway" case unless the application had itself done a check or wait on the query. Fixes intel oglconform's conditional_render advanced.nofbo.readpixels. Reviewed-by: Brian Paul <brianp@vmware.com> NOTE: This is a candidate for the 8.0 branch.	2012-08-08 16:21:30 -07:00
Eric Anholt	4bbd120368	mesa: Fix glPopAttrib() behavior on GL_FRAMEBUFFER_SRGB. I happened to notice this while looking at a blit pass in l4d2, which had an optional push/pop around framebuffer srgb setting. It didn't matter in the end, but the fix is sitting in my tree now. Reviewed-by: Brian Paul <brianp@vmware.com> NOTE: This is a candidate for the 8.0 branch.	2012-08-08 16:21:30 -07:00
Ian Romanick	9f7b3d1713	Make shared-glapi the default You can't practically have desktop OpenGL and OpenGL ES on the same system without this. The benefits of not having it (e.g., a more compact dispatch table) are irrelevant. v2: Don't mark shared-glapi as experimental. Review suggestion by Chad. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-08 10:06:26 -07:00
Ian Romanick	5602f0f955	mesa/tests: Fix trivial typos in src/mapi/glapi tests Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-08 10:06:26 -07:00
Ian Romanick	45d3d0ad21	mesa/tests: Add tests for the generated shared-glapi dispatch table These are largely based on the src/mapi/glapi/tests. However, shared-glapi provides less external visibility into the dispatch table, so there is less to test. Also, shared-glapi does not implement _glapi_get_proc_name, so that test was removed. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-08 10:06:26 -07:00
Ian Romanick	d9f899bb93	glapi: Prevent accidental use of lies w/shared-glapi Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-08 10:06:26 -07:00
Ian Romanick	99fee476a1	glx: Don't use glapitable.h at all When --enable-shared-glapi is used, all non-ABI entries in the table are lies. Avoiding the use of glapitable.h avoids the lies. The only entries used in this code are entries that are ABI. For these, the ABI offset can be used directly. Since this code is in src/glx, it can't use src/mesa/main/dispatch.h to get the pretty names for these offsets. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-08 10:06:26 -07:00
Ian Romanick	f5dffb7e36	glx: Don't rely on struct _glapi_table When --enable-shared-glapi is used, all non-ABI entries in the table are lies. There are two completely separate code generation paths used to assign dispatch offset. Neither has any clue about the other. Unsurprisingly, they can't agree on what offsets to assign. This adds a bunch of overhead to __glXNewIndirectAPI, but this function is called at most once. The test ExtensionNopDispatch was removed. There was just no way to make this test work with the information provided in shared-glapi. Since indirect_glx.c uses _glapi_get_proc_offset now, it was also impossible to make the tests work without shared-glapi. So much pain. This fixes indirect rendering with shared-glapi. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-08 10:06:26 -07:00
Ian Romanick	52d6df8aa7	mesa/tests: Don't build glapi tests with shared-glapi This fixes 'make check' with --enable-shared-glapi. This test cannot work in that environment. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-08 10:06:25 -07:00
Kenneth Graunke	e45a9ce474	i965: Use 64-bit writes for occlusion queries. The hardware seems to use the length of the PIPE_CONTROL command to indicate whether the write is 64-bits or 32-bits. Which makes sense for immediate writes. Daniel discovered this by writing a pattern into the query object bo and noticing that the high 32-bits were left intact, even on those pipe control writes that seemingly worked. Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-08 09:24:23 -07:00
Kenneth Graunke	20c09b82d0	i965: Refactor depth count write PIPE_CONTROLs into a helper function. This consolidates the complexity in one place, which is important because it's about to get even more complicated. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-08 09:24:21 -07:00
Kenneth Graunke	a2cdd5ada8	i965: Emit a CS stall before timestamp writes. This implements one of the Sandybridge PIPE_CONTROL workarounds. It doesn't appear to be required for Ivybridge. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-08 09:24:19 -07:00
Kenneth Graunke	c4c78c275a	i965: Use 64-bit writes for timestamp queries. The hardware seems to use the length of the PIPE_CONTROL command to indicate whether the write is 64-bits or 32-bits. Which makes sense for immediate writes. Daniel discovered this by writing a pattern into the query object bo and noticing that the high 32-bits were left intact, even on those pipe control writes that seemingly worked. Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-08 09:24:16 -07:00
Kenneth Graunke	03f14664b6	i965: Refactor timestamp write PIPE_CONTROLs into a helper function. This consolidates the complexity in one place, which is important because it's about to get even more complicated. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-08 09:24:14 -07:00
Kenneth Graunke	61d0b9f52c	intel: Make the length for PIPE_CONTROL explicit. PIPE_CONTROL has variable length, depending upon generation and whether we want to do 32-bit or 64-bit data writes. Make it explicit, rather than hiding a length of 4 in the #define for _3DSTATE_PIPE_CONTROL. Generated by s/3DSTATE_PIPE_CONTROL/3DSTATE_PIPE_CONTROL | (4 - 2)/g. This is equivalent since the #define used to have | 2 in it. A grep through the sources shows that all instances have been converted, so it's safe to remove the | 2 from the #define. Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-08 09:23:57 -07:00
Brian Paul	ecac178aa2	swrast: add missing switch case for API_OPENGL_CORE To silence compiler warning. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-08 09:39:36 -06:00
Brian Paul	b4d6502fcd	gallivm: remove unused src_elem_type variable Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-08 09:39:36 -06:00
Brian Paul	f21669e9a2	svga: remove unused svga_shader::use_sm30 field, add comments Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-08 09:39:36 -06:00
Brian Paul	16a289195e	svga: remove unused svga_winsys_handle type Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-08 09:39:36 -06:00
Michel Dänzer	82cd9c0fc2	radeonsi: If pixel shader compilation fails, use a dummy shader. Otherwise we're likely to hang the GPU. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-08 15:33:38 +02:00
Christian König	be42a45e02	radeonsi: fix memory leak and/or segfaults Fix a stupid typo that could lead to memory leaks and/or segfaults. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-08 12:36:49 +02:00
Christian König	8c44e5a144	radeon/winsys: fix winsys VM handling Move releasing the VM area after closing the bo handle. This partially fixes: https://bugs.freedesktop.org/show_bug.cgi?id=45018 Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-08 12:35:10 +02:00
Vinson Lee	7528e2104f	translate: Fix typo in is_legal_int_format_combo. Fixes same on both sides defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-07 22:34:28 -07:00
Marek Olšák	1ea263fccb	r600g: remove unused parameters in texture functions	2012-08-07 23:39:52 +02:00
Eric Anholt	4a078516b6	i965: Enable uniform buffer objects on gen6+. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:52 -07:00
Eric Anholt	04871058eb	i965/vs: Add support for loading uniform buffer variables as pull constants. Unlike the FS side in the previous commit, this does variable indexing just fine, using the same code as we used for other variable-indexed pull constants. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:52 -07:00
Eric Anholt	90de96ff0d	i965/fs: Add support for loading uniform buffer variables as pull constants. Variable array indexing isn't finished, because the lowering pass turns it all into conditional moves of constant index accesses so I can't test it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:51 -07:00
Eric Anholt	bb020d09c3	i965/vs: Add a surface index to VS_OPCODE_PULL_CONSTANT instructions. Similar to the previous commit for the fragment shader, now we have a buffer index and an offset. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:51 -07:00
Eric Anholt	454dc83f66	i965/fs: Communicate the pull constant block read parameters through fs_regs. I wanted to add the surface index as a variable value for UBO support, and a reg seemed like the obvious way to go. This exposes more of the information to CSE, which we'll probably want to apply to pull constant loads for UBOs eventually (you might access 4 floats in a row, each of which would produce an oword block read of the same block). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:51 -07:00
Eric Anholt	25d2bf3845	i965: Bind UBOs as surfaces like we do for pull constants. v2: Comment fix, drop extraneous parens (review by Kenneth) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:51 -07:00
Eric Anholt	5bffbd7ba2	i965: Add an offset argument to constant buffer setup. We'll use this for UBO surfaces. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:51 -07:00
Eric Anholt	5fc5b29a54	mesa: Add support for glUniformBlockBinding() in display lists. Fixes piglit GL_ARB_uniform_buffer_object/dlist. v2: Use the .ui fields instead of .i for type consistency (review by Brian Paul) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:51 -07:00
Eric Anholt	bfa046b5f2	mesa: Unbind uniform buffer bindings on glDeleteBuffers(). Fixes piglit GL_ARB_uniform_buffer_object/deletebuffers. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:51 -07:00
Eric Anholt	1eb3c06ae8	mesa: Default to GL 3.1's limits on uniform blocks. The ARB spec lets you get away with the default block counting against the blocks for combined size limits. The core spec says you need to be able to support the maximum size of default block and the maximum size of each uniform block. I see no reason that any driver would have a problem with that. Fixes gl 3.1/minmax (with an associated fix to the test) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:51 -07:00
Eric Anholt	803262a5f5	glsl: Refuse to parse uniform block declarations when UBOs aren't available. Fixes piglit GL_ARB_uniform_buffer_object/compiler/extension-disabled-block.frag Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:51 -07:00
Eric Anholt	e45f1b11c0	glsl: Align GL_UNIFORM_BLOCK_DATA_SIZE according to std140 rules. Fixes piglit GL_ARB_uniform_buffer_object/data-size test. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:51 -07:00
Eric Anholt	86e0045578	glsl: Only flag RowMajor on matrix-type variables. We were only propagating it to the API when the variable was a matrix type, but we were still tripping over it in lower_ubo_reference when it was set on a vector. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:50 -07:00
Eric Anholt	ffb2d43059	glsl: Fix calculation of std140 offset alignment for mat2s. We were getting the base offset of a vec2, not of a vec2[2] like the quoted spec text says we should. v2: Fix swapped then/else cases. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:50 -07:00
Eric Anholt	300315fe69	glsl: Fix glGetActiveUniformsiv(GL_UNIFORM_BLOCK_INDEX). Previously, we were returning the index into the UniformBlocks of one of the linked shaders, when it's supposed to be the program global index. Fixes piglit getactiveuniformsiv-uniform_block_index. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:50 -07:00
Eric Anholt	af3fc6bb28	ir_to_mesa: Don't whack the ->location field of uniform block variables. Fixes some failures in GL_ARB_uniform_buffer_object/maxblocks. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:50 -07:00
Eric Anholt	56e82e30cb	mesa: Make glBindBufferBase/glBindBufferRange() work on just-genned names. In between glGenBuffers() and glBindBuffer(), the buffer object points to this dummy buffer with a name of 0, and a glBindBufferBase() would point to that. It seems pretty clear, given that glBindBufferBase() only cares about the current size of the buffer at render time, that it should bind up the buffer that you passed in instead of pointing it at this useless dummy buffer. However, what should glBindBufferRange() do? As of this patch, it will promote the genned buffer to a proper buffer like it had been glBindBuffer()ed, and then detect that the size is greater than the buffer's current size of 0 and throw INVALID_VALUE. It seems like the most reasonable answer here. Note that this also changes the behavior of these two on non-glGenBuffers() bo names. We haven't yet set up the error throwing for glBindBuffers() on gl 3.1+, and my assumption is that these two functions should inherit their behavior on un-genned names from glBindBuffers(). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 13:54:50 -07:00
Eric Anholt	a75f2681d2	glsl: Add a lowering pass to turn complicated UBO references to vector loads. v2: Reduce the impenetrable code in emit_ubo_loads() by 23 lines by keeping the ir_variable as the variable part of the offset from handle_rvalue(), and track the constant offsets from that with a plain old integer value, avoiding a bunch of temporary variables in the array and struct handling. Also, fix file description doxygen. v3: Fix a row vs col typo, and fix spelling in a comment. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-07 13:54:47 -07:00
Eric Anholt	8c2a983835	glsl: Add a variant of the rvalue visitor for handle_rvalue() on the way down. For the UBO lowering pass, I want to see the whole dereference chain for replacing, not the innermost ir_dereference_variable. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 11:47:49 -07:00
Eric Anholt	2ea3ab14f2	glsl: Add a "ubo_load" expression type for fetches from UBOs. Drivers will probably want to be able to take UBO references in a shader like: uniform ubo1 { float a; float b; float c; float d; } void main() { gl_FragColor = vec4(a, b, c, d); } and generate a single aligned vec4 load out of the UBO. For intel, this involves recognizing the shared offset of the aligned loads and CSEing them out. Obviously that involves breaking things down to loads from an offset from a particular UBO first. Thus, the driver doesn't want to see variable_ref(ir_variable("a")), and even more so does it not want to see array_ref(record_ref(variable_ref(ir_variable("a")), "field1"), variable_ref(ir_variable("i"))). where a.field1[i] is a row_major matrix. Instead, we're going to make a lowering pass to break UBO references down to expressions that are obvious to codegen, and amenable to merging through CSE. v2: Fix some partial thoughts in the ir_binop comment (review by Kenneth) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 11:47:49 -07:00
Eric Anholt	71ba6de342	glsl: Fix a reference to UniformBlocks during uniform linking. When converting var->location from pointing at the program's UniformBlocks to pointing at the linked shader's UniformBlocks, I missed this change. It usually worked out in the end because the two lists happen to be the same in many testcases. Fixes a valgrind complaint on oglconform ubo-compile.cpp advanced.std140.2stage Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 11:47:49 -07:00
Eric Anholt	7e42302e71	glsl: Update the notes on adding a new expression type. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 11:47:49 -07:00
Eric Anholt	9c1b41879a	mesa: Replace VersionMajor/VersionMinor with a Version field. As we get into supporting GL 3.x core, we come across more and more features of the API that depend on the version number as opposed to just the extension list. This will let us more sanely do version checks than "(VersionMajor == 3 && VersionMinor >= 2) || VersionMajor >= 4". v2: Fix a bad <= 30 check. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 11:47:19 -07:00
Eric Anholt	3aaeb3e5e7	intel: Fix compiler warnings from winsys msaa.	2012-08-07 11:47:11 -07:00
Chad Versace	e943e5c291	intel: Advertise multisample DRI2 configs on gen >= 6 This turns on window system MSAA. This patch changes the id of many GLX visuals and configs, but that couldn't be prevented. I attempted to preserve the id's of extant configs by appending the multisample configs to the end of the extant ones. But somewhere, perhaps in the X server, the configs are reordered with multisample configs interspersed among the singlesample ones. Test results: Tested with xonotic and `glxgears -samples 1` on Ivybridge. No piglit regressions on Ivybridge. On Sandybridge, passes 68/70 of oglconform's winsys multisample tests. The two failing tests are: multisample(advanced.pixelmap.depth) multisample(advanced.pixelmap.depthCopyPixels) These tests hang the gpu (on kernel 3.4.6) due to a glDrawPixels/glReadPixels pair on an MSAA depth buffer. I don't expect realworld apps to do that, so I'm not too concerned about the hang. On Ivybridge, passes 69/70. The failing case is multisample(advanced.line.changeWidth). Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:34 -07:00
Chad Versace	8b5d68dd28	intel: Clarify intel_screen_make_configs This function felt sloppy, so this patch cleans it up a little bit. - Rename `color` to `i`. It is not a color value, only an iterator int. - Move `depth_bits[0] = 0` into the non-accum loop because that is where it is used. The accum loop later overwrites depth_bits[0]. - Rename `depth_factor` to `num_depth_stencil_bits`. - Redefine `msaa_samples_array` as static const because it is never modified. Rename to `singlesample_samples`. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	a4bf68ca50	dri: Simplify use of driConcatConfigs If either argument to driConcatConfigs(a, b) is null or the empty list, then simply return the other argument as the resultant list. All callers were accomplishing that same behavior anyway. And each caller accomplished it with the same pattern. So this patch moves that external pattern into the function. Reviewed-by: <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	b2d428cb8d	intel: Refactor creation of DRI2 configs DRI2 configs were constructed in intelInitScreen2. That function already does too much, so move verbatim the code for creating configs to a new function, intel_screen_make_configs. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	61fd684782	intel: Downsample on DRI2 flush Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	e88cfbb95f	intel: Support mapping multisample miptrees Add two new functions: intel_miptree_{map,unmap}_multisample, to which intel_miptree_{map,unmap} dispatch. Only mapping flat, renderbuffer-like miptrees is supported. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	4c0ccc13bd	intel: Refactor use of intel_miptree_map Move the opencoded construction and destruction of intel_miptree_map into new functions, intel_miptree_attach_map and intel_miptree_release_map. This patch prevents code duplication in a future commit that adds support for mapping multisample miptrees. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	81980958d0	intel: Refactor intel_miptree_map/unmap Move the body of intel_miptree_map into a new function, intel_miptree_map_singlesample. Now intel_miptree_map dispatches to the new function. A future commit adds a multisample variant. Ditto for intel_miptree_unmap. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	6b56140b4b	i965: Mark needed downsamples for msaa winsys buffers Add function intel_renderbuffer_set_needs_downsample. It is a no-op except on multisample winsys buffers shared with DRI2. Mark the needed downsamples with the new function at two locations: - Immediately after drawing is complete. - After blitting. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	d3746354fb	intel: Define functions for up/downsampling on miptrees Flesh out the stub functions intel_miptree_{up,down}sample. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	6cc9df331b	i965: Add function brw_blorp_blit_miptrees Define a function, brw_blorp_blit_miptrees, that simply wraps brw_blorp_blit_params + brw_blorp_exec with C calling conventions. This enables intel_miptree.c, in a following commit, to perform blits with blorp for the purpose of downsampling multisample miptrees. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	f4873babdc	intel: Allocate miptree for multisample DRI2 buffers Immediately after obtaining, with DRI2GetBuffersWithFormat, the DRM buffer handle for a DRI2 buffer, we wrap that DRM buffer handle with a region and a miptree. This patch additionally allocates an accompanying multisample miptree if the DRI2 buffer is multisampled. Since we do not yet advertise multisample GL configs, the code for allocating the multisample miptree is currently inactive. This patch adds the following fields to intel_mipmap_tree: singlesample_mt needs_downsample and the following function stubs: intel_miptree_downsample intel_miptree_upsample Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	4eba67285f	intel: Refactor creation of hiz and mcs miptrees Move the logic for creating the ancillary hiz and mcs miptrees for winsys and non-texture renderbuffers from intel_alloc_renderbuffer_storage to intel_miptree_create_for_renderbuffer. Let's try to isolate complex miptree logic to intel_mipmap_tree.c. Without this refactor, code duplication would be required along the intel_process_dri2_buffer codepath in order to create the mcs miptree. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	e2f2376e88	intel: Set num samples for winsys renderbuffers Add a new param, num_samples, to intel_create_renderbuffer and intel_create_private_renderbuffer. No multisample GL config is yet advertised, so the value of num_samples is currently 0. For server-owned winsys buffers, gl_renderbuffer::NumSamples is not yet used. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> (v1) Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	53fa28f7b1	intel: Refactor quantize_num_samples Rename quantize_num_samples to intel_quantize_num_samples and change the first param from struct intel_context* to struct intel_screen*. The function will later be used by intelCreateBuffer, which is not bound to any context but is bound to a screen. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Paul Berry <stereotype441@gmail.com> (v1) Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Chad Versace	7a2e40ed28	intel: Update stale comment for intel_miptree_slice::map The comment referred to intel_tex_image_map/unmap, but should more accurately refer to intel_miptree_map/unmap. Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-07 09:30:33 -07:00
Paulo Zanoni	4b40375c43	i965: add more Haswell PCI IDs Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Rodrigo Vivi <rodrigo.vivi@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-07 11:13:47 -03:00
Brian Paul	8433f80add	egl: remove redundant PFNEGLQUERYSTREAMTIMEKHRPROC typedef This typedef is present earlier in the header and isn't part of the EGL_KHR_stream_cross_process_fd extension. Looks like a Khronos glitch.	2012-08-07 07:31:05 -06:00
Brian Paul	99695f58fd	softpipe: fix loop limit for tex_cache[] array Fixes https://bugs.freedesktop.org/show_bug.cgi?id=53199	2012-08-07 08:00:46 -06:00
Vinson Lee	7d65356d8a	st/mesa: Fix a potential memory leak in get_mesa_program. Fixes resource leak defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 22:08:56 -07:00
Vinson Lee	c3894bc2d5	gallivm: Add constructor for raw_debug_ostream. Fixes uninitialized scalar field defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 22:07:31 -07:00
Brian Paul	e622723918	docs: update ARB_debug_output status to DONE	2012-08-06 16:48:00 -06:00
Jason Wood	56c1f55c51	docs: Add OpenGL 4.3 requirements v2: Note that GLSL 4.3 has not been started, and that ARB_compute_shader has been started in Gallium drivers. Signed-off-by: Jason Wood <sandain@hotmail.com> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-08-06 16:41:24 -06:00
Ian Romanick	45e592c3dd	egl: Import eglext.h version 14 This is necessary for EGL_KHR_create_context work (including writing piglit tests). Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-06 15:37:04 -07:00
Ian Romanick	b50703aea5	egl: Replace KHR_surfaceless_* extensions with KHR_surfaceless_context KHR extension name is reserved for Khronos ratified extensions, and there is no such thing as EGL_KHR_surfaceless_{gles1,gles2,opengl}. Replace these three extensions with EGL_KHR_surfaceless_context since that extension actually exists. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-06 15:37:04 -07:00
Ian Romanick	cb77f5dd1f	egl_dri2: Refactor dereference of dri2_ctx_shared Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-06 15:37:04 -07:00
Ian Romanick	05413ddb1d	egl_dri2: Remove swrast version >= 2 checks Since support for swrast version 2 was added (`f55d027a`), it has also been required. In swrast_driver_extensions, version 2 is set for __DRI_SWRAST extension. Remove the spurious version checks sprinkled through the code. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Cc: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-06 15:37:04 -07:00
Ian Romanick	63adb6b9ea	dri2: Fix bug in attribute handling for non-desktop OpenGL contexts Previously an error would be generated if any attributes were specified when creating a non-desktop OpenGL context. This was a mistake, and it will prevent old drivers from working with new EGL libraries that add support for the createContextAttribs interface. Instead, match the behavior of EGL_KHR_create_context: allow versions that make sense, reject non-zero flags. NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Cc: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-06 15:37:04 -07:00
Andreas Boll	102617bc52	docs: update piglit url Signed-off-by: Brian Paul <brianp@vmware.com>	2012-08-06 16:23:43 -06:00
Andreas Boll	933e13e2af	docs/helpwanted: add r600g and i915g todo lists Signed-off-by: Brian Paul <brianp@vmware.com>	2012-08-06 16:23:43 -06:00
Kenneth Graunke	caa4ae5d7d	i965: Allocate dummy slots for point sprites before computing VUE map. Commit `f0cecd43d6` moved the VUE map computation to be only once, at VS compile time. However, it did so in slightly the wrong place: it made the one call to brw_vue_compute_map happen right before the allocation of dummy slots for replaced point sprite coordinates, causing a different VUE map to be generated (at least on Ironlake). Fixes a regression in Piglit's point-sprite test on Ironlake. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=46489 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-06 11:16:40 -07:00
Kenneth Graunke	54c045b93c	i965/vs: Don't clobber sampler message MRFs with subexpressions. See the preceding commit for a description of the problem. NOTE: This is a candidate for stable release branches. v2: Use a separate dPdx variable rather than reusing the lod src_reg. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=52129 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-06 11:16:15 -07:00
Kenneth Graunke	c0f60106df	i965/fs: Don't clobber sampler message MRFs with subexpressions. Consider a texture call such as: textureLod(s, coordinate, log2(...)) First, we begin setting up the sampler message by loading the texture coordinates into MRFs, starting with m2. Then, we realize we need the LOD, and go to compute it with: ir->lod_info.lod->accept(this); On Gen4-5, this will generate a SEND instruction to compute log2(), loading the operand into m2, and clobbering our texcoord. Similar issues exist on Gen6+. For example, nested texture calls: textureLod(s1, c1, texture(s2, c2).x) Any texturing call where evaluating the subexpression trees for LOD or shadow comparator would generate SEND instructions could potentially break. In some cases (like register spilling), we get lucky and avoid the issue by using non-overlapping MRF regions. But we shouldn't count on that. Fixes four Piglit test regressions on Gen4-5: - glsl-fs-shadow2DGradARB-{01,04,07,cumulative} NOTE: This is a candidate for stable release branches. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=52129 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-06 11:16:11 -07:00
Kenneth Graunke	27bf9c1997	i965/fs: Factor out texcoord setup into a helper function. With the textureRect support and GL_CLAMP workarounds, it's grown sufficiently that it deserves its own function. Separating it out makes the original function much more readable. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-06 11:16:09 -07:00
Kenneth Graunke	82bfb4b41a	i965/fs: Move message header and texture offset setup to generate_tex(). Setting the texture offset bits in the message header involves very specific hardware register descriptions. As such, I feel it's better suited for the lower level "generate" layer that has direct access to the weird register layouts, rather than at the fs_inst abstraction layer. This also parallels the approach I took in the VS backend. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-06 11:16:00 -07:00
Jerome Glisse	2df399c34b	r600g: atomize sampler state v2 Use atom for sampler state. Does not provide new functionality or fix any bug. Just a step toward full atom base r600g. v2: Split seamless on r6xx/r7xx into its own atom. Make sure it's emitted after sampler and with a pipeline flush before otherwise it does not take effect. Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-08-06 12:04:55 -04:00
Alex Deucher	d3f8000bfc	radeonsi: add some new pci ids Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-06 10:55:41 -04:00
Alex Deucher	a6146d2566	r600g: add additional evergreen pci ids Note: this is a candidate for the stable branches. Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2012-08-06 10:55:41 -04:00
Brian Paul	8eeeef3705	st/mesa: merge fragment/vertex sampler update code Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 08:50:20 -06:00
Brian Paul	819e786339	st/mesa: massage update_vertex_samplers() code ...to look like update_fragment_samplers() code, as with the previous commit. The next step would be to merge the two functions. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 08:50:19 -06:00
Brian Paul	2aac0d145a	st/mesa: merge fragment/vertex texture update code Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 08:50:11 -06:00
Brian Paul	dd6aafcf72	st/mesa: massage the update_vertex_textures() code ...to look like update_fragment_textures() code. The next step would be to merge the two functions. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 08:41:07 -06:00
Brian Paul	5749ae919e	st/mesa: rename some vertex/fragment state fields for better consistency Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 08:41:07 -06:00
Brian Paul	29604441de	llvmpipe: consolidate the sampler and sampler view setting code Less code. And as with softpipe, if/when we consolidate the pipe_context functions for binding sampler state, this will make the llvmpipe changes trivial. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 08:33:17 -06:00
Brian Paul	b3538d3563	llvmpipe: combine vertex/fragment sampler state into an array This will allow code consolidation in the next patch. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 08:33:17 -06:00
Brian Paul	1f34e1a6cb	softpipe: consolidate vert/frag/geom sampler setting functions The functions for setting samplers and sampler views for vertex, fragment and geometry shaders were nearly identical. Now they use shared code. In the future, if the pipe_context functions for setting samplers and sampler views for vert/frag/geom/compute are combined, this will make updating the softpipe driver a snap. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 08:33:17 -06:00
Brian Paul	d6c3e6d8f3	softpipe: consolidate sampler-related arrays Combine separate arrays for vertex/fragment/geometry samplers, etc into one array indexed by PIPE_SHADER_x. This allows us to collapse separate code for vertex/fragment/geometry state into loops over the shader stage. More to come. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 08:33:17 -06:00
Brian Paul	0a14e9f09f	softpipe: combine vert/frag/geom texture caches in an array This lets us consolidate some code now, and more in subsequent patches. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-08-06 08:33:17 -06:00
Vinson Lee	61b62c007a	mesa: Fix off-by-one error in Parse_TextureImageId. Fixes out-of-bounds write defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 21:42:23 -07:00
Vinson Lee	3e7b3a04bf	util: Move dereference after null check in util_resource_copy_region. Fixes dereference before null check defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 21:41:27 -07:00
Brian Paul	a5ca29100b	i915g: silence a const pointer warning	2012-08-04 08:38:11 -06:00
Marek Olšák	f9a498d1bc	radeonsi: fix build failure after blitter changes	2012-08-04 16:34:24 +02:00
Marek Olšák	cb922b63eb	r600g: precompute color buffer state in pipe_surface and reuse it	2012-08-04 14:05:52 +02:00
Marek Olšák	cdc681c3ad	r600g: precompute depth buffer state in pipe_surface and reuse it This is done on-demand, because we don't know in advance if a zbuffer will be bound as depth or color.	2012-08-04 14:05:51 +02:00
Marek Olšák	e6dfc8c77b	r600g: simplify create_surface	2012-08-04 14:05:51 +02:00
Marek Olšák	581f7e3101	r600g: drop the old texture allocation code Made obsolete by the libdrm surface allocator.	2012-08-04 14:05:51 +02:00
Marek Olšák	7c371f4695	r600g: make sure copying of all texture formats is accelerated	2012-08-04 14:05:51 +02:00
Marek Olšák	84645fa613	gallium/u_blitter: add a query for checking whether copying is supported v2: add comments	2012-08-04 14:05:37 +02:00
Marek Olšák	e2f623f1d6	r600g: don't decompress depth or stencil if there isn't any	2012-08-04 13:53:07 +02:00
Marek Olšák	ea72351a91	r600g: correct texture memory size for Z32F_S8X24 on evergreen	2012-08-04 13:53:07 +02:00
Marek Olšák	c8ff737a18	gallium/u_blitter: remove fallback for stencil copy that all drivers skipped Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 13:53:07 +02:00
Marek Olšák	ef1bf6d69e	gallium/u_blitter: add ability to blit only depth or only stencil Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 13:53:07 +02:00
Marek Olšák	8842678047	gallium: define PIPE_MASK_RGBAZS I need this and it seems like it could be useful. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 13:53:07 +02:00
Marek Olšák	8aaf6972d1	gallium/u_blitter: minor cleanup Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 13:53:07 +02:00
Marek Olšák	67a3e5bc32	gallium/tgsi: fixup texture name strings Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 13:53:07 +02:00
Marek Olšák	6c420b1668	gallium/u_blitter: set sample mask to ~0 Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 13:53:07 +02:00
Marek Olšák	9d1ef354f9	gallium/u_blit: bail out if src is a multisample texture Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 13:53:07 +02:00
Marek Olšák	6b3f1ae12b	gallium/u_blit: check nr_samples before using resource_copy_region Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 13:53:07 +02:00
Marek Olšák	e7689303a8	gallium: set sample mask to ~0 for clear, blit and gen_mipmap The sample mask affects single-sampled rendering too (it's orthogonal to the color mask). Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-04 13:53:06 +02:00
Dave Airlie	cd97a5f660	r600g: fix F2U opcode translation Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-08-04 13:45:27 +02:00
Vinson Lee	5bce0b5175	draw: Ensure channel in convert_to_soa is initialized. Fixes uninitialized pointer read defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-03 22:28:31 -07:00
Vinson Lee	9d36b3abfd	u_blitter: Move a pointer dereference after null check. Fixes dereference before null check defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-08-03 22:27:13 -07:00
Matt Turner	fb85558ab1	Use C99 NAN and INFINITY macros	2012-08-03 15:02:09 -07:00
Brian Paul	65da837fcf	gallium/tests/trivial: updates for CSO interface changes	2012-08-03 11:58:43 -06:00
Brian Paul	c61d3fe8bd	st/xorg: updates for CSO interface changes	2012-08-03 11:56:36 -06:00
Brian Paul	459dd56897	st/xa: updates for CSO interface changes	2012-08-03 11:56:28 -06:00
Brian Paul	3d1bec5d9a	vega: fix build breakage from cso sampler/view changes	2012-08-03 08:33:23 -06:00
Brian Paul	832706a80b	cso: remove unreachable break statements	2012-08-03 07:16:35 -06:00
Brian Paul	076e5eacf1	cso: 80-column wrapping, remove trailing whitespace, etc	2012-08-03 07:16:35 -06:00
Brian Paul	ea6f035ae9	gallium: consolidate CSO sampler and sampler_view functions Merge the vertex/fragment versions of the cso_set/save/restore_samplers() functions. Now we pass the shader stage (PIPE_SHADER_x) to the function to indicate vertex/fragment/geometry samplers. For example: cso_single_sampler(cso, PIPE_SHADER_FRAGMENT, unit, sampler); This results in quite a bit of code reduction, fewer CSO functions and support for geometry shaders. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-08-03 07:16:35 -06:00
Vinson Lee	350f12fb65	st/mesa: Ensure dst in compile_instruction is initialized. Fixes uninitialized scalar variable defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-02 21:10:49 -07:00
Tom Stellard	f6ad8b45c2	radeon/llvm: Add $(LLVM_LDFLAGS) to the loader linker flags	2012-08-02 20:12:11 +00:00
Tom Stellard	4a89a20717	radeon/llvm: Add support for more f32 CMP instructions on SI	2012-08-02 20:12:11 +00:00
Tom Stellard	a35eea7868	radeon/llvm: Add support for fneg on SI	2012-08-02 20:12:10 +00:00
Tom Stellard	4104bae063	radeon/llvm: Add support for fp_to_sint on SI	2012-08-02 20:12:10 +00:00
Tom Stellard	f7fcaa07df	radeon/llvm: Remove CMOVLOG DAG node	2012-08-02 20:12:06 +00:00
Tom Stellard	a5ac8ee2c5	radeonsi: Properly initialize si_shader_ctx.radeon_bld	2012-08-02 13:21:30 -04:00
Michel Dänzer	c2bae6b91d	radeonsi: Handle TGSI TXP opcode. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-02 18:38:47 +02:00
Michel Dänzer	93b4f1f97e	radeonsi: Handle TGSI DIV opcode. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-02 18:38:16 +02:00
Brian Paul	daf4254d07	svga: remove questionable INLINE qualifiers	2012-08-02 09:40:41 -06:00
Brian Paul	421f134028	svga: sort #includes	2012-08-02 09:40:40 -06:00
Brian Paul	81f2f3f65c	svga: add some comments in svga_screen_cache.c	2012-08-02 09:40:40 -06:00
Brian Paul	4b5a5898b1	svga: whitespace, formatting fixes	2012-08-02 09:40:40 -06:00
Brian Paul	bcd8d9713d	svga: remove unneeded 'struct svga_screen' declarations	2012-08-02 09:40:40 -06:00
Brian Paul	8551635242	mesa: fix default_access_mode() result for ES2 The GL_OES_mapbuffer extension is supported by OpenGL ES 1 and ES 2 so return GL_MAP_WRITE_BIT for both ES versions, not just ES 1. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-02 09:40:40 -06:00
Brian Paul	3eb2b5c5e4	mesa: default_access_mode() returns a GLbitfield, not GLenum	2012-08-02 09:40:40 -06:00
José Fonseca	4bd36956f8	scons: set YACCHXXFILESUFFIX to stop needless rebuilding of the parser Before, the GLSL parser was getting rebuilt every time that scons was run. The problem was scons was expecting a glsl_parser.hpp file but we were generating a glsl_parser.h file. Signed-off-by: Brian Paul <brianp@vmware.com>	2012-08-02 09:40:40 -06:00
Christian König	41625afa2f	radeonsi: initial VDPAU target Windowed speed is of course way too slow, but fullscreen works like a charm now. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-02 15:15:23 +02:00
Christian König	a3c6607be1	radeon/llvm: fix fp immediates on SI I don't know if this is a good idea, but it fixes the problem at hand. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-02 15:15:00 +02:00
Christian König	250b7fdd26	radeonsi: fix TEX writemask Using the writemask in the sampler results in packet VGPRS. For now just sample all components and let llvm choose the right one. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-02 12:05:33 +02:00
Christian König	3508815d17	radeonsi: fix shader param and color count Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-02 11:22:57 +02:00
Christian König	92b96a883f	radeonsi: fix texture loads from sampler > 0 The backend is multiplying the offset by the numbers of elements anyway, so doing it twice just makes everything crash. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-02 11:22:52 +02:00
Christian König	9b7dc5e81c	radeonsi: disable tiling until we fixed all bugs Currently there are more important things to worry about. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-08-02 11:22:40 +02:00
Vinson Lee	8734584952	scons: Add support for Intel Compiler. The patch makes the SCons build with Intel Compiler successful. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-01 21:28:47 -07:00
Pauli Nieminen	204bfb904b	meta: Use sampler object in framebuffer blit Framebuffer blit needs to setup texture sampling with no reference to the user's texturing state, and a sampler object lets us avoid a bunch of changes to the user's state setup. We don't bother caching the sampler object since we're changing parameters in it based on the filtering option to glBlitFramebuffer(). Fixes piglit GL_ARB_sampler_objects/framebufferblit and rendering in l4d2 (our setting of srgb decode wasn't being respected due to the user's sampler object being active). Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-01 15:57:12 -07:00
Pauli Nieminen	676a563d5b	meta: Add sampler object to texture decompression Sampler objects can be used to shadow texture object state without modifying original application state. Decompression path feels a bit like path where caching shouldn't happen. But as everything else is cached already I decided to cache sampler state too. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-01 15:57:12 -07:00
Pauli Nieminen	5a320d5bcf	mesa: Allow meta module to call sampler functions To allow the meta module to use sampler objects, the mesa GL functions need to be visible and linkable for the meta module. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-01 15:57:12 -07:00
Pauli Nieminen	cbdc1d5354	swrast: Support sampler object for texture fetching state swrast needs to pass sampler object into all texture fetching functions to use correct sampling state when sampler object is bound to the unit. The changes were made using half manual regular expression replace. v2: Fix NULL deref in _swrast_choose_triangle(), because the _Current values aren't set yet, so we need to look at our texObj2D. (anholt) Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-01 15:55:51 -07:00
Pauli Nieminen	8129dabb5f	mesa: Make ARB_sampler_objects mandatory To allow meta acceleration operations to use sampler objects the ARB_sampler_objects extension needs to be mandatory for all drivers. Because the extension doesn't have any hardware dependencies it is trivial to implement. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-01 15:31:17 -07:00
Pauli Nieminen	ae58f9696c	mesa/program: Use sampler object state if present CompareFailValue is part of Sampler state that needs to be read from bound sampler object if present. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-01 15:31:17 -07:00
Pauli Nieminen	cae7636852	mesa/ff_shader: Fix sampler state reading Fixed function fragment shader generator was incorrectly reading texture sampling state directly from the texture object. To make sure that ARB_sampler_objects works correctly, the shader generator has to use the bound sampler if one exists. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-01 15:31:17 -07:00
Pauli Nieminen	6f6bd8aedc	radeon&r200: Add support for ARB_sampler_objects Preparation for the mandatory support of ARB_sampler_objects. I have tested this patch with rv280 only. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-01 15:31:16 -07:00
Pauli Nieminen	10169e7adc	radeon: Fix printf format not to warn in 64bit When I build-tested the radeon changes I noticed two warnings about format size mismatch in 64bit. I decided to clean them up to make relevant compiler warnings easier to spot. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-08-01 15:31:16 -07:00
Pauli Nieminen	54808e560f	nouveau: Add support for ARB_sampler_objects ARB_sampler_objects is a very simple software-only extension to support. I want to make it a mandatory extension for Mesa drivers to allow the meta module to use it. This patch adds support for the extension to nouveau. It is a completely untested search and replace patch, except for flagging the texture state as needing to be recomputed when a sampler object is present. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com>	2012-08-01 15:31:16 -07:00
Pauli Nieminen	765509903b	mesa/samplerobj: Support EXT_texture_sRGB_decode sRGBDecode state is part of sampler object state but mesa was missing handlers to access the state. This patch adds the support for required state changes and queries. GL_EXT_texture_sRGB_decode issue 4: "4) Should we add forward-looking support for ARB_sampler_objects? RESOLVED: YES If ARB_sampler_objects exists in the implementation, the sampler objects should also include this parameter per sampler." Fixes piglit GL_ARB_sampler_objects/GL_EXT_texture_sRGB_decode. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-01 15:31:16 -07:00
Pauli Nieminen	c37efbfe4c	mesa: Move DepthMode to texture object GL_DEPTH_TEXTURE_MODE isn't meant to be part of sampler state based on compatibility profile specifications. OpenGL specification 4.1 compatibility 20100725 3.9.2: "... The values accepted in the pname parameter are TEXTURE_WRAP_S, TEXTURE_WRAP_T, TEXTURE_WRAP_R, TEXTURE_MIN_FILTER, TEXTURE_MAG_FILTER, TEXTURE_BORDER_COLOR, TEXTURE_MIN_LOD, TEXTURE_MAX_LOD, TEXTURE_LOD_BIAS, TEXTURE_COMPARE_MODE, and TEXTURE_COMPARE_FUNC. Texture state listed in table 6.25 but not listed here and in the sampler state in table 6.26 is not part of the sampler state, and remains in the texture object." The list of states is in Table 6.24 "Textures (state per texture object)" instead of 6.25 mentioned in the specification text. Same can be found from 3.3 compatibility specification. Signed-off-by: Pauli Nieminen <pauli.nieminen@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-08-01 15:30:13 -07:00
Paul Berry	c18806cebf	i965/msaa: Allow GL_SAMPLES to be set to 1 prior to Gen6. This patch allows GL_SAMPLES to be set to either 0 or 1 on i965 platforms that don't support MSAA (those prior to Gen6). Setting GL_SAMPLES=1 has the same effect as setting it to 0 on these platforms (because MSAA is unsupported), but is distinguishable via the GL API. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50165 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-01 12:45:20 -07:00
Paul Berry	97fc89c6cb	i965/msaa: Treat GL_SAMPLES=1 as equivalent to GL_SAMPLES=0. EXT_framebuffer_multisample is a required subpart of ARB_framebuffer_object, which means that we must support it even on platforms that don't support MSAA. Fortunately EXT_framebuffer_multisample allows for this by allowing GL_MAX_SAMPLES to be set to 1. This leads to a tricky quirk in the GL spec: since glRenderbufferStorageMultisample() accepts any value for its "samples" parameter up to and including GL_MAX_SAMPLES, that means that on platforms that don't support MSAA, GL_SAMPLES is allowed to be set to either 0 or 1. On platforms that do support MSAA, GL_SAMPLES=1 is not used; 0 means no MSAA, and 2 or higher means MSAA. In other words, GL_SAMPLES needs to be interpreted as follows: =0: no MSAA (possible on all platforms); =1: no MSAA (only possible on platforms where MSAA unsupported); >1: MSAA (only possible on platforms where MSAA supported). This patch modifies all MSAA-related code to choose between multisampling and single-sampling based on the condition (GL_SAMPLES > 1) instead of (GL_SAMPLES > 0) so that GL_SAMPLES=1 will be treated as "no MSAA". Note that since GL_SAMPLES=1 implies GL_SAMPLE_BUFFERS=1, we can no longer use GL_SAMPLE_BUFFERS to distinguish between MSAA and non-MSAA rendering. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-08-01 12:45:15 -07:00
Tomeu Vizoso	d5c918f6ad	glsl: Add support for OES_standard_derivatives in GLSL ES. Previously, we advertised the extension but the builtin functions were enabled only for GLSL and not for ES. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=52003 Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-08-01 10:44:44 -07:00
Chad Versace	8c94f6bbd8	intel: Use consistent pattern in intelCreateBuffer The 16-bit depth case did not follow the function's prevalent pattern. Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-01 10:33:40 -07:00
Chad Versace	2b4fbc4d7d	intel: Decrease nesting level in intelCreateBuffer Nearly the whole function body was contained in the 'else' branch. The 'if' branch did one thing: return early with an error. Clean things up by moving all the code out of the 'else' branch. Decreases max nesting level from 4 to 3. Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-01 10:33:38 -07:00
Chad Versace	83fa0842ca	intel: Remove dead code in intelAllocateBuffer After commit "intel: Convert to using private depth/stencil buffers", we request from DRI2GetBuffersWithFormat only the front left and back left buffers. We no longer request depth and stencil buffers. Assert that in intelAllocateBuffer and remove the related dead code. Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-08-01 10:33:36 -07:00
Matt Turner	84ead7b4e8	configure.ac: Remove extra ;; Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=53053	2012-08-01 10:12:50 -07:00
Matt Turner	33ae29c93b	configure.ac: Don't duplicate CFLAGS These assignments caused CFLAGS specified on the configure line to appear twice in the final CFLAGS. Removing them makes the behavior reasonable -- USER_CFLAGS are appended at the end of CFLAGS, allowing the builder to override flags added by configure.ac like -fno-strict-aliasing. Reviewed-by: Adam Jackson <ajax@redhat.com>	2012-08-01 10:12:50 -07:00
Matt Turner	14819eb588	configure.ac: Remove contractions to stop breaking syntax highlighting Reviewed-by: Adam Jackson <ajax@redhat.com>	2012-08-01 10:12:50 -07:00
Matt Turner	0e38a3ca52	configure.ac: remove remnants of ppc asm support Missed by `d387899388`. Reviewed-by: Adam Jackson <ajax@redhat.com>	2012-08-01 10:12:22 -07:00
Adam Jackson	33ef67ab20	linux: Default to dri not xlib on all arches Even on s390{,x} where there's no video card, you still want this so GLX protocol works. Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Adam Jackson <ajax@redhat.com>	2012-08-01 12:37:25 -04:00
Christoph Bumiller	8592933de8	nv50,nvc0: make resolve sampler objects allow sRGB conversion Just figured out what that bit does. Note: It's converted back to sRGB on write, so no effective conversion occurs.	2012-08-01 15:39:46 +02:00
Christoph Bumiller	6286d9810b	Revert "gallium: specify resource_resolve destination via a pipe_surface" This reverts commit `5d5af7d359`. It turns out the issue this was supposed to fix merely counter-acted a bug in the hardware driver that I wasn't aware of. The resource_resolve is not supposed to do sRGB conversion, period. (This would violate the requirement that source and destination must be of the same format).	2012-08-01 15:39:46 +02:00
Roland Scheidegger	be2dcc5e9f	r200: get rid of dubious aux scissor bits no point in emitting aux scissor values if we a) never enable them b) never set the actual values plus it is enough to have that aux scissor enable reg (which we never set to enable) in one place not two.	2012-08-01 14:58:47 +02:00
Roland Scheidegger	c0c216c469	radeon/r200: get rid of some unneeded cliprect/scissor code No one was interested in the number of cliprects, and no one cared about the intersect result either. So just nuke this.	2012-08-01 14:58:38 +02:00
Roland Scheidegger	549470aa1a	r200: get rid of old gart memory functions from old dri1 Those functions are SO dead.	2012-08-01 14:58:29 +02:00
Roland Scheidegger	de694b6b10	radeon/r200: fix bogus clears There were several problems with these functions (which are a remnant of dri1 hyperz mostly - should bring it back somehow someday). First, it would always do a swrast clear if the buffer to clear was a fbo. Second, for buffers we wouldn't handle the clear (I guess aux/accum?) we would actually still have tried to clear that later even when we already cleared it with swrast.	2012-08-01 14:58:23 +02:00
Roland Scheidegger	5b88a2a22d	radeon/r200: fix bogus assert/scissor wrt width/height 2048 This addresses one issue raised in bug #51658 discovered by Eugene St Leger. The assert is bogus since there's no problem with texture width/height being 2048 (the width/height programmed is width/height minus one). OTOH though the programmed size for scissor rect should be width/height minus one too otherwise bad things may happen (as it is inclusive, and there's not enough bits for more than a value of 2047).	2012-08-01 14:58:15 +02:00
Christian König	6574fe3c4a	radeon/llvm: fix calculation of max register number Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-08-01 11:15:06 +02:00
Tom Stellard	a488fdd3d9	radeon/llvm: Add pseudo-support for 64-bit immediate types on SI SI does not support 64-bit immediates natively, but llvm will generate i64 immediates when indexing loads and stores (since SI has 64-bit pointers). The i64 indices will always be small enough to fit into 32-bits (i.e. the high 32 bits will always be all zeros), so we can treat these index values as 32-bits.	2012-07-31 20:19:21 +00:00
Tom Stellard	be46874281	radeon/llvm: Fix incorrect return value in SelectADDRReg() We need to return true when we match the pattern.	2012-07-31 20:19:20 +00:00
Tom Stellard	056b77ca22	radeon/llvm: Move SMRD IMM pattern before SMRD SGPR pattern In tablegen, if two patterns match, the one that comes first in the file is given preference. We want the SMRD IMM pattern to be given preference, because it encodes the pointer offset in its immediate field, which saves us an add instruction.	2012-07-31 20:19:20 +00:00
Eric Anholt	877a897adc	glsl: Reject linking shaders with too many uniform blocks. Part of fixing piglit maxblocks. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:20 -07:00
Eric Anholt	fa08b8ad54	mesa: Return -1 for glGetUniformLocation on UBOs. Fixes piglit ARB_uniform_buffer_object/getuniformlocation. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:20 -07:00
Eric Anholt	bbd1d6124d	glsl: Assign array and matrix stride values according to std140 layout. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:20 -07:00
Eric Anholt	551bdf25bc	glsl: Add support for default layout qualifiers for uniforms. I ended up having to add rallocing of the ast_type_qualifier in order to avoid pulling in ast.h for glsl_parser_extras.h, because I wanted to track an ast_type_qualifier in the state. Fixes piglit ARB_uniform_buffer_object/row-major. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:20 -07:00
Eric Anholt	7b77c64254	glsl: Merge UBO layout qualifiers in a qualifier list. Yes, you get to say things like "layout(row_major, column_major)" and get column major. Part of fixing piglit ARB_uniform_buffer_object/row_major. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:20 -07:00
Eric Anholt	eed967bc9c	mesa: Add support for GL_ARB_ubo's glGetActiveUniformName(). This is like a stripped-down version of glGetActiveUniform that just returns the name, since the other return values (type and size) of that function are now meant to be handled with glGetActiveUniformsiv(). Fixes piglit ARB_uniform_buffer_object/getactiveuniformname Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:19 -07:00
Eric Anholt	dc654370c3	mesa: Add support for most of the other pnames of glGetActiveUniformBlockiv(). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:19 -07:00
Eric Anholt	5a165d1f3a	mesa: Add support for getting active uniform block names. Fixes piglit ARB_uniform_buffer_object/getactiveuniformblockname. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:19 -07:00
Eric Anholt	467304dfe5	mesa: Add support for glUniformBlockBinding() and the API to get it back. Fixes piglit ARB_uniform_buffer_object/uniformbufferbinding. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:19 -07:00
Eric Anholt	fafa394c15	glsl: Incorporate all UBO language changes into GLSL 1.40. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:19 -07:00
Eric Anholt	4070036259	mesa: Add support for glGetProgramiv pnames for UBOs. Fixes piglit ARB_uniform_buffer_object/getprogramiv. v2: Add extension checks. v3: Appease MSVC. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 12:06:19 -07:00
Kenneth Graunke	3a90dc22d1	glsl: Refactor #version validation to be more future-proof. The previous implementation required a flag in _mesa_glsl_parse_state and line of code to initialize it for every version of the shading language we intend to support. As we look to add 150, 330, 400, 410, 420, and beyond, this gets rather unwieldy. This patch retains the switch statement (to reject, say, #version 111), but removes all the bits. Code to check for ctx->API == API_OPENGL_CORE could easily be added to the 110 and 120 cases to reject those. v2: Use _mesa_is_desktop_gl to preserve the existing behavior in the presence of the new API_OPENGL_CORE enumeration. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> [v1]	2012-07-31 11:20:49 -07:00
Eric Anholt	19bd5936af	i965: Add support for GL_SKIP_DECODE_EXT on other SRGB formats. Fixes some failures in getteximage-formats. v2: Remove stray include, and drop extra test for encoding == GL_SRGB -- _mesa_get_srgb_format_linear() returns the same format if it wasn't SRGB. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=48120 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1) NOTE: This is a candidate for the 8.0 branch.	2012-07-31 11:14:23 -07:00
Kenneth Graunke	03ac5c54b5	glsl: Fix #pragma invariant(all) language version check. It was using state->Const.GLSL_100ES, which is set if the driver supports ARB_ES2_compatibility or we're in ES2 mode. Instead, it should use state->language_version, as that represents the actual GLSL version of the shader being compiled. Since the correct logic is < 120 && !100, just make it == 110. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-31 10:52:54 -07:00
Kenneth Graunke	d84b3a5a3c	mesa: Support glGetString(GL_SHADING_LANGUAGE_VERSION) for >= 1.40. This will need to get refactored when we add support for core profiles or forward-compatible contexts, but we may as well have it in the meantime. This allows us to override the GLSL version and experiment. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-31 10:52:54 -07:00
Brian Paul	591594ea1e	ir_to_mesa: make size_swizzles[] array static const	2012-07-31 09:00:41 -06:00
Jon TURNEY	27013e5164	Move installing osmesa.pc to drivers/osmesa Move installing osmesa.pc to drivers/osmesa, where it belongs better This also restores the installation of gl.pc if we are building osmesa at the same time as libGL, which was broken in commit `39785488` when the .pc installation was converted to automake v2: Remove HAVE_OSMESA_DRIVER automake conditional, it's now pointless as we will only be building in the drivers/osmesa directory if the condition it checked was true. Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-31 12:48:33 +01:00
Vinson Lee	2faa2b4f7e	gallium/util: Use GCC built-in functions for NaN and infinity. This patch fixes this build failure with Intel Compiler. src/gallium/auxiliary/util/u_format_tests.c(903): error: floating-point operation result is out of range {PIPE_FORMAT_R16_FLOAT, PACKED_1x16(0xffff), PACKED_1x16(0x7c01), UNPACKED_1x1( NAN, 0.0, 0.0, 1.0)}, Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-30 23:27:19 -07:00
Jordan Justen	3d0b54c7c6	mesa: don't enable legacy GL functions when using API_OPENGL_CORE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-30 16:25:56 -07:00
Jordan Justen	1fea3df6f4	intel: add support for using API_OPENGL_CORE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-30 16:25:56 -07:00
Jordan Justen	0f099df567	meta: add support for using API_OPENGL_CORE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-30 16:25:56 -07:00
Jordan Justen	4aecd8f031	glsl: add support for using API_OPENGL_CORE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-30 16:25:56 -07:00
Jordan Justen	09714c09a4	mesa: add support for using API_OPENGL_CORE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-30 16:18:57 -07:00
Jordan Justen	3d284dcba6	mesa: add api check functions These functions make it easier to check for multiple API types. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-30 16:18:57 -07:00
Jordan Justen	1c29b73f4d	mesa: add API_OPENGL_CORE api Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-30 16:18:57 -07:00
Ian Romanick	d3de40742f	glsl: Fix ir_last_opcode value. Now that ir_quadop_vector exists, ir_last_binop and ir_last_opcode are no longer the same. Only one place currently uses this enumeration, and already handles ir_quadop_vector correctly. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Olivier Galibert <galibert@pobox.com>	2012-07-30 15:15:48 -07:00
Ian Romanick	9d998a2a59	glsl: Request an Nx1 type instance in ir_quadop_vector lowering pass. No types have 0 columns. The glsl_type::get_instance method contains if ((rows < 1) || (rows > 4) || (columns < 1) || (columns > 4)) return error_type; To get a vector, use columns = 1. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Olivier Galibert <galibert@pobox.com>	2012-07-30 15:14:34 -07:00
Kenneth Graunke	13cb99dc73	glsl: Make bvec and ivec types accessible without using get_instance. It's more convenient to use shortcuts like glsl_type::bvec2_type than the longwinded glsl_type::get_instance(GLSL_TYPE_BOOL, 2, 1). Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Olivier Galibert <galibert@pobox.com>	2012-07-30 15:14:09 -07:00
Tom Stellard	cd0949eb28	radeon/llvm: Cleanup AMDIL.h	2012-07-30 21:10:14 +00:00
Tom Stellard	2f921101c0	radeon/llvm: Rename all AMDIL* classes to AMDGPU*	2012-07-30 21:10:14 +00:00
Tom Stellard	b72ab79d73	radeon/llvm: Merge AMDILSubtarget into AMDGPUSubtarget	2012-07-30 21:10:13 +00:00
Tom Stellard	27ae41c83d	radeon/llvm: Merge AMDILTargetLowering class into AMDGPUTargetLowering	2012-07-30 21:10:13 +00:00
Tom Stellard	c96490e3b5	radeon/llvm: Remove IL_cmp DAG node	2012-07-30 21:10:13 +00:00
Tom Stellard	aece7970eb	radeon/llvm: Cleanup and reorganize AMDIL .td files	2012-07-30 21:10:13 +00:00
Tom Stellard	0ce6e50601	radeon/llvm: Remove lowering code for unsupported features e.g. function calls, load/store from stack	2012-07-30 21:10:08 +00:00
Tom Stellard	caeaf43dad	radeon/llvm: Remove AMDILVersion.td	2012-07-30 20:31:57 +00:00
Tom Stellard	c3111eb639	radeon/llvm: Remove AMDILAlgorithms.tpp	2012-07-30 20:31:57 +00:00
Tom Stellard	ac669c32c6	radeon/llvm: Merge AMDILInstrInfo.cpp into AMDGPUInstrInfo.cpp	2012-07-30 20:31:57 +00:00
Tom Stellard	3a0187b1b5	radeon/llvm: Merge AMDILRegisterInfo into AMDGPURegisterInfo	2012-07-30 20:31:57 +00:00
Tom Stellard	9c42fb6f26	radeon/llvm: Change the tablegen target from AMDIL to AMDGPU	2012-07-30 20:31:56 +00:00
Kenneth Graunke	f56dfc3213	i965: Support MESA_FORMAT_SIGNED_RGBA_16. The hardware supports this format with no known quirks, so we may as well enable it. Alpha blending is not supported until Sandybridge, but as far as I can tell, OpenGL doesn't require alpha blending on SNORM formats. Plus, we already expose R8G8B8A8_SNORM which has a similar restriction. Fixes 6 piglit texwrap-2D-SNORM cases, gl-3.1/required-sized-texture-formats, and 10 oglconform snorm-textures subcases Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-30 09:35:58 -07:00
Elvis Lee	e7a4a2b18b	gbm: Fix build for wayland include backends/gbm_dri.c fails to find wayland-server.h. Signed-off-by: Elvis Lee <kwangwoong.lee@lge.com>	2012-07-30 11:58:02 -04:00
Brian Paul	b51be8786f	mesa: fix _math_matrix_copy(), again The matrix is 16 GLfloats in size. Since from->inv is just a pointer (not an array), sizeof(*from->inv) wasn't right.	2012-07-30 08:30:15 -06:00
Vinson Lee	502c10839e	mesa: Fix wrong sizeof argument in _math_matrix_copy. Fixes Coverity wrong sizeof argument defect. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-07-30 08:13:55 -06:00
Christian König	86490bc150	radeonsi: fix db and stencil setup v2 v2: fix tiling for small pitches, that finally makes glxgears and readPixSanity work Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-07-30 15:02:04 +02:00
Christian König	7dace3a3cf	radeonsi: fix stencil op mapping Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-07-30 15:02:00 +02:00
Christian König	ad15c8c0f1	radeonsi: fix assertion in si_bind_vs_sampler Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-07-30 15:01:55 +02:00
Christian König	1fb8ee62fa	radeonsi: fix shader binding Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-07-30 15:01:51 +02:00
Christian König	f18fd255cf	radeonsi: fix dummy export in shaders v2 v2: add assertion for vertex shader Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-07-30 15:01:34 +02:00
Christian König	b15e3ae5b4	radeonsi: fix vertex buffer and elements Let's just use the T# descriptors until we get a fetch shader. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-07-30 14:45:32 +02:00
Christian König	d51b9b70d5	radeonsi: fix shader size and handling We should always upload the shader here. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-07-30 14:45:08 +02:00
Christian König	fe41287ffa	radeonsi: rename r600_resource to si_resource Also split it into a separate header and add some helper functions. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-07-30 14:44:38 +02:00
Kenneth Graunke	dcf8754cce	glcpp: Add a newline to expanded #line directives. Otherwise, the preprocessor happily outputs #line 2 4 <your next line of code> and the main compiler gets horribly confused and fails to compile. This is not the right solution (line numbers in error messages will likely be off-by-one in certain circumstances), but until Carl comes up with a proper fix, this gets programs running again. Fixes regressions in Regnum Online, Overgrowth, Piglit, and others since commit `aac78ce823`. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=51802 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=51506 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=41152 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-28 13:33:50 -07:00
Christoph Bumiller	5d5af7d359	gallium: specify resource_resolve destination via a pipe_surface The format member of pipe_surface may differ from that of the pipe_resource, which is used to communicate, for instance, whether sRGB encode should be enabled in the resolve operation or not. Fixes resolve to sRGB surfaces in mesa/st when GL_FRAMEBUFFER_SRGB is disabled. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-28 14:58:18 +02:00
Christoph Bumiller	51e41a0d89	st/mesa: call update_renderbuffer_surface for sRGB renderbuffers, too sRGBEnabled should affect both textures and renderbuffers, so we need to check/update the pipe_surface format for both. Fixes, for instance, rendering appearing too bright in wine applications using sRGB multisample renderbuffers. NOTE: This is a candidate for the 8.0 branch. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-28 13:14:30 +02:00
Christoph Bumiller	acd66ec033	nv50: fix depth/stencil multisample memory storage types Leftover from libdrm_nouveau v2 interface change.	2012-07-28 13:14:03 +02:00
Christoph Bumiller	cd3d85b63d	nv50: fix resource_resolve shader start offsets	2012-07-28 13:11:56 +02:00
Brian Paul	f612e55e45	st/mesa: undo a couple static asserts Hmm, gcc didn't catch these mistakes, but MSVC did.	2012-07-27 16:10:58 -06:00
Brian Paul	322a2938f3	st/mesa: use STATIC_ASSERT in a few places	2012-07-27 15:47:38 -06:00
Brian Paul	59c67f8116	mesa: whitespace, etc. fixes in program.h	2012-07-27 15:43:53 -06:00
Brian Paul	906febaf8b	meta: fix glDrawPixels fallback test, stencil drawing Remove the check for pixel transfer ops. If any RGB/depth scale/bias is in effect, it'll be applied in the glTexImage step. If drawing stencil pixels we need to disable pixel transfer so that alpha scale/bias are not applied to the stencil data. These issues were spotted by Roland. Fixes Blender performance issues reported in http://bugs.freedesktop.org/show_bug.cgi?id=47375 NOTE: This is a candidate for the 8.0 branch. Tested-by: Barto <mister.freeman@laposte.net> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-27 14:53:16 -06:00
Brian Paul	a80b7407f3	radeon: fix 'sowftware' typo	2012-07-27 14:53:16 -06:00
Eric Anholt	fbf86c7f0f	i965/gen7: Reduce GT1 WM thread count according to updated BSpec. Acked-by: Kenneth Graunke <kenneth@whitecape.org> https://bugs.freedesktop.org/show_bug.cgi?id=52382	2012-07-27 11:42:19 -07:00
Kenneth Graunke	cbcf750d5f	i965: Fix typo in shader channel select field name. "chanel" isn't very searchable. I can type, honest! Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-27 11:31:07 -07:00
Paul Berry	ee9f6a34cc	i965/msaa: Use MESA_FORMAT_R8 for MCS buffer. No functional change. This patch modifies intel_miptree_alloc_mcs to allocate the 4x MCS buffer using MESA_FORMAT_R8 instead of MESA_FORMAT_A8. In principle it doesn't matter, since we only access the buffer using MCS-specific hardware mechanisms, so all that's important is to use a format with the correct size. However, MESA_FORMAT_A8 has enough unusual behaviours that it seems prudent to avoid it. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-27 10:42:19 -07:00
Zou Nan hai	588881430a	intel: increase wm thread number to 80 on gen6 GT2 It seems reset is not required for setting the max_wm_threads to 80 on gen6 GT2. Increases performance in the Counter-Strike: Source video stress test by 7.18% (n=5). Signed-off-by: Zou Nan hai <nanhai.zou@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Matt Turner <mattst88@gmail.com> Acked-by: Eric Anholt <eric@anholt.net>	2012-07-27 10:32:17 -07:00
Tom Stellard	fdd8df20e4	r600g: Emit dispatch state for compute directly to the cs We no longer rely on an evergreen_compute_resource for emitting dispatch state. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-27 17:08:09 +00:00
Tom Stellard	dc0b8a4628	r600g: Initialize VGT_PRIMITIVE_TYPE in the start_cs_cmd atom The value of this register will always be DI_PT_POINTLIST for compute shaders. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-27 17:08:09 +00:00
Tom Stellard	d3b0130491	r600g: Atomize compute shader state Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-27 17:08:09 +00:00
Tom Stellard	5497391067	r600g: Add helper functions for emitting compute SET_CONTEXT packets Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-27 17:08:09 +00:00
Tom Stellard	c9ef27276f	radeon/llvm: Add instruction defs for branches on SI	2012-07-27 17:08:09 +00:00
Tom Stellard	ee0f0f03c6	radeon/llvm: Fix VOPC and V_CNDMASK encoding	2012-07-27 17:08:09 +00:00
Tom Stellard	d4bdd09d47	radeon/llvm: Assert if we try to copy SCC reg	2012-07-27 17:08:09 +00:00
Tom Stellard	fd1f19a191	radeon/llvm: Add SI DAG optimizations for setcc, select_cc These are needed for correctly lowering branch instructions in some cases.	2012-07-27 17:08:08 +00:00
Tom Stellard	cd5d4c5073	radeon/llvm: Add support for encoding SI branch instructions	2012-07-27 17:08:08 +00:00
Tom Stellard	50ff2dc0a4	radeon/llvm: Add special nodes for SALU operations on VCC The VCC register is tricky because the SALU views it as 64-bit, but the VALU views it as 1-bit. In order to deal with this we've added some special bitcast and binary operations to help convert from the 64-bit SALU view to the 1-bit VALU view and vice versa.	2012-07-27 17:08:08 +00:00
Tom Stellard	c424975572	radeon/llvm: Add i1 registers for SI.	2012-07-27 17:08:08 +00:00
Tom Stellard	bdda1cb914	radeon/llvm: Fix CCReg definitions on SI	2012-07-27 17:08:08 +00:00
Tom Stellard	ae9be358f2	radeonsi: Enable PIPE_SHADER_CAP_INTEGERS	2012-07-27 17:08:08 +00:00
Tom Stellard	022b54359a	radeonsi: Add support for loading integers from constant memory	2012-07-27 17:08:07 +00:00
Tom Stellard	ad95bcb31f	radeon/llvm: Add bitconvert patterns for SI	2012-07-27 17:08:07 +00:00
Tom Stellard	4cab682184	radeon/llvm: Add custom lowering for SELECT_CC nodes on SI	2012-07-27 17:08:07 +00:00
Tom Stellard	ba76684292	radeon/llvm: Move conditional pattern leafs to common tablegen file	2012-07-27 17:08:07 +00:00
Tom Stellard	d36455ba2c	radeon/llvm: Implement getSetCCResultType for SI	2012-07-27 17:08:07 +00:00
Tom Stellard	e8825ce6e1	radeon/llvm: Custom lower BR_CC for SI	2012-07-27 17:08:07 +00:00
Tom Stellard	87272e9e25	radeon/llvm: Move lowering of BR_CC node to R600ISelLowering SI will handle BR_CC differently from R600, so we need to move it out of the shared instruction selector.	2012-07-27 17:08:07 +00:00
Tom Stellard	92823fb72a	radeon/llvm: Move lowering of SETCC node to R600ISelLowering SI will handle SETCC differently from R600, so we need to move it out of the shared instruction selector.	2012-07-27 17:08:06 +00:00
Tom Stellard	46d12c99a2	radeon/llvm: Use correct node type when lowering SETCC	2012-07-27 17:08:06 +00:00
Tom Stellard	47d1b0a809	radeon/llvm: Move LowerSELECT_CC into R600ISelLowering SI will handle SELECT_CC differently from R600, so we need to move it out of the shared instruction selector.	2012-07-27 17:08:06 +00:00
Eric Anholt	11ff18fcf5	automake: Remove OPT_FLAGS. If you want to change your compiler arguments, just set CFLAGS/CXXFLAGS. Having Mesa have this separate variable is a great way to have your arguments not thoroughly propagated to all compiler invocations. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-26 17:30:06 -07:00
Eric Anholt	87a1c4f233	automake: Remove ARCH_FLAGS. In all current uses, it was appended to CFLAGS, which already had -m32. If you want to do some other flag supplied to compiler invocations, there's CFLAGS/CXXFLAGS. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-26 17:30:06 -07:00
Paul Berry	4df2848786	i965/msaa: use ROUND_DOWN_TO macro. No functional change. This patch modifies brw_blorp_blit.cpp to use the ROUND_DOWN_TO macro instead of open-coded bit manipulations, for clarity. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-26 15:02:10 -07:00
Brian Paul	f37f1a7209	svga: initialize svga_compile_key to zeros to be safe	2012-07-26 16:00:31 -06:00
Brian Paul	dafa77201f	svga: fix invalid memory reference in needs_to_create_zero() The emit->key.fkey info is only valid if we're generating a fragment shader. We should not look at it if we're generating a vertex shader. When generating a vertex shader, the value of emit->key.fkey.num_textures was garbage and the loop over num_textures would read invalid data. At best this would cause us to emit an unused constant. At worst, we could segfault. Just by dumb luck, fkey.num_textures was usually a smallish integer. NOTE: This is a candidate for the 8.0 branch. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-26 16:00:31 -06:00
Brian Paul	38184dcd54	radeon: fix Base/base typo Fixes http://bugs.freedesktop.org/show_bug.cgi?id=52563	2012-07-26 15:57:20 -06:00
Daniel Charles	948c8f502a	android-build: fix dricore build for autogenerated files (v3) Recently more files were removed from control to be auto-generated in the dricore library. Android build was not able to locate the new files if they were not created beforehand. LOCAL_SRC_FILES includes some of those files and Android.gen.mk re-defines this variable by filtering out the auto-generated files. Unfortunately for this variable it is not the same to have the SRCDIR variable defined as the current directory. By re-defining SRCDIR for the autotools build the Android build system is happy again and the new files were actually removed from the sources to use the auto generated versions. Also patch `d5c1801a01` was partially reverted as the files can not be compiled to the LOCAL_PATH, instead they should live on the intermediates folder so that a clean can wipe them out. v3: [chad] Fix the definition of SRCDIR in libdricore/Makefile.am. Signed-off-by: Chad Versace <chad.versace@linux.intel.com> Signed-off-by: Daniel Charles <daniel.charles@intel.com>	2012-07-26 14:51:20 -07:00
Brian Paul	0e893b4261	radeon: set swrast_renderbuffer::ColorType field when mapping renderbuffers Fixes http://bugs.freedesktop.org/show_bug.cgi?id=47375 NOTE: This is a candidate for the 8.0 branch. Tested-by: Barto <mister.freeman@laposte.net>	2012-07-26 13:59:44 -06:00
Brian Paul	a73e9207da	xlib: add X error handler around XGetImage() call XGetImage() will generate a BadMatch error if the source window isn't visible. When that happens, create a new XImage. Fixes piglit 'select' test failures with swrast/xlib driver. NOTE: This is a candidate for the 8.0 branch. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-26 13:59:44 -06:00
Brian Paul	66adc807c4	mesa: remove obsolete matrix comment	2012-07-26 13:59:44 -06:00
Brian Paul	1e37d54d9d	mesa: fix comment typo: s/pointer/point/	2012-07-26 13:59:44 -06:00
Brian Paul	66d9ac5ac7	mesa: remove _math_matrix_alloc_inv() Always allocate space for the inverse matrix in _math_matrix_ctr() since we were always calling _math_matrix_alloc_inv() anyway. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-26 13:59:44 -06:00
Brian Paul	50db812915	mesa: loosen small matrix determinant check When computing a matrix inverse, if the determinant is too small we could hit a divide by zero. There's a check to prevent this (we basically give up on computing the inverse and return the identity matrix.) This patch loosens this test to fix a lighting bug reported by Lars Henning Wendt. v2: use abs(det) to handle negative values NOTE: This is a candidate for the 8.0 branch. Tested-by: Lars Henning Wendt <lars.henning.wendt@gris.tu-darmstadt.de>	2012-07-26 13:59:43 -06:00
Paul Berry	148c8e639d	i965: Use sendc for all render target writes on Gen6+. The sendc instruction causes the fragment shader thread to wait for any dependent threads (i.e. threads rendering to overlapping pixels) to complete before sending the message. We need to use sendc on the first render target write in order to guarantee that fragment shader outputs are written to the render target in the correct order. Previously, we only used the "sendc" instruction when writing to binding table index 0. This did the right thing for fragment shaders, because our fragment shader back-ends always issue their first render target write to binding table index 0. However, it did the wrong thing for blorp, which performs its render target writes to binding table index 1. A more robust solution is to use sendc for all render target writes. This should not produce any performance penalty, since after the first sendc, all of the dependent threads will have completed. For more information about sendc, see the Ivy Bridge PRM, Vol4 Part3 p218 (sendc - Conditional Send Message), and p54 (TDR Registers). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-26 10:49:38 -07:00
Paul Berry	8f37ea414f	i965/msaa: Remove TODO comments that are no longer relevant. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-26 10:49:38 -07:00
Paul Berry	c738ea1191	intel: Make more consistent use of _mesa_is_{user,winsys}_fbo() A lot of code was still differentiating between winsys and user fbos by testing the fbo's name against zero. This converts everything in the i915 and 965 drivers over to use _mesa_is_user_fbo() and _mesa_is_winsys_fbo(). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-26 10:48:36 -07:00
Paul Berry	284ad9c3b2	mesa: Make more consistent use of _mesa_is_{user,winsys}_fbo() A lot of code was still differentiating between winsys and user fbos by testing the fbo's name against zero. This converts everything in core mesa, the state tracker, and src/mesa/program over to use _mesa_is_user_fbo() and _mesa_is_winsys_fbo(). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-26 10:38:05 -07:00
Oliver McFadden	e72f20641a	glsl: warning: pragma `invariant(all)' not supported in GLSL ES 1.00 The OpenGL(R) ES Shading Language Version 1.00 Revision 17 (12 May, 2009) > 4.6.1 The Invariant Qualifier > ... To force all output variables to be invariant, use the pragma > #pragma STDGL invariant(all) Signed-off-by: Oliver McFadden <oliver.mcfadden@linux.intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-26 13:09:15 +03:00
Kenneth Graunke	16cba717c2	shared-glapi: Install libglapi.so.0.0.0 and .0 links in lib/. We already provided these files on 'make install', but only created a 'libglapi.so' in the top-level lib/ convenience folder. We used to create all three, but at some point in the build system churn, it broke. Various applications (like the ES2 conformance suite) seem to link against libglapi.so.0, so without these links, setting LD_LIBRARY_PATH and LIBGL_DRIVERS_PATH can lead to using /usr/lib/libglapi.so.0 with /home/whatever/libGL.so, which leads to API calls getting routed incorrectly (i.e. glCompileShader -> _mesa_LinkProgramARB), which leads to rage problems. Preserve developer sanity...install links. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-25 22:37:24 -07:00
Vinson Lee	4f109ca4e8	scons: Fix build with clang. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-25 17:04:30 -07:00
Eric Anholt	cc44aa7749	i965: Remove unused param conversion code. Ever since ctx->NativeIntegers was set, the conversion flag has been PARAM_NO_CONVERT. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-25 10:29:56 -07:00
Olivier Galibert	fa76d04aea	softpipe: fix copy/paste error in tex sample code Fixes https://bugs.freedesktop.org/show_bug.cgi?id=52369 Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-25 07:47:19 -06:00
Jon TURNEY	f9089f4022	Remove redundant osmesa shared library install from Makefile.old Since osmesa now has been converted to Makefile.am, an appropriate install: rule is generated to install the shared library, so we no longer need to do that in src/mesa/Makefile.old This leaves nothing in src/mesa/Makefile.old but the tags: rule, so move that to Makefile.am and remove Makefile.old Also, nothing now uses OSMESA_LIB_GLOB anymore, so remove it Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-25 12:41:07 +01:00
Jon TURNEY	bd4a3cce96	Update mesa/drivers/x11/Makefile.am for xm_image.h removal Commit `6c6803f28d` removed xm_image.[ch], and removed xm_image.c, but not xm_image.h from the Makefile, this was subsequently carried over into Makefile.am Remove xm_image.h from Makefile.am. This allows 'make dist' to succeed, even if it doesn't do anything useful Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-25 12:40:10 +01:00
Jon TURNEY	9f84d645a4	drivers/osmesa: Link OSMesa using -no-undefined libtool flag "Use -no-undefined to assure libtool that the library has no unresolved symbols at link time, so that libtool will build a shared library on platforms require that all symbols are resolved when the library is linked." Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-25 12:39:42 +01:00
Jon TURNEY	50b13217ba	drivers/X11: Link X11 libGL with -no-undefined libtool flag "Use -no-undefined to assure libtool that the library has no unresolved symbols at link time, so that libtool will build a shared library on platforms require that all symbols are resolved when the library is linked." Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-25 12:38:38 +01:00
Vinson Lee	491d82e9df	Revert "scons: Add instrumentation component libraries to linking on llvm-3.2." This reverts commit `e2e7b467d8`. No longer needed after llvm-3.2svn r160611. Signed-off-by: Vinson Lee <vlee@freedesktop.org>	2012-07-24 22:49:49 -07:00
Paul Berry	497bf5dd2b	i965/msaa: Switch on 8x MSAA for Gen7. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:59 -07:00
Paul Berry	7285612713	i965/msaa: Adjust MCS buffer allocation for 8x MSAA. MCS buffers use 32 bits per pixel in 8x MSAA, and 8 bits per pixel in 4x MSAA. This patch adjusts the format we use to allocate the buffer so that enough memory is set aside for 8x MSAA. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	304be9db14	i965/msaa: Remove assertion in 3DSTATE_SAMPLE_MASK to allow 8x MSAA. The code to emit 3DSTATE_SAMPLE_MASK was already correct for 8x MSAA--this patch just removes an assertion that would have prevented it from being used for 8x MSAA. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	2a9ab29ed9	i965/msaa: Adjust 3DSTATE_MULTISAMPLE packet for 8x MSAA. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	7fae97c98b	i965/blorp: Encode and decode IMS format for 8x MSAA correctly. This patch updates the blorp functions encode_msaa() and decode_msaa() to properly handle the encoding of IMS MSAA buffers when num_samples=8. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	619471dc32	i965/blorp: Compute sample number correctly for 8x MSAA. When operating in persample dispatch mode, the blorp engine would previously assume that subspan N always represented sample N (this is correct assuming 4x MSAA and a 16-wide dispatch). In order to support 8x MSAA, we must compute which sample is associated with each subspan, using the "Starting Sample Pair Index" field in the thread payload. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	082874e389	i965/blorp: Properly adjust primitive size for 8x MSAA. When rendering to an IMS MSAA surface on Gen7, blorp sets up the rendering pipeline as though it were rendering to a single-sampled surface; accordingly it must adjust the size of the primitive it sends down the pipeline to account for the interleaving of samples in an IMS surface. This patch modifies the size adjustment code to properly handle 8x MSAA, which makes room for the extra samples by using an interleaving pattern that is twice as wide as 4x MSAA. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	17eae9762c	i965/blorp: Parameterize manual_blend() by num_samples. This patch adds a num_samples argument to the blorp function manual_blend(), allowing it to be told how many samples need to be blended together. Previously it assumed 4x MSAA, since that was all we supported. We also bump up LOG2_MAX_BLEND_SAMPLES from 2 to 3, so that manual_blend() will be able to handle 8x MSAA. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	4afee38a2f	i965/msaa: Remove comment about falsely claiming to support MSAA. Gen6+ hardware now supports MSAA properly. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:58 -07:00
Paul Berry	ff9313fac7	i965/blorp: Handle DrawBuffers properly. When the client program uses glDrawBuffer() or glDrawBuffers() to select more than one color buffer for drawing into, and then performs a blit, we need to blit into every single enabled draw buffer. +2 oglconforms. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50407 Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	fa1d267beb	i965/blorp: Rearrange order of blit validation and preparation steps. This patch rearranges the order of steps performed by a blorp blit from this: - Sync up state of window system buffers. - Find buffers. - Find miptrees. - Make sure buffer formats match. - Handle mirroring. - Make sure width and height match. - Handle clipping/scissoring. - Account for window system origin conventions. - Do depth resolves, if applicable. - Do the blit. - Record the need for a future HiZ resolve, if applicable. To this: - Sync up state of window system buffers. - Handle mirroring. - Make sure width and height match. - Handle clipping/scissoring. - Account for window system origin conventions. - Find buffers. - Make sure buffer formats match. - Find miptrees. - Do depth resolves, if applicable. - Do the blit. - Record the need for a future HiZ resolve, if applicable. The steps are the same, but they are now performed in an order that will make it possible to implement correct DrawBuffers support. Note that the last four steps are now in a separate function (do_blorp_blit), since they will need to be executed repeatedly when DrawBuffers support is added. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	eac4f1a707	i965/blorp: Don't fall back to swrast when miptrees absent. Previously, the blorp engine would fall back to swrast if the source or destination of a blit had no associated miptree. This was unnecessary, since _mesa_BlitFramebufferEXT() already takes care of making the blit silently succeed if there are no buffers bound, so the fallback paths could never actually happen in practice. Removing these fallback paths will simplify the implementation of correct DrawBuffers support in blorp. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	0dbec6ae07	i965/blorp: Fixup scissoring of blits to window system buffers. This patch modifies the order of operations in the blorp engine so that clipping and scissoring are performed before adjusting the coordinates to account for the difference in origin convention between window system buffers and framebuffer objects. Previously, we would do clipping and scissoring after adjusting for origin conventions, so we would get scissoring wrong in window system buffers. Fixes Piglit test "fbo-scissor-blit window". Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	da54d2e576	i965/blorp: Simplify check that src/dst width/height match. When checking that the source and destination dimensions match, we don't need to store the width and height in variables; doing so just risks confusion since right after the check, we do clipping and scissoring, which may alter the width and height. No functional change. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	bac43b8bb7	i965/msaa: Work around problems with null render targets on Gen6. On Gen6, multisampled null render targets don't seem to work properly--they cause the GPU to hang. So, as a workaround, we render into a dummy color buffer. Fortunately this situation (multisampled rendering without a color buffer) is rare, and we don't have to waste too much memory, because we can give the workaround buffer a very small pitch. Fixes piglit test "EXT_framebuffer_multisample/no-color {2,4} depth-computed *" on Gen6. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	0aeb87023e	i965: Set width, height, and tiling properly for null render targets. The HW docs say that the width and height of null render targets need to match the width and height of the corresponding depth and/or stencil buffers, and that they need to be marked as Y-tiled. Although leaving these values at 0 doesn't seem to cause any ill effects, it seems wise to follow the documented requirements. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	691c55f356	i965/msaa: Control multisampling behaviour via the visual. Previously, we used the number of samples in draw buffer 0 to determine whether to set up the 3D pipeline for multisampling. Using the visual is cleaner, and has the benefit of working properly when there is no color buffer. Fixes all piglit tests "EXT_framebuffer_multisample/no-color" on Gen7. On Gen6, the "depth-computed" variants of these tests still fail; this will be addressed in a later patch. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	48fdfbcb58	msaa: Compute visual samples/sampleBuffers from all buffers. This patch ensures that Visual.samples and Visual.sampleBuffers are set correctly even in the case where there is no color buffer. Previously, these values would retain their default value of 0 in this circumstance, even if the depth or stencil buffer was multisampled. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:56 -07:00
Anthony G. Basile	f35e380dd2	Fix compile time errors when building against uclibc Mesa misses a few checks when compiling on a uclibc system which cause it to fall back on glibc-ism. This patch addresses those issues. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Anthony G. Basile <blueness@gentoo.org>	2012-07-24 13:00:47 -07:00
Jerome Glisse	1ffac44e83	r600g: enable streamout only on 2.14 or later kernel The kernel streamout support was supposed to get into 3.3 along with the tiling change and thus use the same kernel version bump of 2.13 to report to userspace that streamout registers were supported. This is not what happened. Since streamout kernel support did not bump the kernel driver version, rely on the kernel 2.14 version bump to know whether streamout is enabled. This means you need at least a 3.4 kernel. Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-07-24 15:08:31 -04:00
Jordan Justen	881bb4ac72	intel: move error on create context to proper path The error was being set on the non-error path, rather than the error path. NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-24 11:59:19 -07:00
Jordan Justen	01168df4d9	mesa context: generate an error for uninstalled context functions For 'non-legacy' contexts we will want to generate an error if an uninstalled function is called. The effect of this change will be that we can avoid installing legacy functions, and they will then generate an error as needed for deprecated functions in GL >= 3.1. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-24 11:50:35 -07:00
Brian Paul	1f9239ec8d	nouveau: include glformats.h to get missing prototype Fixes http://bugs.freedesktop.org/show_bug.cgi?id=52449	2012-07-24 10:33:20 -06:00
Brian Paul	a271a0c9f6	mesa: improve comment in build_tnl_program()	2012-07-24 09:54:50 -06:00
Brian Paul	8f2a13c5e3	docs: the legacy makefile system is removed in Mesa 8.1	2012-07-24 08:49:02 -06:00
Brian Paul	7e18a039ee	mesa: move _mesa_error_check_format_and_type() to glformats.c Now all the format/type-related helper functions are in glformats.c and image.c is just image-related functions.	2012-07-24 08:37:29 -06:00
Brian Paul	a1287f549a	mesa: move more format helper functions to glformats.c	2012-07-24 08:37:29 -06:00
Brian Paul	8b762ebd72	mesa: move some format helper functions to glformats.c	2012-07-24 08:37:29 -06:00
Christian König	de3335dba8	radeonsi: remove old state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	9b213c871a	radeonsi: move everything else into the new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	53d47889e6	radeonsi: move format handling into si_state.c Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	73dd906ba0	radeonsi: move remaining sampler state into si_state.c Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	ca9cf611b6	radeonsi: move draw state into new handling Split it out into si_state_draw.c Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	0d6b0b512a	radeonsi: move constants to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	baf2039756	radeonsi: move sampler states into new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	3c09f11e5c	radeonsi: move shaders to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	bd2a5cf328	radeonsi: move spi into new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	840f05da6b	radeonsi: move init state to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	e4e6f954ae	radeonsi: move draw_info to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	76660dfcce	radeonsi: move CB_TARGET_MASK into fb/blend state Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	e6937211da	radeonsi: move stencil_ref to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	b41b3eb989	radeonsi: move dsa state to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	bd18a316e1	radeonsi: move inferred fb/rs state to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	f67fae0e43	radeonsi: move rasterizer state into new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	835098a529	radeonsi: move framebuffer to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	7e011d92c9	radeonsi: move viewport to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	43f414f7b7	radeonsi: move scissor state to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	9cbbe0d4e6	radeonsi: move clip state to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	0a091a4824	radeonsi: move blend color to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	63636ae52a	radeonsi: move blender to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	bf7302a6e1	radeonsi: rework state handling v2 Add a complete new state handling for SI. v2: fix spelling error Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Brad King	27382c0f7b	automake: Honor GL_LIB for mangled/custom lib names Commit `2d4b77c7` (automake: Convert src/mesa/drivers/x11/Makefile to automake, 2012-06-12) dropped the old Makefile, which used GL_LIB, and replaced it with a Makefile.am hard-coding the name "GL". This broke handling of --enable-mangling and --with-gl-lib-name options which depend on GL_LIB to specify the GL library name. Use "@GL_LIB@" in src/mesa/drivers/x11/Makefile.am to configure the library name. Also use this approach to simplify src/glx/Makefile.am and drop the HAVE_MANGLED_GL conditional. While at it, fix the compatibility link we create in "lib" for the software-only driver to use version GL_MAJOR instead of hard-coding "1". Reviewed-by: Dan Nicholson <dbn.lists@gmail.com>	2012-07-23 22:34:13 -07:00
Marek Olšák	82fc813ca8	st/mesa: fix DDY opcode for FBOs This fixes piglit/fbo-deriv. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-23 19:23:53 +02:00
Marek Olšák	f40b5723f0	st/mesa: set the centroid qualifier in fragment shader inputs This fixes some centroid tests in the EXT_framebuffer_multisample piglit group. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-23 19:23:53 +02:00
Marek Olšák	162b3ad94d	st/mesa: flush the glBitmap cache before changing framebuffer state This fixes the piglit EXT_framebuffer_multisample/bitmap tests. Note that we must not rely on ctx->DrawBuffer when flushing the cache, because that's already updated with a new framebuffer. We want to draw into the old framebuffer where glBitmap was called. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-23 19:23:53 +02:00
Marek Olšák	07b9b3c37b	st/mesa: set the correct window renderbuffer internal format The multisample-resolve blit relies on this being correct. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-23 19:23:52 +02:00
Marek Olšák	5927227576	mesa: fix format checking when doing a multisample resolve v2: make it more bullet-proof Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-23 19:23:52 +02:00
José Fonseca	c30bf68946	gallivm: Prefer the standard JIT engine whenever possible. Testing shows that the standard JIT engine retrofitted with AVX support is quite stable and as capable of handling AVX instructions as MC-JIT is. And the old JIT is much more memory efficient, as we don't need to allocate one engine instance per shader, as we do for MC-JIT due to its incompleteness. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-23 17:46:38 +01:00
Jerome Glisse	cb149bf9e1	r600g: don't emit forbidden reg with old kernel on evergreen Fix https://bugs.freedesktop.org/show_bug.cgi?id=52313 Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-07-23 11:42:36 -04:00
Jerome Glisse	b7b5a77ec0	r600g: don't emit forbidden register on old kernel Fix https://bugs.freedesktop.org/show_bug.cgi?id=52313 Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-07-23 11:28:25 -04:00
Vincent Lejeune	bc4b4c605c	radeon/llvm: Fix a bug with IF LOGICALNZ with int operand Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-23 15:04:36 +00:00
Tom Stellard	044de40cb0	pipe_loader: Try to connect with the X server before probing pciids v2 When X is running it is necessary for pipe_loader to authenticate with DRM, in order to be able to use the device. This makes it possible to run OpenCL programs while X is running. v2: - Fix C++ style comments - Drop Xlib-xcb dependency - Close the X connection when done - Split auth code into separate function Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-07-23 13:25:36 +00:00
Tom Stellard	17f6c9195f	configure.ac: Add --with-llvm-prefix option This option allows you to specify the llvm install prefix. It is useful for switching between different versions of LLVM.	2012-07-23 13:25:36 +00:00
Kenneth Graunke	c3bc41011f	mesa: Prevent repeated glDeleteShader() from blowing away our refcounts. Calling glDeleteShader() should mark shaders as pending for deletion, but shouldn't decrement the refcount every time. Otherwise, repeated glDeleteShader() is not safe. This is particularly bad since glDeleteProgram() frees shaders: if you first call glDeleteShader() on the shaders attached to the program (thus decrementing the refcount), then call glDeleteProgram(), it would try to free them again (decrementing the refcount another time), causing a refcount > 0 assertion to fail. Similar to commit `d950a778`. NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-22 14:34:44 -07:00
Matt Turner	cfdf60f236	imports.h: Correct ceilf typo. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-22 14:06:08 -07:00
Marek Olšák	f96405f254	st/mesa: remove st_flush_bitmap wrapper just a cleanup	2012-07-22 03:32:55 +02:00
Jordan Justen	749c9060ac	mesa formats: add MESA_FORMAT_ABGR2101010_UINT Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-21 16:49:42 -07:00
Jordan Justen	1c8812c244	mesa formats: unpack ARGB8888/XRGB8888 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-21 16:49:42 -07:00
Jordan Justen	8c265cf5ef	mesa pack: use _mesa_problem instead of assert If the pack type is not supported, use _mesa_problem rather than asserting. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-21 16:49:42 -07:00
Jordan Justen	9ad8f431b2	mesa: add glformats integer type/format detection routines _mesa_is_integer_format is moved to formats.c and renamed as _mesa_is_enum_format_integer. _mesa_is_format_unsigned, _mesa_is_type_integer, _mesa_is_type_unsigned, and _mesa_is_enum_format_or_type_integer are added. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-21 16:49:42 -07:00
Vinson Lee	e2e7b467d8	scons: Add instrumentation component libraries to linking on llvm-3.2. llvm-3.2svn r160587 moved createBoundsCheckingPass from lib/Transforms/Scalar to lib/Transforms/Instrumentation. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-21 10:38:25 -07:00
Matt Turner	d24cf88a1a	Remove unused _mesa_memset16 Unused since commit `fd104a845`. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-21 08:23:38 -07:00
Matt Turner	f58ba6ca91	Remove _mesa_inv_sqrtf in favor of 1/SQRTF Except for a couple of explicit uses, _mesa_inv_sqrtf was disabled since its addition in 2003 (see `f9b1e524`). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-21 08:23:38 -07:00
Matt Turner	948b1c541f	Remove _mesa_sqrt* in favor of plain sqrt Temporarily disabled since 2003 (see `386578c5b`). This saves us from calling sqrt() 128 times to generate the sqrttab in one_time_init(). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-21 08:23:38 -07:00
Matt Turner	ec79138138	Use INV_SQRT instead of 1/SQRTF Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-21 08:23:38 -07:00
José Fonseca	bd9bf7a424	autoconf: Only link mcjit component when available. Should fix build failures with older LLVM versions, but only tested on LLVM 3.1.	2012-07-21 11:43:35 +01:00
Chad Versace	735070c45b	i830: Fix stack corruption Found by compiler warning: i830_texstate.c:131:28: warning: argument to 'sizeof' in 'memset' call is the same expression as the destination; did you mean to dereference it? [-Wsizeof-pointer-memaccess] memset(state, 0, sizeof(state)); ~~~~~ ^~~~~ On 64-bit systems, memset here would write an extra 4 bytes. Note: This is a candidate for the stable branches. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-20 16:01:57 -07:00
José Fonseca	1a8f6ac5a4	mesa: disable MSVC global optimization in pack.c To reduce excessive compilation time in release mode. NOTE: This is a candidate for the 8.0 branch. Tested-by: Brian Paul <brianp@vmware.com>	2012-07-20 16:23:22 -06:00
Brian Paul	9fd4e9e9e6	mesa: whitespace fixes in pbo.c	2012-07-20 16:22:59 -06:00
Brian Paul	ac14f569fe	mesa: update texstore.c comment	2012-07-20 15:13:19 -06:00
Roland Scheidegger	70a969f123	llvmpipe: use runtime loop instead of static loop for looping over quads This can potentially cut shader program size by a factor of 4 for 4-wide execution, or 2 for 8-wide execution, and while these ratios aren't quite reached for more complex shaders, it can be close. Could not really measure a performance difference so far except for trivial shaders (glxgears). There seems to be a fair amount of unnecessary moves generated, especially at the beginning; it might be possible to optimize those away somehow. Things aren't quite as clean, some additional stuff needs to be done for keeping both paths working (though llvm might be able to optimize this away). glxgears seems to lose about 5-10% of performance, looking at the generated shaders this is actually less than I'd think it would be - both 4 and 8-wide shaders, despite containing a loop, actually have about 10% more instructions in total, and will have roughly 50% more executed instructions (though mostly cheap ones). Need to figure out how to reduce overhead... v2: keep complex interpolation for 4-wide mode, adapt to interface changes. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-20 20:17:15 +01:00
Roy Spliet	542bd6941f	nv30: Support negative offsets in indirect constant access. Fixes piglit vp-address-01 amongst several others. Signed-off-by: Roy Spliet <r.spliet@student.tudelft.nl> Reviewed-by: Lucas Stach <dev@lynxeye.de> Tested-by: Lucas Stach <dev@lynxeye.de>	2012-07-20 20:31:40 +02:00
Bryan Cain	248e6f0331	nv50/ir: set position before i instead of i->next in NV50LoweringPreSSA::visit Fixes rendering glitches in Psychonauts such as Raz's eyes flickering white. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=51962.	2012-07-20 20:30:07 +02:00
Eric Anholt	b2a44cde64	i965/gen7: Increase the WM threads to hardware limits. This thread count is only supposed to be enabled when "WIZ Hashing Disable in GT_MODE register enabled." I've always been confused whether that means the bit in the register should be 1 or 0. For my IVB GT2's register 0x7008 value of 0x0, this appears to work fine. Improves l4d2 performance at 640x480 by 0.88 +/- 0.11% (n=88). Improves performance with rasterization at 1280x1024 by 1.45% +/- 0.36% (n=6). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-20 11:05:39 -07:00
Eric Anholt	8ab5842a6d	glsl: Assign locations for uniforms in UBOs using the std140 rules. Fixes piglit layout-std140. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:44:04 -07:00
Eric Anholt	9feb403b0e	glsl: Don't resize arrays in uniform blocks. This is a requirement for std140 uniform blocks, and optional for packed/shared blocks. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:59 -07:00
Eric Anholt	0cea8a56b6	glsl: Don't dead-code eliminate uniforms declared in uniform blocks. This is a requirement for std140 uniform blocks, and optional for packed/shared blocks. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:52 -07:00
Eric Anholt	548bce4733	mesa: Implement the UBO-specific pnames of glGetActiveUniformsiv. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:50 -07:00
Eric Anholt	a74507dc94	glsl: Propagate uniform block information into gl_uniform_storage. Now we can actually return information on uniforms in uniform blocks in the new queries. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:47 -07:00
Eric Anholt	ddc88fbf51	mesa: Add implementation of glGetUniformBlockIndex(). Now that we finally have a list of uniform blocks in the linked shader program, we can tell what their indices are. Fixes piglit GL_ARB_uniform_buffer_object/getuniformblockindex. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:44 -07:00
Eric Anholt	093b20666d	glsl: Set the uniform_block index for the linked shader variables. At this point in the linking, we've totally lost track of the struct gl_uniform_buffer that this pointed to in the original unlinked shader, so we do a nasty n^2 walk to find the new one based on the variable name. Note that these point into the shader's list of gl_uniform_buffers, not the linked program's. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:42 -07:00
Eric Anholt	9f1a4a6340	mesa: Add support for glGetActiveUniformsiv on non-UBO pnames. We'll need to propagate the UBO fields to the uniform storage records before we can handle the other pnames. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:40 -07:00
Eric Anholt	acfbdfcbc8	mesa: Add support for glGetUniformIndices(). This is a single entrypoint that maps from a series of names to the indices of those names within the active uniforms list. Each index is like glGetUniformLocation()'s return value, except that it doesn't encode an array offset. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:35 -07:00
Eric Anholt	abcdbdf9cc	mesa: Move the _mesa_uniform_merge_location_offset to glGetUniformLocation(). With the upcoming GL_ARB_uniform_buffer_object changes, the only other caller that will want the cooked value is state_tracker. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:33 -07:00
Eric Anholt	f609cf782a	glsl: Merge the lists of uniform blocks into the linked shader program. This attempts error-checking, but the layout isn't done yet. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:28 -07:00
Eric Anholt	b3c093c79c	glsl: Translate the AST for uniform blocks into some IR structures. We're going to need this structure to cross-validate the uniform blocks between shader stages, since unused ir_variables might get dropped. It's also the place we store the RowMajor qualifier, which is not part of the GLSL type (since that would cause a bunch of type equality checks to fail). Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:19 -07:00
Eric Anholt	f7561e8ecd	glsl: Turn UBO variable declarations into ir_variables and check qualifiers. Fixes piglit layout--non-uniform and layout--within-block. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:12 -07:00
Lucas Stach	cdad337fec	st/xorg: fix masked transformations Someone tried to be clever and "optimized" add_vertex_data2() to just use two points for the texture coordinates and then reuse individual components. Sadly this is not how matrix multiplication works. Fixes rendercheck -t tmcoords Signed-off-by: Lucas Stach <dev@lynxeye.de>	2012-07-20 18:47:54 +02:00
Paul Berry	60c3e69dbf	i965/blorp: Use IMS layout when texturing from depth/stencil surfaces. Previously, on Gen7, when texturing from a depth or stencil surface, the blorp engine would configure the 3D pipeline as though the input surface was non-multisampled, and perform the necessary coordinate transformations in the fragment shader to account for the IMS layout. This meant outputting a lot of extra fragment shader code, and it raised some uncertainty about how to deal with very large surfaces. This patch modifies blorp to configure the 3D pipeline properly for IMS layout when reading from depth and stencil surfaces. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:38 -07:00
Paul Berry	0dd5e98aa5	i965/blorp: Loosen assertions in compute_msaa_layout_for_pipeline. Previously, on Gen7, compute_msaa_layout_for_pipeline() would verify that IMS layout is not used. However, now that we configure SURFACE_STATE correctly for IMS surfaces, IMS layout is available. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:38 -07:00
Paul Berry	989218b980	i965/blorp: Configure SURFACE_STATE correctly for IMS surfaces. This patch modifies gen7_set_surface_num_multisamples() to set up the SURFACE_STATE appropriately for texturing from IMS format MSAA surfaces (which are only used on Gen7 for depth and stencil buffers). Since the function now sets more than just the number of multisamples, it's been renamed to gen7_set_surface_msaa(). This will make it possible to remove some kludginess from the blorp engine. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:38 -07:00
Paul Berry	f91b4d92b9	i965/blorp: Optimize manual_blend() for compressed multisampled surfaces. When downsampling a compressed multisampled surface, we can take a shortcut to downsample any pixels that were completely covered by a single primitive. In this case, the first color value we fetch is the correct final color for the downsampled pixel, so we can skip the rest of the blending operation. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:37 -07:00
Paul Berry	e5d983267a	i965/blorp: Fix integer downsampling on Gen7. When downsampling an integer-format buffer on Gen7, we need to use the "avg" instruction rather than the "add" instruction, to ensure that we don't overflow the range of 32-bit integers. Also, we need to use the proper register type (BRW_REGISTER_TYPE_D or BRW_REGISTER_TYPE_UD) for intermediate color data and for writing to the render target. Note: this patch causes blorp to use the proper register type for all operations (downsampling, upsampling, and ordinary blits). Strictly speaking, this is only necessary for downsampling, because the other operations exclusively use MOV instructions on the color data. But it's simpler to use the proper register type in all cases. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:37 -07:00
Paul Berry	b961d37e61	i965/blorp: Modify manual_blend() to avoid unnecessary loss of precision. When downsampling from an MSAA image to a single-sampled image, it is inevitable that some loss of numerical precision will occur, since we have to use 32-bit floating point registers to hold the intermediate results while blending. However, it seems reasonable to expect that when all samples corresponding to a given pixel have the exact same color value, there will be no loss of precision. Previously, we averaged samples as follows: blend = (((sample[0] + sample[1]) + sample[2]) + sample[3]) / 4 This had the potential to lose numerical precision when all samples have the same color value, since ((sample[0] + sample[1]) + sample[2]) may not be precisely representable as a 32-bit float, even if the individual samples are. This patch changes the formula to: blend = ((sample[0] + sample[1]) + (sample[2] + sample[3])) / 4 This avoids any loss of precision in the event that all samples are the same, by ensuring that each addition operation adds two equal values. As a side benefit, this puts the formula in the form we will need in order to implement correct blending of integer formats. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:37 -07:00
Paul Berry	6a27506181	i965: Add support for AVG instruction. From the Ivy Bridge PRM, Vol4 Part3 p152: "The avg instruction performs component-wise integer average of src0 and src1 and stores the results in dst. An integer average uses integer upward rounding. It is equivalent to increment one to the addition of src0 and src1 and then apply an arithmetic right shift to this intermediate value." Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:37 -07:00
Paul Berry	9544e44262	i965: Replace fs_visitor::kill_emitted with gl_fragment_program::UsesKill. The kill_emitted variable was duplicating the functionality of gl_fragment_program::UsesKill. There's no need for both. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-20 09:33:07 -07:00
Paul Berry	0f1f2ff8db	mesa: Set gl_fragment_program::UsesKill in do_set_program_inouts. Previously, the code for setting this flag for GLSL programs was duplicated in three places: brw_link_shader(), glsl_to_tgsi_visitor, and ir_to_mesa_visitor. In addition to the unnecessary duplication, there was a performance problem on i965: brw_link_shader() set the flag before doing its final round of optimizations, which meant that if the optimizations managed to eliminate all the discard operations, the flag would still be set, resulting (at least in theory) in slower performance. This patch consolidates all of the code that sets UsesKill for GLSL programs into do_set_program_inouts(), which already is doing a similar job for UsesDFdy, and which occurs after i965's final round of optimizations. Non-GLSL programs (ARB programs and the state tracker's glBitmap program) are unaffected. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-20 09:33:07 -07:00
Kristian Høgsberg	a8c092266e	gallium-egl: Move wayland query_buffer implementation Move it to native_wayland_drm_bufmgr_helper.c which only gets compiled when wayland is enabled and which already includes the right headers. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-19 16:11:06 -04:00
Olivier Galibert	fbe3fa74e5	softpipe: Fix segfault with fbo-cubemap. The cube sampler generates two-dimensional texture coordinates and hence passes NULL for the array for the third one. The actual 2D sampler, lower in the pipe, knew not to use that array since it didn't need it. But the samplers have become single-texel and the coordinate array dereference has been moved up one step, to a level where the code does not know only two coordinates are used. Hence the segfault. The simplest fix by far is to add a third dummy coordinate array in the call to the next pipe step, which will be dereferenced to a harmless 0 which then will be happily ignored by the sampler. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=52250 Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-19 13:19:14 -06:00
Kristian Høgsberg	d7522ed130	wayland: Support EGL_WIDTH and EGL_HEIGHT queries for wl_buffer We're going to make the public wl_buffer struct as small as possible. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-19 14:03:17 -04:00
Kristian Høgsberg	e23bfdb329	wayland: Use existing EGL_TEXTURE_FORMAT for querying wl_buffer texture format We also reuse EGL_TEXTURE_RGBA and EGL_TEXTURE_RGB, adding only the new planar YUV texture formats: EGL_TEXTURE_Y_U_V_WL, EGL_TEXTURE_Y_UV_WL and EGL_TEXTURE_Y_XUXV_WL. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-19 14:03:17 -04:00
Kristian Høgsberg	e1b45a3c06	gallium-egl: Implement eglQueryWaylandBufferWL Support this query for gallium EGL too. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-19 14:03:17 -04:00
Kenneth Graunke	d43f4181e1	glsl: Remove open coded version of ir_variable::interpolation_string(). Presumably the function didn't exist when we wrote this code. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-19 11:00:00 -07:00
Paul Berry	d08fdacd58	i965: Avoid unnecessary recompiles for shaders that don't use dFdy(). The i965 back-end needs to compile dFdy() differently for FBOs and window system framebuffers, because Y coordinates are flipped between the two (see commit `82d2596`: i965: Compute dFdy() correctly for FBOs). This patch avoids unnecessarily recompiling shaders that don't use dFdy(), by only setting render_to_fbo in the wm program key if the shader actually uses dFdy(). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-19 10:02:25 -07:00
Paul Berry	ce1d2f08f9	glsl: Set UsesDFdy appropriately for GLSL shaders. This patch updates the ir_set_program_inouts_visitor so that it also sets gl_fragment_program::UsesDFdy. This is a bit of a hack (since dFdy() isn't an input or an output), but there's no other obvious visitor to squeeze this functionality into, and it would be silly to create a brand new visitor just for this purpose. v2: use local 'fprog' var to avoid repeated casting. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-19 10:02:21 -07:00
Paul Berry	a0f7b86959	mesa: Set UsesDFdy appropriately for assembly programs. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-19 10:02:19 -07:00
Paul Berry	5e310e9f83	mesa: Add UsesDFdy to struct gl_fragment_program. The i965 back-end needs to compile dFdy() differently for FBOs and window system framebuffers, because Y coordinates are flipped between the two (see commit `82d2596`: i965: Compute dFdy() correctly for FBOs). This boolean will allow it to avoid unnecessarily recompiling shaders that don't use dFdy(). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-19 10:02:01 -07:00
Kenneth Graunke	658a63e5d9	drirc: Add disable_blend_func_extended workaround for Unigine OilRush. The previous commit implemented the workaround, cited a bug report about OilRush, but actually only enabled the workaround for the demos. Turn it on for OilRush too. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50291 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-19 01:40:24 -07:00
Kenneth Graunke	040894391a	i965: Add a driconf option to disable GL_ARB_blend_func_extended. Unigine Heaven (at least) has a bug where it incorrectly uses the GL_ARB_blend_func_extended extension. Dual source blending allows two color outputs per render target; individual shader outputs can be assigned to be either the first or second blending input by setting the 'index' via one of two methods: - An API call: glBindFragDataLocationIndexed() - The GLSL 'layout' qualifier provided by GL_ARB_explicit_attrib_location Both of these only work on user defined fragment shader outputs; it's an error to use either on built-in outputs like gl_FragData. Unigine uses gl_FragData and gl_FragColor exclusively, and doesn't even attempt to use either method to set index == 1. However, it does set the blending function to SRC1 enums, which requires a fragment shader output with index == 1 or else rendering is undefined. In other words, enabling ARB_blend_func_extended causes Unigine to render incorrectly, resulting in an apparent regression, even though our driver code (as far as I can tell) is perfectly fine. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50291 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-19 01:22:34 -07:00
Brian Paul	768be75c44	mesa: remove stale comment	2012-07-18 16:51:47 -06:00
Brian Paul	e4f8d33aea	mesa: use gl_program cast wrappers In a few cases, remove unneeded casts. And fix a few other const-correctness issues. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 16:51:47 -06:00
Brian Paul	1170b5aa9f	mesa: add some gl_program cast wrappers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 16:51:47 -06:00
Marek Olšák	c3c83af380	r600g: setup streamout before calling last r600_need_cs_space before drawing This fixes CS checker errors due to registers not being initialized, because the flush occurred after dirty state was emitted but before drawing.	2012-07-18 22:42:58 +02:00
Eric Anholt	a40c1f9522	i965/fs: Make register spill/unspill only do the regs for that instruction. Previously, if we were spilling the result of a texture call, we would store all 4 regs, then for each use of one of those regs as the source of an instruction, we would unspill all 4 regs even though only one was needed. In both lightsmark and l4d2 with my current graphics config, the shaders that produce spilling do so on split GRFs, so this doesn't help them out. However, in a capture of the l4d2 shaders with a different snapshot and playing the game instead of using a demo, it reduced one shader from 2817 instructions to 2179, due to choosing a now-cheaper texture result to spill instead of piles of texcoords. v2: Fix comment noted by Ken, and fix the if condition associated with it for the current state of what constitutes a partial write of the destination. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1)	2012-07-18 12:30:06 -07:00
Eric Anholt	a454f8ec6d	i965/fs.h: Refactor tests for instructions modifying a register. There's one instance of a potential behavior change: propagate_constants may now propagate into a part of a vgrf after a different part of it was overwritten by a send that returns multiple registers. I don't think we ever generate IR that meets that condition, but it's something to note if we bisect behavior change to this. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 12:30:06 -07:00
Eric Anholt	fc01376c50	i965/fs: Replace usage of is_tex() with regs_written() checks. In these places, we care about any sort of send that hits more than one reg, not just textures. We don't yet have anything else returning more than one reg, so there's no change. v2: Use mlen instead of is_tex() for the is-it-a-send check. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 12:30:06 -07:00
Eric Anholt	a6411520b4	i965/fs: Rename virtual_grf_next to virtual_grf_count. "count" is a more useful name, since most of the time we're using it for looping over the variables. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 12:30:06 -07:00
Eric Anholt	40cd60a315	i965/fs: Move a block out of a loop in live variables setup. This was accidentally copy-and-pasted inside. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 12:30:06 -07:00
Anuj Phogat	cd5cd85a43	i965/msaa: Disable alpha-to-{coverage, one} when drawbuffer zero is in integer format OpenGL specification 3.3 (page 196), section 4.1.3 says: "If drawbuffer zero is not NONE and the buffer it references has an integer format, the SAMPLE_ALPHA_TO_COVERAGE and SAMPLE_ALPHA_TO_ONE operations are skipped." This should work properly even if there are other draw buffers that are not in integer format. This patch makes the following piglit tests pass on mesa: int-draw-buffers-alpha-to-coverage int-draw-buffers-alpha-to-one Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-18 11:54:12 -07:00
Lucas Stach	fb18ec4f27	st/xorg: attach EDID to outputs Allows tools like GNOME's monitor configuration to show meaningful names. v2: fix resource leak Signed-off-by: Lucas Stach <dev@lynxeye.de>	2012-07-18 17:19:16 +02:00
Lucas Stach	9de16ac0a8	st/xorg: remove superfluous memset exaDriverAlloc() uses calloc, which already initialises pExa to zero. Signed-off-by: Lucas Stach <dev@lynxeye.de>	2012-07-18 17:19:07 +02:00
Lucas Stach	70f0eda127	st/xorg: reorder exa context creation and use screen param queries Gives the x-server a more accurate description of the exa hardware capabilities. v2: drop NPOT check Signed-off-by: Lucas Stach <dev@lynxeye.de>	2012-07-18 17:18:55 +02:00
Olivier Galibert	229a1a7e4d	softpipe: Take all lods into account when texture sampling. This patch churns a lot because it needs to change 4-wide filters into single pixel filters, since each fragment may use a different filter. The only case not entirely supported is the anisotropic filtering. Not sure what we want to do there, since a full quad is required by that filter. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-18 08:02:39 -06:00
Marek Olšák	99c65bac34	r600g: implement wait-free buffer transfer for DISCARD_RANGE Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2012-07-18 07:16:30 +02:00
Marek Olšák	8ac9801669	r600g: accelerate buffer copying This will be useful for efficient handling of the DISCARD transfer flags. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2012-07-18 06:32:57 +02:00
Marek Olšák	f237fd431b	r600g: update R600_MAX_DRAW_CS_DWORDS to take draw-opaque into account	2012-07-18 06:25:37 +02:00
Marek Olšák	30257c3291	r600g: move VGT_STRMOUT_DRAW_OPAQUE_OFFSET initialization into invariant state	2012-07-18 06:25:37 +02:00
Marek Olšák	d9ba1b0beb	r600g: only set the index type if drawing is indexed	2012-07-18 06:25:37 +02:00
Marek Olšák	1cfb55c509	r600g: remove debug code for streamout	2012-07-18 06:25:37 +02:00
Marek Olšák	ff9a49328e	r600g: inline r600_context_draw_opaque_count	2012-07-18 06:25:37 +02:00
Marek Olšák	1b699a4832	r600g: fix alphatest without a colorbuffer on evergreen	2012-07-18 06:25:36 +02:00
Marek Olšák	82a1d24175	r600g: fix alphatest without a colorbuffer on r6xx-r7xx	2012-07-18 04:35:38 +02:00
Marek Olšák	de4fd087cb	r600g: always derive alphatest state from the first colorbuffer	2012-07-18 04:17:11 +02:00
Marek Olšák	bc2f5fc01e	r600g: atomize alphatest state	2012-07-18 03:45:25 +02:00
Marek Olšák	5130196c0b	r600g: try to fix line stippling with lineloops The piglit test is failing, but visually it looks almost correct.	2012-07-18 02:17:10 +02:00
Marek Olšák	43e226b6ef	r600g: optimize uploading depth textures Make it only copy the portion of a depth texture being uploaded and not the whole 2D layer. There is also a little code cleanup.	2012-07-18 00:32:50 +02:00
Marek Olšák	b242adbe5c	r600g: remove needless wrapper r600_texture_depth_flush	2012-07-18 00:21:53 +02:00
Marek Olšák	611dd52942	r600g: init_flushed_depth_texture should be able to report errors	2012-07-18 00:21:53 +02:00
Paul Berry	e9b908b014	msaa: Generate proper error for operations prohibited on MSAA buffers. From the GL 3.0 spec, section 4.3.3, in the documentation for CopyPixels(): "An INVALID_OPERATION error will be generated if the object bound to READ_FRAMEBUFFER_BINDING is framebuffer complete and the value of SAMPLE_BUFFERS is greater than zero." The same applies to CopyTexImage...() and CopyTexSubImage...() functions, since they are defined in terms of CopyPixels(). Previously we were generating an INVALID_FRAMEBUFFER_OPERATION error in these cases. Fixes piglit tests "EXT_framebuffer_multisample/negative-{copypixels,copyteximage}". Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 14:40:01 -07:00
Brian Paul	c4d2a14d6e	gallivm: silence uninitialized variable warnings	2012-07-17 14:41:29 -06:00
Marek Olšák	9d699cd845	r600g: fix lockups with and enable dual source blending on evergreen GL_ARB_blend_func_extended is now enabled on all chipsets.	2012-07-17 21:22:15 +02:00
Marek Olšák	c26fadf195	r600g: remove unused code after conversion of sampler views	2012-07-17 21:22:15 +02:00
Marek Olšák	5d8d4252f2	r600g: convert sampler view emission into atoms Vertex and constant buffers are emitted in the same way. This is mainly a simplification of the code. The cleanup is in another patch.	2012-07-17 21:22:15 +02:00
Marek Olšák	7022f49b52	r600g: only make constant buffers dirty if there's something to update	2012-07-17 21:22:15 +02:00
Marek Olšák	80755ff563	r600g: properly track which textures are depth This fixes the issue with have_depth_texture never being set to false.	2012-07-17 21:22:15 +02:00
Marek Olšák	e5de73cafd	r600g: consolidate and optimize sampler states changes for evergreen Only set sampler states which changed.	2012-07-17 21:22:14 +02:00
Marek Olšák	883c43cdd4	r600g: don't invalidate texture caches when setting sampler states Changing sampler states doesn't change resource bindings.	2012-07-17 21:22:14 +02:00
Marek Olšák	ba48f47ebf	r600g: consolidate code for setting sampler views and fix bugs in the process Issues fixed: - set_vs_sampler_views for evergreen is now properly implemented. - Added the missing inval_texture_cache call for evergreen. - have_depth_texture was sometimes incorrectly set to false on evergreen even if there were depth textures in other shader stages. To fix this, set it to true once and never set it to false again. It's stupid, but it matches the r600 code. The proper fix is left to another patch. - Optimization: The sampler views which aren't changed aren't updated.	2012-07-17 21:22:14 +02:00
Marek Olšák	d1ca16b273	r600g: remove unused flag have_depth_fb This is a leftover from: commit `fe1fd67556` Author: Marek Olšák <maraeo@gmail.com> Date: Sun Jul 8 03:10:37 2012 +0200 r600g: don't flush depth textures set as colorbuffers	2012-07-17 21:22:14 +02:00
Marek Olšák	585baac652	r600g: do fine-grained vertex buffer updates If only some buffers are changed, the other ones don't have to be re-emitted. This uses bitmasks of enabled and dirty buffers just like emit_constant_buffers does.	2012-07-17 21:22:14 +02:00
Marek Olšák	f4f2e8ebe1	r600g: don't call inval_shader_cache in r600_context_flush twice It's already called in r600_constant_buffers_dirty.	2012-07-17 21:22:14 +02:00
Marek Olšák	6694a68d89	gallium/util: add util_bit_last - finds the last bit set in a word	2012-07-17 21:22:14 +02:00
Marek Olšák	018e3f75d6	r600g: fix all failing depth-stencil tests for evergreen	2012-07-17 21:22:14 +02:00
Michel Dänzer	761131ce45	configure.ac: Further LLVM fixups. * Also add mcjit in the non-OpenCL case. * Replace hardcoded llvm-config with $LLVM_CONFIG everywhere. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-17 19:12:01 +02:00
Michel Dänzer	39c4bc7fdf	glsl: Drop obsolete .gitignore entries. Helps spotting and removing the obsolete generated files, which otherwise break the build. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2012-07-17 18:30:32 +02:00
Tom Stellard	ed41a559dc	configure.ac: Add libLLVMMCJIT to the LLVM_LDFLAGS This is necessary for linking the llvmpipe tests. It appears this dependency was introduced by the "wider native register" changes. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-07-17 12:08:24 -04:00
Eric Anholt	fadc9eaf97	intel: Add a comment explaining why we early return on matching BO names. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:18:08 -07:00
Eric Anholt	2b311fd802	intel: Drop other checks for old loader version. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:18:06 -07:00
Eric Anholt	1b4374d364	intel: Replace the non-getBuffersWithFormat compat path with an error message. It's been broken (using NULL getBuffersWithFormat() instead of getBuffers()) due to a copy and paste error for a year now. GetBuffersWithFormat has been around since 2009, so I don't feel any guilt in not supporting it. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:18:04 -07:00
Eric Anholt	9bbf7c139b	intel: Remove dead intel_framebuffer_has_hiz(). Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:18:02 -07:00
Eric Anholt	bce58e155d	intel: Convert to using private depth/stencil buffers (v2) This means that GLX buffer sharing of these no longer works. On the other hand, just look at this code reduction. v2: - [chad] Fix intelCreateBuffer for gen < 6. When the branch for !screen->hw_has_separate_stencil was taken, intel_create_private_renderbuffer was incorrectly not used. - [chad] Remove all code in intel_process_dri2_buffer for processing depth, stencil, and hiz buffers. That code is now dead. CC: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:17:56 -07:00
Eric Anholt	433ff3e16e	intel: Add a function for creating a private window system buffer. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:17:38 -07:00
Roland Scheidegger	bf484024b9	gallivm: (trivial) remove unnecessary bogus include	2012-07-17 17:11:18 +02:00
Kristian Høgsberg	2023bf996e	gbm: Add gbm_bo_import for gallium gbm backend	2012-07-17 10:54:00 -04:00
Elvis Lee	1f2c87cc8f	st/egl: Fix build for wayland includes common/native_wayland_drm_bufmgr_helper.c fails to find wayland-server.h Signed-off-by: Elvis Lee <kwangwoong.lee@lge.com>	2012-07-17 10:54:00 -04:00
Elvis Lee	23f1e551cc	st/gbm: renaming pitch to stride on gallium commit '7250cd506baa0bd4649b30d87509cdd0cbc06a57' changes struct gbm_bo, renaming its 'pitch' to 'stride'. This applies to Gallium. Signed-off-by: Elvis Lee <kwangwoong.lee@lge.com>	2012-07-17 10:54:00 -04:00
Matt Turner	f42e601ce0	glx: build tests after libglx.la Previously, if you ran make followed by make check it would work, but if you just ran make check the test program would fail to compile. Reviewed-by: Jon TURNEY <jon.turney@dronecode.org.uk>	2012-07-17 06:59:00 -07:00
José Fonseca	3469715a8a	gallivm,draw,llvmpipe: Support wider native registers. Squashed commit of the following: commit 7acb7b4f60dc505af3dd00dcff744f80315d5b0e Author: José Fonseca <jfonseca@vmware.com> Date: Mon Jul 9 17:46:31 2012 +0100 draw: Don't use dynamically sized arrays. Not supported by MSVC. commit 5810c28c83647612cb372d1e763fd9d7780df3cb Author: José Fonseca <jfonseca@vmware.com> Date: Mon Jul 9 17:44:16 2012 +0100 gallivm,llvmpipe: Don't use expressions with PIPE_ALIGN_VAR(). MSVC doesn't accept expressions in _declspec(align(...)). Use a define instead. commit 8aafd1457ba572a02b289b3f3411e99a3c056072 Author: José Fonseca <jfonseca@vmware.com> Date: Mon Jul 9 17:41:56 2012 +0100 gallium/util: Make u_cpu_detect.h header C++ safe. commit 5795248350771f899cfbfc1a3a58f1835eb2671d Author: José Fonseca <jfonseca@vmware.com> Date: Mon Jul 2 12:08:01 2012 +0100 gallium/util: Add ULL suffix to large constants. As suggested by Andy Furniss: it looks like some old gcc versions require it. commit 4c66c22727eff92226544c7d43c4eb94de359e10 Author: José Fonseca <jfonseca@vmware.com> Date: Fri Jun 29 13:39:07 2012 +0100 gallium/util: Truly disable INF/NAN tests on MSVC. Thanks to Brian for spotting this. commit 8bce274c7fad578d7eb656d9a1413f5c0844c94e Author: José Fonseca <jfonseca@vmware.com> Date: Fri Jun 29 13:39:07 2012 +0100 gallium/util: Disable INF/NAN tests on MSVC. Somehow they are not recognized as constants. commit 6868649cff8d7fd2e2579c28d0b74ef6dd4f9716 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Jul 5 15:05:24 2012 +0200 gallivm: Cleanup the 2 x 8 float -> 16 ub special path in lp_build_conv. No behaviour change intended, like 7b98455fb40c2df84cfd3cdb1eb7650f67c8a751. 
Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 5147a0949c4407e8bce9e41d9859314b4a9ccf77 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Jul 5 14:28:19 2012 +0200 gallivm: (trivial) fix issues with multiple-of-4 texture fetch Some formats can't handle non-multiple of 4 fetches I believe, but everything must support length 1 and multiples of 4. So avoid going to scalar fetch (which is very costly) just because length isn't 4. Also extend the hack to not use shift with variable count for yuv formats to arbitrary length (larger than 1) - doesn't matter how many elements we have we always want to avoid it unless we have variable shift count instruction (which we should get with avx2). Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 87ebcb1bd71fa4c739451ec8ca89a7f29b168c08 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Jul 4 02:09:55 2012 +0200 gallivm: (trivial) fix typo for wrap repeat mode in linear filtering aos code This would lead to bogus coordinates at the edges. (undetected by piglit because this path is only taken for block-based formats). Signed-off-by: José Fonseca <jfonseca@vmware.com> commit 3a42717101b1619874c8932a580c0b9e6896b557 Author: José Fonseca <jfonseca@vmware.com> Date: Tue Jul 3 19:42:49 2012 +0100 gallivm: Fix TGSI integer translation with AVX. commit d71ff104085c196b16426081098fb0bde128ce4f Author: José Fonseca <jfonseca@vmware.com> Date: Fri Jun 29 15:17:41 2012 +0100 llvmpipe: Fix LLVM JIT linear path. It was not working properly because it was looking at the JIT function before it was actually compiled. Reviewed-by: Roland Scheidegger <sroland@vmware.com> commit a94df0386213e1f5f9a6ed470c535f9688ec0a1b Author: José Fonseca <jfonseca@vmware.com> Date: Thu Jun 28 18:07:10 2012 +0100 gallivm: Refactor lp_build_broadcast(_scalar) to share code. Doesn't really change the generated assembly, but produces more compact IR, and of course, makes code more consistent. 
Reviewed-by: Brian Paul <brianp@vmware.com> commit 66712ba2731fc029fa246d4fc477d61ab785edb5 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Jun 27 17:30:13 2012 +0100 gallivm: Make LLVMContextRef a singleton. There are many places inside LLVM that depend on it. Too many to attempt to fix. Reviewed-by: Brian Paul <brianp@vmware.com> commit ff5fb7897495ac263f0b069370fab701b70dccef Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Jun 28 18:15:27 2012 +0200 gallivm: don't use 8-wide texture fetch in aos path This appears to be a slight loss usually. There are probably several reasons for that: - fetching itself is scalar - filtering is pure int code hence needs splitting anyway, same for the final texel offset calculations - texture wrap related code, which can be done 8-wide, is slightly more complex with floats (with clamp_to_edge) and float operations generally more costly hence probably not much faster overall - the code needed to split when encountering different mip levels for the quads, adding complexity So, just split always for aos path (but leave it 8-wide for soa, since we do 8-wide filtering there when possible). This should certainly be revisited if we'd have avx2 support. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit ce8032b43dcd8e8d816cbab6428f54b0798f945d Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Jun 27 18:41:19 2012 +0200 gallivm: (trivial) don't extract fparts variable if not needed Did not have any consequences but unnecessary. commit aaa9aaed8f80dc282492f62aa583a7ee23a4c6d5 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Jun 27 18:09:06 2012 +0200 gallivm: fix precision issue in aos linear int wrap code now not just passes at a quick glance but also with piglit... If we do the wrapping with floats, we also need to set the weights accordingly. 
We can potentially end up with different (integer) coordinates than what the integer calculations would have chosen, which means the integer weights calculated previously in this case are completely wrong. Well at least that's what I think happens, at least recalculating the weights helps. (Some day really should refactor all the wrapping, so we do whatever is fastest independent of 16bit int aos or 32bit float soa filtering.) Reviewed-by: José Fonseca <jfonseca@vmware.com> commit fd6f18588ced7ac8e081892f3bab2916623ad7a2 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Jun 27 11:15:53 2012 +0100 gallium/util: Fix parsing of options with underscore. For example GALLIVM_DEBUG=no_brilinear which was being parsed as two options, "no" and "brilinear". commit 09a8f809088178a03e49e409fa18f1ac89561837 Author: James Benton <jbenton@vmware.com> Date: Tue Jun 26 15:00:14 2012 +0100 gallivm: Added a generic lp_build_print_value which prints a LLVMValueRef. Updated lp_build_printf to share common code. Removed specific lp_build_print_vecX. Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> commit e59bdcc2c075931bfba2a84967a5ecd1dedd6eb0 Author: José Fonseca <jfonseca@vmware.com> Date: Wed May 16 15:00:23 2012 +0100 draw,llvmpipe: Avoid named struct types on LLVM 3.0 and later. Starting with LLVM 3.0, named structures are meant not for debugging, but for recursive data types, previously also known as opaque types. The recursive nature of these types leads to several memory management difficulties. Given that we don't actually need recursive types, avoid them altogether. This is an attempt to address fdo bugs 41791 and 44466. The issue is somewhat random so there's no easy way to check how effective this is. 
Cherry-picked from `9af1ba565d` commit df6070f618a203c7a876d984c847cde4cbc26bdb Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Jun 27 14:42:53 2012 +0200 gallivm: (trivial) fix typo in faster aos linear int wrap code no longer crashes, now REALLY tested. commit d8f98dce452c867214e6782e86dc08562643c862 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 26 18:20:58 2012 +0200 llvmpipe: (trivial) remove bogus optimization for float aos repeat wrap This optimization for nearest filtering on the linear path generated likely bogus results, and the int path didn't have any optimizations there since the only shader using force_nearest apparently uses clamp_to_edge not repeat wrap anyway. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit c4e271a0631087c795e756a5bb6b046043b5099d Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 26 23:01:52 2012 +0200 gallivm: faster repeat wrap for linear aos path too Even if we already have scaled integer coords, it's way faster to use the original float coord (plus some conversions) rather than use URem. The choice of what to do for texture wrapping is not really tied to int aos or float soa filtering though for some modes there can be some gains (because of easier weight calculations). Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 1174a75b1806e92aee4264ffe0ffe7e70abbbfa3 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 26 14:39:22 2012 +0200 gallivm: improve npot tex wrap repeat in linear soa path URem gets translated into series of scalar divisions so just about anything else is faster. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit f849ffaa499ed96fa0efd3594fce255c7f22891b Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 26 00:40:35 2012 +0100 gallivm: (trivial) fix near-invisible shift-space typo I blame the keyboard. 
commit 5298a0b19fe672aebeb70964c0797d5921b51cf0 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 25 16:24:28 2012 +0200 gallivm: add new intrinsic helper to deal with arbitrary vector length This helper will split vectors which are too large for the hw, or expand them if they are too small, so a caller of a function using intrinsics which uses such sizes need not split (or expand) the vectors manually and the function will still use the intrinsic instead of dropping back to generic llvm code. It can also accept scalars for use with pseudo-vector intrinsics (only useful for float arguments, all x86 scalar simd float intrinsics use 4vf32). Only used for lp_build_min/max() for now (also added the scalar float case for these while there). (Other basic binary functions could use it easily, whereas functions with a different interface would need different helpers.) Expanding vectors isn't widely used, because we always try to use build contexts with native hw vector sizes. But it might (or not) be nicer if this wouldn't need to be done, the generated code should in theory stay the same (it does get hit by lp_build_rho though already since we didn't have an intrinsic for the scalar lp_build_max case before). v2: incorporated Brian's feedback, and also made the scalar min/max case work instead of crash (all scalar simd float intrinsics take 4vf32 as argument, probably the reason why it wasn't used before). Moved to lp_bld_intr based on José's request, and passing intrinsic size instead of length. Ideally we'd derive the source type info from the passed in llvm value refs and process some llvmtype return type so we could handle intrinsics where the source and destination type isn't the same (like float/int conversions, packing instructions) but that's a bit too complicated for now. 
Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 01aa760b99ec0b2dc8ce57a43650e83f8c1becdf Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 25 16:19:18 2012 +0200 gallivm: (trivial) increase max code size for shader disassembly 64kB was just short of what I needed (which caused a crash) hence increase to 96kB (should probably be smarter about that). commit 74aa739138d981311ce13076388382b5e89c6562 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 25 11:53:29 2012 +0100 gallivm: simplify aos float tex wrap repeat nearest just handle pot and npot the same. The previous pot handling ended up with exactly the same instructions plus 2 more (leave it in the soa path though since it is probably still cheaper there). While here also fix an issue which would cause a crash after an assert. commit 0e1e755645e9e49cfaa2025191e3245ccd723564 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 25 11:29:24 2012 +0100 gallivm: (trivial) skip floor rounding in ifloor when not signed This was only done for the non-sse41 case before, but even with sse41 this is obviously unnecessary (some callers already call itrunc in this case anyway but some might not). commit 7f01a62f27dcb1d52597b24825931e88bae76f33 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 25 11:23:12 2012 +0100 gallivm: (trivial) fix bogus comments commit 5c85be25fd82e28490274c468ce7f3e6e8c1d416 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Jun 20 11:51:57 2012 +0100 translate: Free elt8_func/elt16_func too. These were leaking. 
Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> commit 0ad498f36fb6f7458c7cffa73b6598adceee0a6c Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 19 15:55:34 2012 +0200 gallivm: fix bug for tex wrap repeat with linear sampling in aos float path The comparison needs to be against length not length_minus_one, otherwise the max texel is never chosen (for the second coordinate). Fixes piglit texwrap-1D-npot-proj (and 2D/3D versions). Reviewed-by: José Fonseca <jfonseca@vmware.com> commit d1ad65937c5b76407dc2499b7b774ab59341209e Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 19 16:13:43 2012 +0200 gallivm: simplify soa tex wrap repeat with npot textures and no mip filtering Similar to what is already done in aos sampling for the float path (but not the int path since we don't get normalized float coordinates there). URem is expensive and the calculation is done trivially with normalized floats instead (at least with sse41-capable cpus). (Some day should probably do the same for the mip filter path but it's much more complicated there hence the gain is smaller.) Reviewed-by: José Fonseca <jfonseca@vmware.com> commit e1e23f57ba9b910295c306d148f15643acc3fc83 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 18 20:38:56 2012 +0200 llvmpipe: (trivial) remove duplicated function declaration Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 07ca57eb09e04c48a157733255427ef5de620861 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 18 20:37:34 2012 +0200 llvmpipe: destroy setup variants on context destruction lp_delete_setup_variants() used to be called in garbage collection, but this no longer exists hence the setup shaders never got freed. 
Reviewed-by: José Fonseca <jfonseca@vmware.com> commit ed0003c633859a45f9963a479f4c15ae0ef1dca3 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 18 16:25:29 2012 +0100 gallivm: handle different ilod parts for multiple quad sampling This fixes filtering when the integer part of the lod is not the same for all quads. I'm not fully convinced of that solution yet as it just splits the vector if the levels to be sampled from are different. But otherwise we'd need to do things like some minify steps, and getting mip level base address separately anyway hence it wouldn't really look like much of a win (and making the code even more complex). This should now give identical results to single quad sampling. commit 8580ac4cfc43a64df55e84ac71ce1a774d33c0d2 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Jun 14 18:14:47 2012 +0200 gallivm: de-duplicate sample code common to soa and aos sampling There doesn't seem to be any reason why this code dealing with cube face selection, lod and mip level calculation is separate in aos and soa sampling, and I am sick of having it to change in both places. commit fb541e5f957408ce305b272100196f1e12e5b1e8 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Jun 14 18:15:41 2012 +0200 gallivm: do mip filtering with per quad lod_fpart This gives better results for mip filtering, though the generated code might not be optimal. For now it also creates some artifacts if the lod_ipart isn't the same for all quads, since instead of using the same mip weight for all quads as previously (which just caused non-smooth gradients) this now will use the right weights but with the wrong mip level in this case (can easily be seen with things like texfilt, mipmap_tunnel). 
v2: use logic helper suggested by José, and fix issue with negative lod_fpart values commit f1cc84eef7d826a20fab6cd8ccef9a275ff78967 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Jun 13 18:35:25 2012 +0200 gallivm: (trivial) fix bogus assert in lp_build_unpack_broadcast_aos_scalars commit 7c17dbae8ae290df9ce0f50781a09e8ed640c044 Author: James Benton <jbenton@vmware.com> Date: Tue Jun 12 12:11:14 2012 +0100 util: Reimplement half <-> float conversions. Removed u_half.py used to generate the table for previous method. Previous implementation of float to half conversion was faulty for denormalised and NaNs and would require extra logic to fix, thus making the speedup of using tables irrelevant. commit 7762f59274070e1dd4b546f5cb431c2eb71ae5c3 Author: James Benton <jbenton@vmware.com> Date: Tue Jun 12 12:12:16 2012 +0100 tests: Updated tests to properly handle NaN for half floats. commit fa94c135aea5911fd93d5dfb6e6f157fb40dce5e Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 11 18:33:10 2012 +0200 gallivm: do mip level calculations per quad This is the final piece which shouldn't change the rendering output yet. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 23cbeaddfe03c09ca18c45d28955515317ffcf4c Author: Roland Scheidegger <sroland@vmware.com> Date: Sat Jun 9 00:54:21 2012 +0200 gallivm: do per-quad cube face selection Doesn't quite fix the piglit cubemap test (not sure why actually) but doing per-quad face selection is doing the right thing and definitely an improvement. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit abfb372b3702ac97ac8b5aa80ad1b94a2cc39d33 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 11 18:22:59 2012 +0200 gallivm: do all lod calculations per quad Still no functional change but lod is now converted to scalar after lod calculations. 
Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 519368632747ae03feb5bca9c655eccbc5b751b4 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 16:46:10 2012 +0100 gallivm: Added support for half-float to float conversion in lp_build_conv. Updated various utility functions to support this change. commit 135b4d683a4c95f7577ba27b9bffa4a6fbd2c2e7 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 16:02:46 2012 +0100 gallivm: Added function for half-float to float conversion. Updated lp_build_format_aos_array to support half-float source. commit 37d648827406a20c5007abeb177698723ed86673 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 14:55:18 2012 +0100 util: Updated u_format_tests to rigidly test half-float boundary values. commit 2ad18165d96e578aa9046df7c93cb1c3284d8c6b Author: James Benton <jbenton@vmware.com> Date: Tue May 22 14:54:16 2012 +0100 llvmpipe: Updated lp_test_format to properly handle Inf/NaN results. commit 78740acf25aeba8a7d146493dd5c966e22c27b73 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 14:53:30 2012 +0100 util: Added functions for checking NaN / Inf for double and half-floats. commit 35e9f640ae01241f9e0d67fe893bbbf564c05809 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu May 24 21:05:13 2012 +0200 gallivm: Fix calculating rho for 3d textures for the single-quad case Discovered by accident, this looks like a very old typo bug. commit fc1220c636326536fd0541913154e62afa7cd1d8 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu May 24 21:04:59 2012 +0200 gallivm: do calcs per-quad in lp_build_rho Still convert to scalar at the end of the function. commit 50a887ffc550bf310a6988fa2cea5c24d38c1a41 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon May 21 23:21:50 2012 +0200 gallivm: (trivial) return scalar in lp_build_extract_range for length 1 vectors Our type system on top of llvm's one doesn't generally support vectors of length 1, instead using scalars. 
So we should return a scalar from this function instead of having to bitcast the vector with length 1 later elsewhere. commit 80c71c621f9391f0f9230460198d861643324876 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 17:49:15 2012 +0100 draw: Fixed bad merge error commit c47401cfad0c9167de20ff560654f533579f452c Author: James Benton <jbenton@vmware.com> Date: Tue May 22 15:29:30 2012 +0100 draw: Updated store_clip to store whole vectors instead of individual elements. commit 2d9c1ad74b0b0b41861fffcecde39f09cc27f1cf Author: James Benton <jbenton@vmware.com> Date: Tue May 22 15:28:32 2012 +0100 gallivm: Added lp_build_fetch_rgba_aos_array. A version of lp_build_fetch_rgba_aos which is targeted at simple array formats. Reads the whole vector from memory in one, instead of reading each element individually. Tested with mesa tests and demos. commit ff7805dc2b6ef6d8b11ec4e54aab1633aef29ac8 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 15:27:40 2012 +0100 gallivm: Added lp_build_pad_vector. This function pads a vector with undef to a desired length. commit 701f50acef24a2791dabf4730e5b5687d6eb875d Author: James Benton <jbenton@vmware.com> Date: Fri May 18 17:27:19 2012 +0100 util: Added util_format_is_array. This function checks whether a format description is in a simple array format. commit 5e0a7fa543dcd009de26f34a7926674190fa6246 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 19:13:47 2012 +0100 draw: Removed draw_llvm_translate_from and draw/draw_llvm_translate.c. This is "replaced" by adding an optimised path in lp_build_fetch_rgba_aos in an upcoming patch. commit 8c886d6a7dd3fb464ecf031de6f747cb33e5361d Author: James Benton <jbenton@vmware.com> Date: Wed May 16 15:02:31 2012 +0100 draw: Modified store_aos to write the vector as one, not individual elements. 
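The draw and gallivm commits around this point convert vertex data between AoS (one xyzw vector per vertex) and SoA (one vector per channel) layouts. As a rough illustration only, the conversion for four 4-component vertices is a 4x4 transpose. The plain C sketch below uses a hypothetical helper name, transpose_4x4, and is not the LLVM IR that lp_build_transpose_aos emits.

```c
#include <stddef.h>

/*
 * Hypothetical scalar model of an AoS <-> SoA conversion.
 * AoS: src[vertex][channel], one xyzw vector per vertex.
 * SoA: dst[channel][vertex], one vector per channel.
 * The same routine converts in both directions because a
 * transpose is its own inverse.
 */
static void transpose_4x4(const float src[4][4], float dst[4][4])
{
    for (size_t row = 0; row < 4; row++) {
        for (size_t col = 0; col < 4; col++) {
            dst[col][row] = src[row][col];
        }
    }
}
```

Applying transpose_4x4 twice returns the original layout, which is presumably why one helper can serve both aos_to_soa and soa_to_aos.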
commit 37337f3d657e21dfd662c7b26d61cb0f8cfa6f17 Author: James Benton <jbenton@vmware.com> Date: Wed May 16 14:16:23 2012 +0100 draw: Changed aos_to_soa to use lp_build_transpose_aos. commit bd2b69ce5d5c94b067944d1dcd5df9f8e84548f1 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 19:14:27 2012 +0100 draw: Changed soa_to_aos to use lp_build_transpose_aos. commit 0b98a950d29a116e82ce31dfe7b82cdadb632f2b Author: James Benton <jbenton@vmware.com> Date: Fri May 18 18:57:45 2012 +0100 gallivm: Added lp_build_transpose_aos which converts between aos and soa. commit 69ea84531ad46fd145eb619ed1cedbe97dde7cb5 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 18:57:01 2012 +0100 gallivm: Added lp_build_interleave2_half aimed at AVX unpack instructions. commit 7a4cb1349dd35c18144ad5934525cfb9436792f9 Author: José Fonseca <jfonseca@vmware.com> Date: Tue May 22 11:54:14 2012 +0100 gallivm: Fix build on Windows. MC-JIT not yet supported there. Reviewed-by: Roland Scheidegger <sroland@vmware.com> commit afd105fc16bb75d874e418046b80d9cc578818a1 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:17:26 2012 +0100 llvmpipe: Added an error counter to lp_test_conv. Useful for keeping track of progress when fixing errors! Signed-off-by: José Fonseca <jfonseca@vmware.com> commit b644907d08c10a805657841330fc23db3963d59c Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:16:46 2012 +0100 llvmpipe: Changed known failures in lp_test_conv. To comply with the recent fixes to lp_bld_conv. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit d7061507bd94f6468581e218e61261b79c760d4f Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:14:38 2012 +0100 llvmpipe: Added fixed point types tests to lp_test_conv. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit 146b3ea39b4726dbe125ac666bd8902ea3d6ca8c Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:26:35 2012 +0100 llvmpipe: Changed lp_test_conv src/dst alignment to be correct. 
Now based on the define rather than a fixed number. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit f3b57441f834833a4b142a951eb98df0aa874536 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:06:44 2012 +0100 gallivm: Fixed erroneous optimisation in lp_build_min/max. Previously assumed normalised was 0 to 1, but it can be -1 to 1 if type is signed. Tested with lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit a0613382e5a215cd146bb277646a6b394d376ae4 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:04:49 2012 +0100 gallivm: Compensate for lp_const_offset in lp_build_conv. Fixing a /FIXME/ to remove errors in integer conversion in lp_build_conv. Tested using lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit a3d2bf15ea345bc8a0664f8f441276fd566566f3 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:01:25 2012 +0100 gallivm: Fixed overflow in lp_build_clamped_float_to_unsigned_norm. Tested with lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit e7b1e76fe237613731fa6003b5e1601a2e506207 Author: José Fonseca <jfonseca@vmware.com> Date: Mon May 21 20:07:51 2012 +0100 gallivm: Fix build with LLVM 2.6 Trivial, and useful. commit d3c6bbe5c7f5ba1976710831281ab1b6a631082d Author: José Fonseca <jfonseca@vmware.com> Date: Tue May 15 17:15:59 2012 +0100 gallivm: Enable MCJIT/AVX with vanilla LLVM 3.1. Add the necessary C++ glue, so that we don't need any modifications to the soon to be released LLVM 3.1. Reviewed-by: Roland Scheidegger <sroland@vmware.com> commit 724a019a14d40fdbed21759a204a2bec8a315636 Author: José Fonseca <jfonseca@vmware.com> Date: Mon May 14 22:04:06 2012 +0100 gallivm: Use HAVE_LLVM 0x0301 consistently. 
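Two of the conversion fixes above concern the normalised range (0 to 1 for unsigned types, -1 to 1 for signed types) and an overflow in the clamped float to unsigned norm conversion. The log does not describe the fixes in detail, so the following is only a scalar C sketch of the arithmetic under my own assumptions: the names clamp_norm and float_to_unorm are hypothetical, and widening to double is just one way to keep a 32-bit scale factor from overflowing, not necessarily what gallivm does.

```c
#include <stdint.h>

/* Clamp to the normalised range: [0, 1] if unsigned, [-1, 1] if signed.
 * The negated comparison also maps NaN to the lower bound. */
static float clamp_norm(float f, int is_signed)
{
    const float lo = is_signed ? -1.0f : 0.0f;

    if (!(f > lo))
        return lo;
    if (f > 1.0f)
        return 1.0f;
    return f;
}

/* Convert a float to an unsigned normalised integer of 1..32 bits.
 * Assumption: the scaling is done in double, because 2^32 - 1 is not
 * representable in a float and would round up to 2^32. */
static uint32_t float_to_unorm(float f, unsigned bits)
{
    const uint64_t max = (bits >= 32) ? 0xffffffffull
                                      : ((1ull << bits) - 1);
    const float c = clamp_norm(f, 0);

    if (c >= 1.0f)
        return (uint32_t)max;

    /* Round to nearest by adding 0.5 before truncation. */
    return (uint32_t)((double)c * (double)max + 0.5);
}
```

With 8 bits, 0.5f maps to 128 and anything at or above 1.0f maps to 255. With 32 bits, 1.0f maps to 0xffffffff instead of wrapping around.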
commit af6991e2a3868e40ad599b46278551b794839748 Author: José Fonseca <jfonseca@vmware.com> Date: Mon May 14 21:49:06 2012 +0100 gallivm: Add MCRegisterInfo.h to silence benign warnings about missing implementation. Trivial. commit 6f8a1d75458daae2503a86c6b030ecc4bb494e23 Author: Vinson Lee <vlee@freedesktop.org> Date: Mon Apr 2 22:14:15 2012 -0700 gallivm: Pass in a MCInstrInfo to createMCInstPrinter on llvm-3.1. llvm-3.1svn r153860 makes MCInstrInfo available to the MCInstPrinter. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com> commit 62555b6ed8760545794f83064e27cddcb3ce5284 Author: Vinson Lee <vlee@freedesktop.org> Date: Tue Mar 27 21:51:17 2012 -0700 gallivm: Fix method overriding in raw_debug_ostream. Use matching type qualifiers to avoid method hiding. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 6a9bd784f4ac68ad0a731dcd39e5a3c39989f2be Author: Vinson Lee <vlee@freedesktop.org> Date: Tue Mar 13 22:40:52 2012 -0700 gallivm: Fix createOProfileJITEventListener namespace with llvm-3.1. llvm-3.1svn r152620 refactored the OProfile profiling code. createOProfileJITEventListener was moved from the llvm namespace to the llvm::JITEventListener namespace. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com> commit b674955d39adae272a779be85aa1bd665de24e3e Author: Vinson Lee <vlee@freedesktop.org> Date: Mon Mar 5 22:00:40 2012 -0800 gallivm: Pass in a MCRegisterInfo to MCInstPrinter on llvm-3.1. llvm-3.1svn r152043 changes createMCInstPrinter to take an additional MCRegisterInfo argument. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com> commit 11ab69971a8a31c62f6de74905dbf8c02884599f Author: Vinson Lee <vlee@freedesktop.org> Date: Wed Feb 29 21:20:53 2012 -0800 Revert "gallivm: Change getExtent and readByte to non-const with llvm-3.1." This reverts commit `d5a6c17254`. 
llvm-3.1svn r151687 makes MemoryObject accessor members const again. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com> commit 339960c82d2a9f5c928ee9035ed31dadb7f45537 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon May 14 16:19:56 2012 +0200 gallivm: (trivial) fix assertion failure for mipmapped 1d textures In lp_build_rho, we may end up with a 1-element vector (for mipmapped 1d textures), but in this case we require the type to be a non-vector type, so need a cast. commit 9d73edb727bd6d196030dc3026b7bf0c574b3e19 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu May 10 18:12:07 2012 +0200 gallivm: prepare for per-quad lod calculations for large vectors to be able to handle multiple quads at once in texture sampling and still do lod calculations per quad, it is necessary to get the per-quad derivatives into the lp_build_rho function. Until now these derivative values were just scalars, which isn't going to work. So we now use vectors, and since the interface needs to change we also do some different (slightly more efficient) packing of the values. For 8-wide vectors the packed derivative values for 3 coords would look like this, this scales to a arbitrary (multiple of 4) vector size: ds1dx ds1dy dt1dx dt1dy ds2dx ds2dy dt2dx dt2dy dr1dx dr1dy _____ _____ dr2dx dr2dy _____ _____ The second vector will be unused for 1d and 2d textures. To facilitate future changes the derivative values are put into a struct, since quite some functions just pass these values through. The generated code seems to be very slightly better for 2d textures (with 4-wide vectors) than before with sse2 (if you have a cpu with physical 128bit simd units - otherwise it's probably not a win). 
v2: suggestions from José, rename variables, add comments, use swizzle helper commit 0aa21de0d31466dac77b05c97005722e902517b8 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu May 10 18:10:31 2012 +0200 gallivm: add undefined swizzle handling to lp_build_swizzle_aos This is useful for vectors with "holes", it lets llvm choose the most efficient shuffle instructions if some elements aren't needed without having to worry what elements to manually pick otherwise. commit 00faf3f370e7ce92f5ef51002b0ea42ef856e181 Author: José Fonseca <jfonseca@vmware.com> Date: Fri May 4 17:25:16 2012 +0100 gallivm: Get the LLVM IR optimization passes before JIT compilation. MC-JIT engine compiles the module immediately on creation, so the optimization passes were being run too late. So now we create a target data layout from a string, that matches the ABI parameters reported by the compiler. The backend optimization passes were always been run, so the performance improvement is modest (3% on multiarb mesa demo). Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> commit 40a43f4e2ce3074b5ce9027179d657ebba68800a Author: Roland Scheidegger <sroland@vmware.com> Date: Wed May 2 16:03:54 2012 +0200 gallivm: (trivial) fix wrong define used in lp_build_pack2 should fix stack-smashing crashes. commit e6371d0f4dffad4eb3b7a9d906c23f1c88a2ab9e Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Apr 30 21:25:29 2012 +0200 gallivm: add perf warnings when not using intrinsics with 256bit vectors Helper functions using integer sse2 intrinsics could split the vectors with AVX instead of using generic fallback (which should be faster). We don't actually expect to hit these paths (hence don't fix them up to actually do the vector splitting) so just emit warnings (for those functions where it's obvious doing split/intrinsic is faster than using generic path). 
Only emit warnings for 256bit vectors since we _really_ don't expect to hit arbitrary large vectors which would affect a lot more functions. The warnings do not actually depend on avx since the same logic applies to plain sse2 too (but of course again there's _really_ no reason we should hit these functions with 256bit vectors without avx). commit 8a9ea701ea7295181e846c6383bf66a5f5e47637 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue May 1 20:37:07 2012 +0200 gallivm: split vectors manually for avx in lp_build_pack2 (v2) There's 2 reasons for this: First, there's a llvm bug (fixed in 3.1) which generates tons of byte inserts/extracts otherwise, and second, more importantly, we want to use pack intrinsics instead of shuffles. We do this in lp_build_pack2 and not the calling code (aos sample path) because potentially other callers might find that useful too, even if for larger sequences of code using non-native vector sizes it might be better to manually split vectors. This should boost texture performance in the aos path considerably. v2: fix issues with intrinsics types with old llvm commit 27ac5b48fa1f2ea3efeb5248e2ce32264aba466e Author: Roland Scheidegger <sroland@vmware.com> Date: Tue May 1 20:26:22 2012 +0200 llvmpipe: refactor lp_build_pack2 (v2) prettify, and it's unnecessary to assert when there's no intrinsic due to unsupported bit width - the shuffle path will work regardless. In contrast lp_build_packs2, should only rely on lp_build_pack2 doing the clamping for element sizes for which there is a sse2 intrinsic. v2: fix bug spotted by Jose regarding the intrinsic type for packusdw on old llvm versions. 
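The lp_build_pack2 commits above prefer pack intrinsics over shuffles because the SSE pack instructions saturate while narrowing, which is what lets lp_build_packs2 skip explicit clamps. As a hedged scalar model of that behaviour, the C sketch below mirrors the per-element semantics of packssdw (signed saturation to 16 bits) and packusdw (unsigned saturation to 16 bits). The function names are hypothetical and this is not gallivm code.

```c
#include <stdint.h>

/* packssdw-style element: signed 32-bit to signed 16-bit, saturating. */
static int16_t pack_s32_to_s16(int32_t v)
{
    if (v < INT16_MIN)
        return INT16_MIN;
    if (v > INT16_MAX)
        return INT16_MAX;
    return (int16_t)v;
}

/* packusdw-style element: signed 32-bit to unsigned 16-bit, saturating. */
static uint16_t pack_s32_to_u16(int32_t v)
{
    if (v < 0)
        return 0;
    if (v > UINT16_MAX)
        return UINT16_MAX;
    return (uint16_t)v;
}

/* Pack two 4-element vectors into one 8-element vector: lo fills the
 * first four result elements, hi the last four. */
static void pack2_u16(const int32_t lo[4], const int32_t hi[4],
                      uint16_t dst[8])
{
    for (int i = 0; i < 4; i++) {
        dst[i]     = pack_s32_to_u16(lo[i]);
        dst[i + 4] = pack_s32_to_u16(hi[i]);
    }
}
```

A plain shuffle-based narrowing would keep only the low 16 bits of each element, so callers would have to clamp first. With the saturating form the clamp comes for free.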
commit ddf279031f0111de4b18eaf783bdc0a1e47813c8 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue May 1 20:13:59 2012 +0200 gallivm: add src width check in lp_build_packs2() not doing so would skip clamping even if no sse2 pack instruction is available, which is incorrect (in theory only, such widths would also always hit a (unnecessary) assertion in lp_build_pack2(). commit e7f0ad7fe079975eae7712a6e0c54be4fae0114b Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Apr 27 15:57:00 2012 +0200 gallivm: (trivial) fix crash-causing typo for npot textures with avx commit 28a9d7f6f655b6ec508c8a3aa6ffefc1e79793a0 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Apr 25 19:38:45 2012 +0200 gallivm: (trivial) remove code mistakenly added twice. commit d5926537316f8ff67ad0a52e7242f7c5478d919b Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Apr 24 21:16:15 2012 +0200 gallivm: add a new avx aos sample path (v2) Try to avoid mixing float and int address calculations. This does texture wrap modes with floats, and then the offset calculations still with ints (because of lack of precision with floats, though we could do some effort to make it work with not too large (16MB) textures). This also handles wrap repeat mode with npot-sized textures differently than either the old soa or aos int path (likely way faster but untested). Otherwise the actual address wrap code is largely similar to the soa path (not quite the same as this one also has some int code), it should get used by avx soa sampling later as well but doesn't handle more complex address modes yet (this will also have the benefit that we can use aos sampling path for all texture address modes). Generated code for that looks reasonable, but still does not split vectors explicitly for fetch/filter which means still get hit by llvm (fixed upstream) which generates hundreds of pinsrb/pextrb instead of two shuffles. 
It is not obvious though if it's much of a win over just doing address calcs 4-wide but with ints, even if it is definitely much less instructions on avx. piglit's texwrap seems to look exactly the same but doesn't test neither the non-normalized nor the npot cases. v2: fix comments, prettify based on Brian's and Jose's feedback. commit bffecd22dea66fb416ecff8cffd10dd4bdb73fce Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Apr 19 01:58:29 2012 +0200 gallivm: refactor aos lp_build_sample_image_nearest/linear split them up to separate address calculations and fetching/filtering. Need this for being able to do 8-wide float address calcs and 4-wide fetch/filter later (for avx). Plus the functions were very big scary monsters anyway (in particular lp_build_sample_image_linear). commit a80b325c57529adddcfa367f96f03557725c4773 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Apr 16 17:17:18 2012 +0200 gallivm: fix lp_build_resize when truncating width but expanding vector size Missed this case which I thought was impossible - the assertion for it was right after the division by zero... (AoS) texture sampling may ask us to do this, for things like 8 4x32int vectors to 1 32x8int vector conversion (eventually, we probably don't want this to happen). commit f9c8337caa3eb185830d18bce8b95676a065b1d7 Author: Roland Scheidegger <sroland@vmware.com> Date: Sat Apr 14 18:00:59 2012 +0200 gallivm: fix cube maps with larger vectors This makes the branchless cube face selection code work with larger vectors. Because the complexity is quite high (cannot really be improved it seems, per-face selection would reduce complexity a lot but this leads to errors unless the derivatives are calculated all from the same face which almost doubles the work to be done) it is still slower than the branching version, hence only enable this with large vectors. 
It doesn't actually do per-quad face selection yet (only makes sense with matching lod selection, in fact it will select the same face for all pixels based on the average of the first four pixels for now) but only different shuffles are required to make it work (the branching version actually should work with larger vectors too now thanks to the improved horizontal add but of course it cannot be extended to really select the face per-quad unless doing branching per quad). commit 7780c58869fc9a00af4f23209902db7e058e8a66 Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 30 21:11:12 2012 +0100 llvmpipe: (trivial) fix compiler warning and also clarify comment regarding availability of popcnt instruction. commit a266dccf477df6d29a611154e988e8895892277e Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 30 14:21:07 2012 +0100 gallivm: remove unneeded members in lp_build_sample_context Minor cleanup, the texture width, height, depth aren't accessed in their scalar form anywhere. Makes it more obvious those values should probably be fetched already vectorized (but this requires more invasive changes)... commit b678c57fb474e14f05e25658c829fc04d2792fff Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Mar 29 15:53:55 2012 +0100 gallivm: add a helper for concatenating vectors Similar to the extract_range helper intended to get around slow code generated by llvm for 128bit insertelements. Concatenating two 128bit vectors this way will result in a single vinsertf128 operation rather than two 64bit stores plus one 128bit load, though it might be mildly useful for other purposes as well. commit 415ff228bcd0cf5e44a4c15350a661f0f5520029 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 28 19:41:15 2012 +0100 gallivm: add a custom 2x8f->1x16ub avx conversion path Similar to the existing 4x4f->1x16ub sse2 path, shaves off a couple instructions (min/max mostly) because it relies on pack intrinsics clamping. 
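The cube map commits in this log select a face per quad rather than per pixel, and the message above notes that the selection is based on the average of four pixels. A minimal C sketch of that idea follows. The enum, the function names and the tie-breaking order (x before y before z) are my assumptions for illustration; the real code is branchless LLVM IR.

```c
/* Hypothetical face numbering in the usual +X, -X, +Y, -Y, +Z, -Z order. */
enum cube_face {
    FACE_POS_X, FACE_NEG_X,
    FACE_POS_Y, FACE_NEG_Y,
    FACE_POS_Z, FACE_NEG_Z
};

static float absf(float v)
{
    return v < 0.0f ? -v : v;
}

/* Pick the face from the coordinate with the largest magnitude. */
static enum cube_face select_cube_face(float rx, float ry, float rz)
{
    const float ax = absf(rx), ay = absf(ry), az = absf(rz);

    if (ax >= ay && ax >= az)
        return rx >= 0.0f ? FACE_POS_X : FACE_NEG_X;
    if (ay >= az)
        return ry >= 0.0f ? FACE_POS_Y : FACE_NEG_Y;
    return rz >= 0.0f ? FACE_POS_Z : FACE_NEG_Z;
}

/* Per-quad selection: sum the four pixel directions (the sum has the
 * same major axis as the average) and select a single face from it. */
static enum cube_face select_quad_face(const float dirs[4][3])
{
    float sx = 0.0f, sy = 0.0f, sz = 0.0f;

    for (int i = 0; i < 4; i++) {
        sx += dirs[i][0];
        sy += dirs[i][1];
        sz += dirs[i][2];
    }
    return select_cube_face(sx, sy, sz);
}
```

Selecting one face for the whole quad keeps the derivatives consistent, at the cost that an individual pixel near a face edge may be sampled from its neighbour's face.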
commit 78c08fc89f8fbcc6dba09779981b1e873e2a0299 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 28 18:44:07 2012 +0100 gallivm: add avx arithmetic intrinsics Add all avx intrinsics for arithmetic functions (with the exception of the horizontal add function which needs another look). Seems to pass basic tests. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit a586caa2800aa5ce54c173f7c0d4fc48153dbc4e Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 28 15:31:35 2012 +0100 gallivm: add avx logic intrinsics Add the blend intrinsics for 8-wide float and 4-wide double vectors. Since we lack 256bit int instructions these are used for int vectors as well, though obviously not for byte or word element values. The comparison intrinsics aren't extended for avx since these are only used for pre-2.7 llvm versions. commit 70275e4c13c89315fc2560a4c488c0e6935d5caf Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 28 00:40:53 2012 +0100 gallivm: new helper function for extract shuffles. Based on José's idea as we can need that in a couple places. Note that such shuffles should not be used lightly, since data layout of <4 x i8> is different to <16 x i8> for instance, hence might cause data rearrangement. commit 4d586dbae1b0c55915dda1759d2faea631c0a1c2 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 27 18:27:25 2012 +0100 gallivm: (trivial) don't overallocate shuffle variable using wrong define meant huge array... commit 06b0ec1f6d665d98c135f9573ddf4ba04b2121ad Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 27 17:54:20 2012 +0100 gallivm: don't do per-element extract/insert for vector element resize Instead of doing per-element extract/insert if the src vectors and dst vector differ in total size (which generates atrocious code) first change the src vectors size by using shuffles to destination vector size. 
We can still do better than that on AVX for packing to color buffer (by exploiting pack intrinsics characteristics hence eliminating the need for some clamps) but this already generates much better code. v2: incorporate feedback from José, Keith and use shuffle instead of bitcasts/extracts. Due to llvm deficiencies the latter cause all data to get moved to GPRs and back in pieces (even though the data in the regs actually stays the same...). commit c9970d70e05f95d3f52fe7d2cd794176a52693aa Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 23 19:33:19 2012 +0000 gallivm: fix bug in simple position interpolation Accidental use of position attribute instead of just pixel coordinates. Caused failures in piglit glsl-fs-ceil and glsl-fs-floor. commit d0b6fcdb008d04d7f73d3d725615321544da5a7e Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 23 15:31:14 2012 +0000 gallivm: fix emission of ceil opcode lp_build_ceil seems more appropriate than lp_build_trunc. This seems to be never hit though someone performs some ceil to floor magic. commit d97fafed7e62ffa6bf76560a92ea246a1a26d256 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Mar 22 11:46:52 2012 +0000 gallivm: new vectorized path for cubemap calculations should be faster when adapted to multiple quads as only selection masks need to be different. The code is more or less a per-pixel version adapted to only do it per quad. A per pixel version would be much simpler (could drop 2 selects, 6 broadcasts and the messy horizontal add of 3 vectors at the expense of only 2 more absolute value instructions - would also just work for arbitrarily large vectors). This version doesn't yet work with larger vectors because the horizontal add isn't adjusted to be able to work with 2x4 vectors (and also because face selection wouldn't be done per quad just per block though that would be only a correctness issue just as with lod selection). The downside is this code is quite a bit slower. 
On a Core2 it can be sped up by disabling the hw blend instructions for selection and using logicop fallbacks instead, but it is still slower than the old code, hence leave that in for now. Probably will chose one or the other version based on vector length in the end. commit b375fbb18a3fd46859b7fdd42f3e9908ea4ff9a3 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 21 14:42:29 2012 +0000 gallivm: fix optimized occlusion query intrinsic name commit a9ba0a3b611e48efbb0e79eb09caa85033dbe9a2 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Mar 21 16:19:43 2012 +0000 draw,gallivm,llvmpipe: Call gallivm_verify_function everywhere. commit f94c2238d2bc7383e088b8845b7410439a602071 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 20 18:54:10 2012 +0000 gallivm: optimize calculations for cube maps a bit this does some more vectorized calculations and uses horizontal adds if possible. A definite win with sse3 otherwise it doesn't seem to make much of a difference. In any case this is arithmetically identical, cannot handle larger vectors. Should be useful as a reference point against larger vector version later... commit 21a2c1cf3c8e1ac648ff49e59fdc0e3be77e2ebb Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 20 15:16:27 2012 +0000 llvmpipe: slight optimization of occlusion queries using movmskps when available. While this is slightly better for cpus without popcnt we should really sum the vectors ourselves (it is also possible to cast to i4 before doing the popcnt but that doesn't help that much neither since llvm is using some optimized popcnt version for i32) commit 5ab5a35f216619bcdf55eed52b0db275c4a06c1b Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 20 13:32:11 2012 +0000 llvmpipe: fix occlusion queries with larger vectors need to adjust casts etc. commit ff95e6fdf5f16d4ef999ffcf05ea6e8c7160b0d5 Author: José Fonseca <jfonseca@vmware.com> Date: Mon Mar 19 20:15:25 2012 +0000 gallivm: Restore optimization passes. 
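The occlusion query commits above count passing pixels by collapsing the per-pixel mask with movmskps and then applying a population count. The following C sketch models that sequence in scalar code, assuming each mask lane is either all ones (pixel passed) or zero. The helper names are hypothetical, and the loops stand in for what are single instructions on suitable hardware.

```c
#include <stdint.h>

/* Scalar model of movmskps: gather the sign bit of each 32-bit lane
 * into the low bits of an integer. n must not exceed 32. */
static unsigned movemask(const uint32_t *lanes, unsigned n)
{
    unsigned bits = 0;

    for (unsigned i = 0; i < n; i++)
        bits |= (unsigned)(lanes[i] >> 31) << i;
    return bits;
}

/* Portable population count: clears the lowest set bit each iteration. */
static unsigned popcount(unsigned v)
{
    unsigned count = 0;

    while (v) {
        v &= v - 1;
        count++;
    }
    return count;
}

/* Number of pixels in the vector that passed the depth/stencil test. */
static unsigned count_passed(const uint32_t *mask, unsigned n)
{
    return popcount(movemask(mask, n));
}
```

For a 4-wide mask of {~0, 0, ~0, ~0} the movemask result is binary 1101 and the query counter would be incremented by 3.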
commit 57b05b4b36451e351659e98946dae27be0959832 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 19:34:22 2012 +0000 llvmpipe: use existing min2 macro commit bc9a20e19b4f600a439f45679451f2e87cd4b299 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 19:07:27 2012 +0000 llvmpipe: add some safeguards against really large vectors As per José's suggestion, prevent things from blowing up if some cpu would have 1024bit or larger vectors. commit 0e2b525e5ca1c5bbaa63158bde52ad1c1564a3a9 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 18:31:08 2012 +0000 llvmpipe: fix mask generation for uberwide vectors this was the only piece preventing 16-wide vectors from working (apart from the LP_MAX_VECTOR_WIDTH define that is), which is the maximum as we don't get more pixels in the fragment shader at once. Hence adjust that so things could be tested properly with that size even though there seems to be no practical value. commit 3c8334162211c97f3a11c7f64e9e5a2a91ad9656 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 18:19:41 2012 +0000 llvmpipe: fix the simple interpolation method with larger vectors so both methods actually _really_ work now. Makes textures look nice with larger vectors... commit 1cb0464ef8871be1778d43b0c56adf9c06843e2d Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 17:26:35 2012 +0000 llvmpipe: fix mask generation and position interpolation with 8-wide vectors trivial bugs, with these things start to look somewhat reasonable. Textures though have some swizzling issues it seems. commit 168277a63ef5b72542cf063c337f2d701053ff4b Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 16:04:03 2012 +0000 llvmpipe: don't overallocate variables we never have more than 16 (stamp size) / 4 (minimum possible vector size). (With larger vectors those variables are still overallocated a bit.) 
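The sizing argument in the commits above (16 pixels per stamp, a minimum vector length of 4, and 16-wide as the useful maximum) can be written out as simple arithmetic. The sketch below is only a reading of those messages: the names are hypothetical, 32-bit elements are assumed, and the clamp stands in for the safeguard against very wide vectors rather than reproducing the actual LP_MAX_VECTOR_WIDTH logic.

```c
/* Pixels handled per stamp, taken from the commit message. */
#define STAMP_PIXELS 16u
/* Smallest vector length the code is described as using. */
#define MIN_VECTOR_LENGTH 4u

/* Vector length in 32-bit elements, clamped to [4, 16] so that
 * hypothetical 1024-bit or wider vectors cannot exceed one stamp. */
static unsigned vector_length(unsigned vector_bits)
{
    unsigned length = vector_bits / 32u;

    if (length < MIN_VECTOR_LENGTH)
        length = MIN_VECTOR_LENGTH;
    if (length > STAMP_PIXELS)
        length = STAMP_PIXELS;
    return length;
}

/* Number of vectors needed to cover one stamp. This is at most
 * 16 / 4 = 4, which bounds the per-stamp variable arrays. */
static unsigned vectors_per_stamp(unsigned vector_bits)
{
    return STAMP_PIXELS / vector_length(vector_bits);
}
```

Under these assumptions 128-bit SSE vectors need four vectors per stamp, 256-bit AVX vectors need two, and anything 512 bits or wider needs one.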
commit 409b54b30f81ed0aa9ed0b01affe15c72de9abd2 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 15:56:48 2012 +0000 llvmpipe: add some 32f8 formats to lp_test_conv Also add the ability to handle different sized vectors. commit 55dcd3af8366ebdac0af3cdb22c2588f24aa18ce Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 15:47:27 2012 +0000 gallivm: handle different sized vectors in conversion / pack only fully generic path for now (extract/insert per element). commit 9c040f78c54575fcd94a8808216cf415fe8868f6 Author: Roland Scheidegger <sroland@vmware.com> Date: Sun Mar 18 00:58:28 2012 +0100 llvmpipe: fix harmless use of uninitialized values commit 551e9d5468b92fc7d5aa2265db9a52bb1e368a36 Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 16 23:31:21 2012 +0100 gallivm: drop special path in extract_broadcast with different sized vectors Not needed, llvm can handle shuffles with different sized result vector just fine. Should hopefully generate the same code in the end, but simpler IR. commit 44da531119ffa07a421eaa041f63607cec88f6f8 Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 16 23:28:49 2012 +0100 llvmpipe: adapt interpolation for handling multiple quads at once this is still WIP there are actually two methods possible not quite sure what makes the most sense, so there's code for both for now: 1) the iterative method as used before (compute attrib values at upper left corner of stamp and upper left corner of each quad initially). It is improved to handle more than one quad at once, and also do some more vectorized calculations initially for slightly better code - newer cpus have full throughput with 4 wide float vectors, hence don't try to code up a path which might be faster if there's just one channel active per attribute. 2) just do straight interpolation for each pixel. Method 2) is more work per quad, but less initially - if all quads are executed significantly more overall though. 
But this might change with larger vector lengths. This method would also be needed if we'd do some kind of active quad merging when operating on multiple quads at once. This path contains some hack to force llvm to generate better code, it is still far from ideal though, still generates far too many unnecessary register spills/reloads. Both methods should work with different sized vectors. Not very well tested yet, still seems to work with four-wide vectors, need changes elsewhere to be able to test with wider vectors. commit be5d3e82e2fe14ad0a46529ab79f65bf2276cd28 Author: José Fonseca <jfonseca@vmware.com> Date: Fri Mar 16 20:59:37 2012 +0000 draw: Cleanup. commit f85bc12c7fbacb3de2a94e88c6cd2d5ee0ec0e8d Author: José Fonseca <jfonseca@vmware.com> Date: Fri Mar 16 20:43:30 2012 +0000 gallivm: More module compilation refactoring. commit d76f093198f2a06a93b2204857e6fea5fd0b3ece Author: José Fonseca <jfonseca@vmware.com> Date: Thu Mar 15 21:29:11 2012 +0000 llvmpipe: Use gallivm_compile/free_function() in linear code. Should had been done before. commit 122e1adb613ce083ad739b153ced1cde61dfc8c0 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 13 14:47:10 2012 +0100 llvmpipe: generate partial pixel mask for multiple quads still works with one quad, cannot be tested yet with more At least for now always fixed order with multiple quads. commit 4c4f15081d75ed585a01392cd2dcce0ad10e0ea8 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Mar 8 22:09:24 2012 +0100 llvmpipe: refactor state setup a bit Refactor to make it easier to emit (and potentially later fetch in fs) coefficients for multiple attributes at once. Need to think more about how to make this actually happen however, the problem is different attributes can have different interpolation modes, requiring different handling in both setup and fs (though linear and perspective handling is close). 
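The interpolation commit above describes two methods: an iterative one that evaluates the attribute at a quad's upper left corner and steps from there, and a straight one that evaluates every pixel directly. A minimal C sketch of both, for linear interpolation only, is given below. The struct, the function names and the pixel order within a quad (left to right, then top to bottom) are assumptions for illustration, and perspective correction is left out.

```c
/* Hypothetical plane-equation coefficients for one attribute channel:
 * value(x, y) = a0 + dadx * x + dady * y. */
struct attrib_coef {
    float a0;
    float dadx;
    float dady;
};

/* Method 2: straight interpolation, one full evaluation per pixel. */
static float interp_direct(const struct attrib_coef *c, float x, float y)
{
    return c->a0 + c->dadx * x + c->dady * y;
}

/* Method 1: evaluate once at the quad's upper left corner (qx, qy),
 * then derive the other three pixels by adding the per-pixel deltas. */
static void interp_quad_iterative(const struct attrib_coef *c,
                                  float qx, float qy, float out[4])
{
    const float base = c->a0 + c->dadx * qx + c->dady * qy;

    out[0] = base;                       /* (qx,     qy)     */
    out[1] = base + c->dadx;             /* (qx + 1, qy)     */
    out[2] = base + c->dady;             /* (qx,     qy + 1) */
    out[3] = base + c->dadx + c->dady;   /* (qx + 1, qy + 1) */
}
```

The iterative form does more setup but fewer multiplies per pixel, which matches the trade-off described in the message: method 2 costs more per quad and less up front.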
commit 9363e49722ff47094d688a4be6f015a03fba9c79 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Mar 8 19:23:23 2012 +0100 llvmpipe: vectorize tri offset calc cuts number of instructions in quad-offset-factor from 107 to 75. This code actually duplicated the (scalar) code calculating the determinant except it used different vertex order (leading to different sign but it doesn't matter) hence llvm could not have figured out it's the same (of course with determinant vectorized in the other place that wouldn't have worked any longer neither). Note this particular piece doesn't actually vectorize well, not many arithmetic instructions left but tons of shuffle instructions... Probably would need to work on n tris at a time for better vectorization. commit 63169dcb9dd445c94605625bf86d85306e2b4297 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Mar 8 03:11:37 2012 +0100 llvmpipe: vectorize some scalar code in setup reduces number of arithmetic instructions, and avoids loading vector x,y values twice (once as scalars once as vectors). Results in a reduction of instructions from 76 to 64 in fs setup for glxgears (16%) on a cpu with sse41. Since this code uses vec2 disguised as vec4, on old cpus which had physical 64bit sse units (pre-Core2) it probably is less of a win in practice (and if you have no vectors you can only hope llvm eliminates the arithmetic for unneeded elements). commit 732ecb877f951ab89bf503ac5e35ab8d838b58a1 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 7 00:32:24 2012 +0100 draw: fix clipping bug introduced by 4822fea3f0440b5205e957cd303838c3b128419c broke clipping pretty badly (verified with lineclip test) commit ef5d90b86d624c152d200c7c4056f47c3c6d2688 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 6 23:38:59 2012 +0100 draw: don't store vertex header per attribute storing the vertex header once per attribute is totally unnecessary. 
Some quick look at the generated assembly says llvm in fact cannot optimize away the additional stores (maybe due to potentially aliasing pointers somewhere). Plus, this makes the code cleaner and also allows using a vector "or" instead of scalar ones. commit 6b3a5a57b0b9850854cfbd7b586e4e50102dda71 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 6 19:11:01 2012 +0100 draw: do the per-vertex "boolean" clipmask "or" with vectors no point extracting the values and doing it per component. Doesn't help that much since we still extract the values elsewhere anyway. commit 36519caf1af40e4480251cc79a2d527350b7c61f Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 2 22:27:01 2012 +0100 gallivm: fix lp_build_extract_broadcast with different sized vectors Fix the obviously wrong argument, so it doesn't blow up. commit 76d0ac3ad85066d6058486638013afd02b069c58 Author: José Fonseca <jfonseca@vmware.com> Date: Fri Mar 2 12:16:23 2012 +0000 draw: Compile per module and not per function (WIP). Enough to get gears w/ LLVM draw + softpipe to work on AVX doing: GALLIUM_DRIVER=softpipe SOFTPIPE_USE_LLVM=yes glxgears But still hackish -- will need to rethink and refactor this. commit 78e32b247d2a7a771be9a1a07eb000d1e54ea8bd Author: José Fonseca <jfonseca@vmware.com> Date: Wed Feb 29 12:01:05 2012 +0000 llvmpipe: Remove lp_state_setup_fallback. Never used. commit 6895d5e40d19b4972c361e8b83fdb7eecda3c225 Author: José Fonseca <jfonseca@vmware.com> Date: Mon Feb 27 19:14:27 2012 +0000 llvmpipe: Don't emit EMMS on x86 We already take precautions to ensure that LLVM never emits MMX code. commit 4822fea3f0440b5205e957cd303838c3b128419c Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Feb 29 15:58:19 2012 +0100 draw: modifications for larger vector sizes We want to be able to use larger vectors especially for running the vertex shader. With this patch we build soa vectors which might have a different length than 4. 
Note that aos structures really remain the same, only when aos structures are converted to soa potentially different sized vectors are used. Samplers probably don't work yet, didn't look at them. Testing done: glxgears works with both 128bit and 256bit vectors. commit f4950fc1ea784680ab767d3dd0dce589f4e70603 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Feb 29 15:51:57 2012 +0100 gallivm: override native vector width with LP_NATIVE_VECTOR_WIDTH env var for debug commit 6ad6dbf0c92f3bf68ae54e5f2aca035d19b76e53 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Feb 29 15:51:24 2012 +0100 draw: allocate storage with alignment according to native vector width commit 7bf0e3e7c9bd2469ae7279cabf4c5229ae9880c1 Author: José Fonseca <jfonseca@vmware.com> Date: Fri Feb 24 19:06:08 2012 +0000 gallivm: Fix comment grammar. Was missing several words. Spotted by Roland. commit b20f1b28eb890b2fa2de44a0399b9b6a0d453c52 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 19:22:09 2012 +0000 gallivm: Use MC-JIT on LLVM 3.1 + (i.e, SVN) MC-JIT Note: MC-JIT is still WIP. For this to work correctly it requires LLVM changes which are not yet upstream. commit b1af4dfcadfc241fd4023f4c3f823a1286d452c0 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Feb 23 20:03:15 2012 +0100 llvmpipe: use new lp_type_width() helper in lp_test_blend commit 04e0a37e888237d4db2298f31973af459ef9c95f Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Feb 23 19:50:34 2012 +0100 llvmpipe: clean up lp_test_blend a little Using variables just sized and aligned right makes it a bit more obvious what's going on. The test still only tests vector length 4. For AoS anything else probably isn't going to work. For SoA other lengths should work (at least with floats). commit e61c393d3ec392ddee0a3da170e985fda885a823 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 17:48:30 2012 +0000 gallivm: Ensure vector width consistency. 
Instead of assuming that everything is the max native size. commit 330081ac7bc41c5754a92825e51456d231bf84dd Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 17:44:14 2012 +0000 draw: More simd vector width consistency fixes. commit d90ca002753596269e37297e2e6c139b19f29f03 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 17:43:00 2012 +0000 gallivm: Remove unused lp_build_int32_vec4_type() helper. commit cae23417824d75869c202aaf897808d73a2c1db0 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Feb 23 17:32:16 2012 +0100 gallivm: use global variable for native vector width instead of define We do not know the simd extensions (and hence the simd width we should use) available at compile time. At least for now keep a define for maximum vector width, since a global variable obviously can't be used to adjust alignment of automatic stack variables. Leave the runtime-determined value at 128 for now in all cases. commit 51270ace6349acc2c294fc6f34c025c707be538a Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 15:41:02 2012 +0000 gallivm: Add a hunk inadvertently lost when rebasing. commit bf256df9cfdd0236637a455cbaece949b1253e98 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 14:24:23 2012 +0000 llvmpipe: Use consistent vector width in depth/stencil test. commit 5543b0901677146662c44be2cfba655fd55da94b Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 14:19:59 2012 +0000 draw: Use a consistent vector register width. Instead of 4x32 sometimes, LP_NATIVE_VECTOR_WIDTH other times. commit eada8bbd22a3a61f549f32fe2a7e408222e5c824 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 12:08:04 2012 +0000 gallivm: Remove garbage collection. MC-JIT will require one compilation per module (as opposed to one compilation per function), therefore no state will be shared, eliminating the need to do garbage collection. 
commit 556697ea0ed72e0641851e4fbbbb862c470fd7eb Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 10:33:41 2012 +0000 gallivm: Move all native target initialization to lp_set_target_options(). commit c518e8f3f2649d5dc265403511fab4bcbe2cc5c8 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 09:52:32 2012 +0000 llvmpipe: Create one gallivm instance for each test. commit 90f10af8920ec6be6f2b1e7365cfc477a0cb111d Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 09:48:08 2012 +0000 gallivm: Avoid LLVMAddGlobalMapping() in lp_bld_assert(). Brittle, complex, and unnecessary. Just use a function pointer constant. commit 98fde550b33401e3fe006af59db4db628bcbf476 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 09:21:26 2012 +0000 gallivm: Add a lp_build_const_func_pointer() helper. To be reused in all places where we want to call C code. commit 6cfedadb62c2ce5af8d75969bc95a607f3ece118 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 09:44:41 2012 +0000 gallivm: Cleanup/simplify lp_build_const_string_variable. - Move to lp_bld_const where it belongs - Rename to lp_build_const_string - take the length from the argument (and don't count the zero terminator twice) - bitcast the constant to generic i8 * commit db1d4018c0f1fa682a9da93c032977659adfb68c Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 11:52:17 2012 +0000 gallivm: Set NoFramePointerElimNonLeaf to true where supported. commit 088614164aa915baaa5044fede728aa898483183 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Feb 22 19:38:47 2012 +0100 llvmpipe: pass in/out pointers rather than scalar floats in lp_bld_arit we don't want llvm to potentially optimize away the vectors (though it doesn't seem to currently), plus we want to be able to handle in/out vectors of arbitrary length. 
commit 3f5c4e04af8a7592fdffa54938a277c34ae76b51 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Feb 21 23:22:55 2012 +0100 gallivm: fix lp_build_sqrt() for vector length 1 since we optimize away vectors with length 1 need to emit intrinsic without vector type. commit 79d94e5f93ed8ba6757b97e2026722ea31d32c06 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Feb 22 17:00:46 2012 +0000 llvmpipe: Remove lp_test_round. commit 81f41b5aeb3f4126e06453cfc78990086b85b78d Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Feb 21 23:56:24 2012 +0100 llvmpipe: subsume lp_test_round into lp_test_arit Much simpler, and since the arguments aren't passed as 128bit values can run on any arch. This also uses the float instead of the double versions of the c functions (which probably was the intention anyway). In contrast to lp_test_round the output is much less verbose however. Tested vector width of 32 to 512 bits - all pass except 32 (length 1) which crashes in lp_build_sqrt() due to wrong type. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit 945b338b421defbd274481d8c4f7e0910fd0e7eb Author: José Fonseca <jfonseca@vmware.com> Date: Wed Feb 22 09:55:03 2012 +0000 gallivm: Centralize the function compilation logic. This simplifies a lot of code. Also doing this in a central place will make it easier to carry out the changes necessary to use MC-JIT in the future. gallivm: Fix typo in explicit derivative shuffle. Trivial. draw: make DEBUG_STORE work again adapt to lp_build_printf() interface changes Reviewed-by: José Fonseca <jfonseca@vmware.com> draw: get rid of vecnf_from_scalar() just use lp_build_broadcast directly (cannot assign a name but don't really need it, vecnf_from_scalar() was producing much uglier IR due to using repeated insertelement instead of insertelement+shuffle). 
Reviewed-by: José Fonseca <jfonseca@vmware.com> llvmpipe: fix typo in complex interpolation code Fixes position interpolation when using complex mode (piglit fp-fragment-position and similar) Reviewed-by: José Fonseca <jfonseca@vmware.com> draw: fix clipvertex/position storing again This appears to be the result of a bad merge. Fixes piglit tests relying on clipping, like a lot of the interpolation tests. Reviewed-by: José Fonseca <jfonseca@vmware.com> gallivm: Fix explicit derivative manipulation. Same counter variable was being used in two nested loops. Use more meaningful variable names for the counter to fix and avoid this. gallivm: Prevent buffer overflow in repeat wrap mode for NPOT. Based on Roland's patch, discussion, and review. Reviewed-by: Roland Scheidegger <sroland@vmware.com> gallivm: Fix dims for TGSI_TEXTURE_1D in emit_tex. Reviewed-by: Roland Scheidegger <sroland@vmware.com> gallivm: Fix explicit volume texture derivatives. Reviewed-by: Roland Scheidegger <sroland@vmware.com> gallivm: fix 1d shadow texture sampling Always r coordinate is used, hence need 3 coords not two (the second one is unused). Reviewed-by: José Fonseca <jfonseca@vmware.com> gallivm: Enable AVX support without MCJIT, where available. For now, this just enables AVX on Windows for testing. If the code is stable then we might consider preferring the old JIT wherever possible. No change elsewhere. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-17 13:42:39 +01:00
José Fonseca	ba9c1773d7	gallivm: Allow to force nearest filtering on a per-axis basis. Experimental code, not really used yet.	2012-07-17 13:42:39 +01:00
Kristian Høgsberg	b262f56738	wayland: Include wl_drm format enum in wayland-drm.h This gets referenced before we get to generate the header files, so just include the enum that we need and don't include the generated header.	2012-07-17 08:30:39 -04:00
James Benton	e253175c9c	llvmpipe: Fix bug with blend factor in complementary optimisations. Fixes fdo 52168. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-17 13:16:38 +01:00
Christian König	89e755d762	radeonsi: fix vertex element state The vertex element state isn't in registers any more, so remove that old code. That fixes a memory corruption with the blend state and gets eglgears partially working. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-07-17 10:44:12 +02:00
Christian König	4247fd9928	radeon/llvm: fix compiling when llvm is active, but opencl isn't Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-17 10:43:53 +02:00
Brian Paul	aa0becdbb6	mesa: include inttypes.h to get uint8_t type To fix MSVC build.	2012-07-16 16:12:02 -06:00
Brian Paul	fe2a7b7e7f	st/egl: fix uninitialized pointer bug If no format is matched in the loop the value of xconf was undefined. NOTE: This is a candidate for the 8.0 branch.	2012-07-16 16:03:31 -06:00
Brian Paul	2f92a9f721	r300g: silence uninitialized var warning	2012-07-16 16:03:31 -06:00
Elvis Lee	cf775c9cbf	egl_dri2: NULL check for EGLNativeWindowType Some applications call eglCreateWindowSurface with an EGLNativeWindowType parameter of zero. This causes a SEGV and prevents proper error handling, such as returning EGL_NO_SURFACE. Signed-off-by: Elvis Lee <kwangwoong.lee@lge.com> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-07-16 16:03:31 -06:00
Jon TURNEY	d80fd04639	Fix building mesa with assembly enabled since `a112ca5d` `a112ca5d` rather crassly smashed all the compiler flags together into AM_CFLAGS. Separate them out the way they were before, putting pre-processor flags into AM_CPPFLAGS, so assembly source gets preprocessed with the correct pre-processor flags as well. Also, remove unneeded CFLAGS from AM_CFLAGS, and CXXFLAGS from AM_CXXFLAGS Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Tested-by: Brian Paul <brianp@vmware.com>	2012-07-16 22:54:36 +01:00
Chad Versace	8dc074cd92	intel: Fix build broken by ETC1 patch I suck at resolving merge conflicts and broke the build in `a5a34b1`. This patch adds the missing field intel_mipmap_tree::wraps_etc1. Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-16 14:29:24 -07:00
Chad Versace	a5a34b153d	intel: Enable GL_OES_compressed_ETC1_RGB8_texture Enable it for all hardware. No current hardware supports ETC1, so this patch implements it by translating the ETC1 data to RGBX data during the call to glCompressedTexImage2D(). For details, see the doxygen for intel_mipmap_tree::wraps_etc1. Passes the Piglit test spec/OES_compressed_ETC1_RGB8_texture/miptree and the ETC1 test in the GLES2 conformance suite. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-16 14:11:12 -07:00
Chad Versace	8ec721264c	mesa: Add function for decoding ETC1 textures Add function _mesa_etc1_unpack_rgba8888. It is intended to be used by glCompressedTexSubImage2D to decode ETC1 textures into RGBA. CC: Chia-I <olv@lunarg.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-16 14:07:57 -07:00
Chad Versace	d7458e401e	gallium/util, mesa: Refactor etc1 unpack function Move the body of util_etc1_rgb8_unpack_rgba_unorm8 into a new function that can be shared between gallium and dri drivers, texcompress_etc_tmp.h:etc1_unpack_rgba8888. CC: Chia-I <olv@lunarg.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-16 14:07:57 -07:00
Kristian Høgsberg	7250cd506b	gbm: Rename gbm_bo_get_pitch to gbm_bo_get_stride We use pitch for 'pixels per row' and stride for 'bytes per row' pretty consistently in mesa and most other places, so rename the gbm API.	2012-07-16 16:29:16 -04:00
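As a rough illustration of the pitch/stride naming convention described in the commit above, the sketch below converts between the two quantities. The helper names are hypothetical and not part of the gbm API, and the code assumes rows carry no padding; a real buffer's stride may be larger than pitch times bytes per pixel, which is why the stride should be queried rather than computed.

```c
#include <stdint.h>

/* Hypothetical helpers for illustration only; not part of gbm. */

/* Bytes per row, given pixels per row, assuming tightly packed rows. */
static uint32_t stride_from_pitch(uint32_t pitch_pixels, uint32_t bytes_per_pixel)
{
    return pitch_pixels * bytes_per_pixel;
}

/* Pixels per row, given bytes per row, assuming tightly packed rows. */
static uint32_t pitch_from_stride(uint32_t stride_bytes, uint32_t bytes_per_pixel)
{
    return stride_bytes / bytes_per_pixel;
}
```

For a 1024-pixel-wide buffer with 4 bytes per pixel, the pitch is 1024 and the unpadded stride is 4096.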
Kristian Høgsberg	44f066b9ff	gbm: Add new gbm_bo_import entry point This generalizes and replaces gbm_bo_create_for_egl_image. gbm_bo_import will create a gbm_bo from either an EGLImage or a struct wl_buffer.	2012-07-16 16:29:15 -04:00
Roland Scheidegger	43ccded1e1	llvmpipe: destroy setup variants on context destruction lp_delete_setup_variants() used to be called in garbage collection, but this no longer exists hence the setup shaders never got freed. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-16 19:00:54 +01:00
James Benton	8684ffc141	llvmpipe: Unified common code between AoS and SoA blending. Added a new file lp_bld_blend.c for the common code. Merged and added some simple optimisations. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-07-16 19:00:54 +01:00
Kristian Høgsberg	636646a481	intel: Don't call _mesa_get_format_bytes for MESA_FORMAT_NONE When we don't intend to texture from or render to a __DRIimage we use __DRI_IMAGE_FORMAT_NONE. In that case, we just create the __DRIimage to reference the underlying buffer, and will create usable __DRIimages from it using createSubImage later. If we try to use _mesa_get_format_bytes() on MESA_FORMAT_NONE in a debug build, we hit an assertion, so let's not do that.	2012-07-16 11:00:16 -04:00
Jon TURNEY	81de0431d6	Fix building glsl when using automake-1.12 after `68e04cc6` Commit `68e04cc6` was tested using automake-1.11. Unfortunately, automake-1.12 made a "slightly backward-incompatible change" in the use of yacc with C++, and for a .yy file, the generated header file is now named .hh, not .h To work with both, write our own rule for running yacc, which generates a header file named .h, rather than using automake's rule. Also, remove things from BUILD_SOURCES which don't need to be there Also, update EXCLUDE rules in doxygen/glsl.doxy, for change of generated files from .cpp -> .cc, and glsl_lexer.h has never existed. Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk>	2012-07-15 15:27:26 +01:00
Marek Olšák	bc6bff7947	r600g: compute needed CS space for vertex buffers correctly	2012-07-15 15:26:14 +02:00
Marek Olšák	15ca9d159e	r600g: don't check the R600_GLSL130 env var GLSL 1.3 has been enabled by default for quite a while.	2012-07-15 02:16:46 +02:00
Jerome Glisse	e634651024	r600g: fix DB decompression on evergreen Separated out of the hyperz patch by Marek with minor modifications. Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-07-15 02:06:44 +02:00
Tom Stellard	c2f444c54d	r600g: Emit vertex buffers using the same method as constant buffers Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-07-15 02:00:27 +02:00
Tom Stellard	9b76ee70b2	r600g: Unify 3D and compute vertex buffer emission Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-07-15 02:00:21 +02:00
Marek Olšák	0b4c5dbb8c	r600g: fix grammar constant_buffer -> constant_buffers	2012-07-15 01:41:11 +02:00
Andreas Boll	e3ff4d4c10	radeon/llvm: Fix CR/LF in AMDILSIDevice.h	2012-07-13 16:35:22 +00:00
Tom Stellard	cc3907856e	radeon/llvm: Clean up AMDILIntrinsicInfo.cpp	2012-07-13 16:29:46 +00:00
Tom Stellard	f323c6260d	radeon/llvm: Coding style fixes	2012-07-13 16:29:46 +00:00
Jon TURNEY	39d82a1b20	Fix linking gallium drivers and with dricore after `defadf2b1` Commit `defadf2b1` erroneously tries to make gallium drivers link with libdricore as a static library, not a shared library. Also, change uses of DRI_LIB_DEPS in gallium driver Makefiles to GALLIUM_DRI_LIB_DEPS, so the libraries added are used when linking the gallium driver. Also, fix the path to the libdricore.so symlink: it's made in LIB_DIR, not in the libdricore directory. Also, repair quoting of the dricore settings of the DRI_LIB_DEPS and GALLIUM_DRI_LIB_DEPS variables so VERSION is interpolated in configure but TOP and LIB_DIR are interpolated later (where they are known, but VERSION isn't). Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-13 17:20:39 +01:00
Christoph Bumiller	9ed65301e0	nouveau: implement missing timer query functionality	2012-07-13 17:28:00 +02:00
Kristian Høgsberg	426a23af14	wayland: Stop trying to use make rules from aclocal, just copy and paste Defeated by autotool, copy and paste to the rescue. https://bugs.freedesktop.org/show_bug.cgi?id=51997 https://bugs.freedesktop.org/show_bug.cgi?id=51531 Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-13 11:20:17 -04:00
José Fonseca	b3ba0a7afa	mesa/st: Generates TGSI that always recognizes INSTANCEID/VERTEXID as integers. Tested by running piglit draw-instanced, and by forcing llvmpipe advertise no native integer support, which now produces: VERT DCL IN[0] DCL SV[0], INSTANCEID DCL OUT[0], POSITION DCL OUT[1], COLOR DCL CONST[0..19] DCL TEMP[0], LOCAL DCL TEMP[1], LOCAL DCL TEMP[2], LOCAL DCL ADDR[0] 0: U2F TEMP[0].x, SV[0] 1: ARL ADDR[0].x, TEMP[0].xxxx 2: MOV TEMP[1].xy, CONST[ADDR[0].x+8].xyxx 3: ADD TEMP[2].x, IN[0].xxxx, TEMP[1].xxxx 4: ADD TEMP[1].x, IN[0].yyyy, TEMP[1].yyyy 5: MUL TEMP[2], CONST[16], TEMP[2].xxxx 6: MAD TEMP[2], CONST[17], TEMP[1].xxxx, TEMP[2] 7: MAD TEMP[2], CONST[18], IN[0].zzzz, TEMP[2] 8: MAD TEMP[2], CONST[19], IN[0].wwww, TEMP[2] 9: ARL ADDR[0].x, TEMP[0].xxxx 10: MOV TEMP[1], CONST[ADDR[0].x] 11: MOV OUT[0], TEMP[2] 12: MOV OUT[1], TEMP[1] 13: END	2012-07-13 13:01:52 +01:00
José Fonseca	6dddd18480	draw,gallivm: Fix draw_get_shader_param. - Use LLVM limits when LLVM is being used, instead of TGSI limits - Provide draw_get_shader_param_no_llvm for when llvm is never used (softpipe) - Eliminate several of the hacks around draw shader caps in several drivers Unfortunately the hack for PIPE_MAX_VERTEX_SAMPLERS is still necessary. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-13 13:01:51 +01:00
Jon TURNEY	99728076ec	Don't explicitly link libOsmesa with libmesa's dependency libglsl The libmesa convenience library is linked with the libglsl convenience library. libOsmesa is linked with libmesa, and also directly with libglsl. When using libtool, this gives rise to duplicate symbol errors. Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:44:44 +01:00
Jon TURNEY	b2a37e242e	automake: convert libglapi * "configure substitutions are not allowed in _SOURCES variables" in automake, so remove the AC_SUBST'ed GLAPI_ASM_SOURCES and instead use some AM_CONDITIONALS to choose which asm sources are used * Change GLAPI_LIB to point to the .la file in other Makefile.am files, and make a link to the .a file for the convenience of other Makefiles which have not yet been converted to automake v2: - Use AM_CPPFLAGS for cleaner build output - EXTRA_SOURCES is not needed - Remove libglapi.a compatibility link on clean Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:44:07 +01:00
Jon TURNEY	1e48dfeee6	Rename X86-64_API -> X86_64_API automake doesn't allow hyphens in variable names Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:44:05 +01:00
Jon TURNEY	defadf2b15	Link dri drivers with mesa or dricore libtool library Now mesa/drivers/dri is converted to automake, we want to update DRI_LIB_DEPS so that we link with the libmesa or libdricore libtool library, as appropriate. However, this is complicated by the fact that gallium/targets is not (yet) converted, so we can't share the DRI_LIB_DEPS autoconf variable with that anymore. Add an additional autoconf variable GALLIUM_DRI_LIB_DEPS, which is now used in gallium/targets/Makefile.dri, to link with the libdricore or libmesa native library. v2: libdricore$VERSION.a needs to be libdricore$(VERSION).a Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:44:03 +01:00
Jon TURNEY	cf362d00b9	Remove unused MESA_MODULES autoconf variable Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:44:01 +01:00
Jon TURNEY	a112ca5d5f	automake: convert libmesa and libmesagallium * "configure substitutions are not allowed in _SOURCES variables" in automake, so instead of MESA_ASM_FILES, use some AM_CONDITIONALS to choose which architecture's asm sources are used in libmesa_la_SOURCES. (Can't remove MESA_ASM_FILES autoconf variable as it's still used in sources.mak) * Update to link with the .la file in other Makefile.am files, and make a link to the .a file for the convenience of other Makefiles which have not yet been converted to automake v2: Remove stray -static from LDFLAGS v3: Remove .a compatibility link on clean Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:43:58 +01:00
Jon TURNEY	8676890018	Rename sparc/clip.S -> sparc/sparc_clip.S Automake can't handle having both clip.S and clip.c, even though they have different paths "src/mesa/Makefile.am: object `clip.lo' created by `$(SRCDIR)/sparc/clip.S' and `$(SRCDIR)/main/clip.c'" Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:43:56 +01:00
Jon TURNEY	68e04cc601	automake: convert libglsl v2: Use AM_V_GEN to silence generated code rules. Add BUILT_SOURCES to CLEANFILES v3: - Fix an accidental // in a path - Use automake make rules for lex/yacc rather than writing our own - Update .gitignore appropriately - Build a libglcpp convenience library rather than awkwardly including the files in libglsl and delegating the generation - Remove libglsl.a compatibility link on clean v4: - Automake's rules for lex/yacc make .cc if source is .ll or .yy, and apparently we must use those extensions "because of scons", so update everywhere glsl_parser.cpp -> glsl_parser.cc and glsl_lexer.cpp -> glsl_lexer.cc. This fixes 'make tarballs' and building with dricore enabled. Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:43:41 +01:00
Laurent Carlier	284325d97b	automake: convert libOSmesa This also fixes the installation of libOSmesa. v2: Remove old Makefile, libOSmesa is now versioned, fix typos v3: Keep config substitution alphabetized v4: Update .gitignore v5: Libraries will be in the builddir, not the srcdir. Reviewed-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:43:39 +01:00
Marek Olšák	1a06e8454e	mesa,st/mesa: implement GL_RGB565 from ARB_ES2_compatibility This was not implemented, because the spec was changed just recently. Everything has been in place already. Gallium has PIPE_FORMAT_B5G6R5_UNORM, while Mesa has MESA_FORMAT_RGB565. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-13 01:36:07 +02:00
Kenneth Graunke	fe911c1d43	i965: Move loop over texture units into brw_populate_sampler_prog_key. The whole reason I avoided this was because it might operate on a brw_vertex_program or a brw_fragment_program. However, that isn't a problem: all we need is the gl_program base type. This avoids awkwardly passing the loop counter 'i' as a parameter, simplifies both callers, and also plumbs prog in place for future use. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-12 14:17:44 -07:00
Kenneth Graunke	86e401b771	i965: Always emit alpha when nr_color_buffers == 0. If alpha-testing is enabled, we need to send alpha down the pipeline even if nr_color_buffers == 0. However, tracking whether alpha-testing is enabled in the WM program key is expensive: it causes us to compile multiple specializations of the same shader, using program cache space. This patch removes the check for alpha-testing, and simply emits alpha whenever nr_color_buffers == 0. We believe this will also be necessary for alpha-to-coverage, and it should add minimal overhead to an uncommon case. Saving the recompiles should more than make up the difference. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-12 13:35:46 -07:00
Kenneth Graunke	16060531ba	i965: Use the blitter in intel_bufferobj_subdata for busy BOs on Gen6+. Previously we only did this pre-Gen6, and used pwrite on Gen6+. In one workload, this cuts a significant amount of overhead. v2: Simplify the function based on Eric's suggestions. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-12 13:35:46 -07:00
José Fonseca	978807ef01	gallivm: Use %.9g to print floats. So that we can see them in their full denormalized glory. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-12 21:14:35 +01:00
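Nine significant decimal digits are enough to round-trip any finite single-precision value through text, denormals included, which is presumably what makes `%.9g` a good fit for debug output. A minimal sketch of that property follows; `float_round_trips` is a hypothetical helper written for this note, not gallivm code.

```c
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical helper: print a float with %.9g, parse the text back,
 * and report whether the parsed value equals the original.
 * Only meaningful for finite inputs (NaN never compares equal). */
static int float_round_trips(float value)
{
    char text[32];

    /* Variadic calls promote float to double; the cast makes that explicit. */
    snprintf(text, sizeof text, "%.9g", (double)value);
    return strtof(text, NULL) == value;
}
```

With fewer digits (for example `%.6g`) some values would parse back to a neighbouring float, so small differences between two printed values could be hidden.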
José Fonseca	5b8d80a783	scons: Remove -ffast-math. We rely on proper IEEE 754 behavior in too many places for this. See also commit `2fdbbeca43` with equivalent change for autoconf. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-12 21:14:29 +01:00
José Fonseca	bd3aab8d79	scons: Also require recent XCB. And don't trip when it's not found -- simply skip building src/glx.	2012-07-12 21:13:10 +01:00
Eric Anholt	6882381a2e	mesa: Require current libxcb. Without that, people with buggy apps that looked at just the server string for GLX_ARB_create_context would call this function that just threw an error when you tried to make a context. Google shows plenty of complaints about this. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 12:29:12 -07:00
Tom Stellard	f92873be2c	radeon/llvm: Don't use lp_build_swizzle_aos() for swizzles This function assumes that lp_build_context::type is a vector type, which is not true for r600 or radeonsi. This fixes an assertion failure using glamor 2D accel.	2012-07-12 13:53:22 -04:00
Tom Stellard	185fc9a5ef	radeonsi: Dump TGSI code prior to doing TGSI->LLVM conversion. This way if the conversion fails, we know what the TGSI shader looks like.	2012-07-12 13:53:22 -04:00
Kenneth Graunke	b546aebae9	i965: Delete previous workaround for textureGrad with shadow samplers. It had many problems: - The shadow comparison was done post-filtering. - It required state-dependent recompiles whenever the comparison function changed. - It didn't even work: many cases hit assertion failures. - I never implemented it for the VS. The new lowering pass which converts textureGrad to textureLod by computing the LOD value works much better. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-12 10:20:26 -07:00
Kenneth Graunke	b0c8d3be73	i965: Add a lowering pass to convert TXD to TXL by computing the LOD. Intel hardware doesn't natively support textureGrad with shadow comparisons. So we need to generate code to handle it somehow. Based on the equations of page 205 of the OpenGL 3.0 specification, it's possible to compute the LOD value that would be selected given the gradient values. Then, we can simply convert the TXD to a TXL. Currently, this passes 34/46 of oglconform's shadow-grad subtests; four cubemap tests are regressed. We should investigate this in the future. v2: Apply abs() to the scalar case (thanks to Eric). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-12 10:20:23 -07:00
Kenneth Graunke	d9da350a83	glsl/ir_builder: Add a new swizzle_for_size() function. This swizzles away unwanted components, while preserving the order of the ones that remain. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-12 10:20:20 -07:00
Kenneth Graunke	0bb3d4ba54	glsl/ir_builder: Add a generic constructor for unary expressions. I needed to compute logs and square roots in a patch I was working on, and wanted to use the convenient interface. We already have a similar constructor for binops; adding one for unops seems reasonable. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-12 10:20:18 -07:00
Kenneth Graunke	b656df990f	glsl: Initialize coordinate to NULL in ir_texture constructor. I ran into this while trying to create a TXS query, which doesn't have a coordinate. Since it didn't get initialized to NULL, a bunch of visitors tried to access it and crashed. Most of the time, this won't be a problem, but it's just a good idea. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-12 10:19:38 -07:00
José Fonseca	d9a8cd76e5	st/xorg: Fix build failure due to symbol clash.	2012-07-12 16:02:49 +01:00
Marek Olšák	0f3659bb56	docs: update relnotes-8.1 and GL3 status	2012-07-12 13:05:59 +02:00
Marek Olšák	63d8c8baa9	st/mesa: expose new transform feedback extensions	2012-07-12 13:05:59 +02:00
Marek Olšák	d24ece97e5	mesa: add ARB_transform_feedback_instanced extension enable flag Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:59 +02:00
Marek Olšák	db7404defd	mesa: implement new DrawTransformFeedback functions Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:59 +02:00
Marek Olšák	7e0cb473b0	mesa: implement display list support for new DrawTransformFeedback functions Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:59 +02:00
Marek Olšák	ce16ca4635	mesa: implement display list support for indexed query functions Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:59 +02:00
Marek Olšák	553e13dbc2	mesa: implement indexed query functions from ARB_transform_feedback3 Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:58 +02:00
Marek Olšák	375e73d859	mesa: implement glGet queries and error handling for ARB_transform_feedback3 Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:58 +02:00
Marek Olšák	21cb5ed20d	glsl: implement ARB_transform_feedback3 in the linker Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:58 +02:00
Marek Olšák	9576d555e0	glapi: add ARB_transform_feedback_instanced Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:58 +02:00
Marek Olšák	6d13d91f4e	glapi: add ARB_transform_feedback3 Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:58 +02:00
Marek Olšák	e773a48a3b	r600g: fix uploading non-zero mipmap levels of depth textures This fixes piglit/depth-level-clamp. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:31 +02:00
Marek Olšák	fe1fd67556	r600g: don't flush depth textures set as colorbuffers The only case a depth buffer can be set as a color buffer is when flushing. That wasn't always the case, but now this code isn't required anymore. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:31 +02:00
Marek Olšák	6842d5fced	r600g: don't set dirty_db_mask for a flushed depth texture A flushed depth texture is never set as a depth buffer and never flushed. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:31 +02:00
Marek Olšák	5a17d8318e	r600g: flush depth textures bound to vertex shaders This was missing/broken. There are also minor code cleanups. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:31 +02:00
Marek Olšák	dee58f94af	r600g: do fine-grained depth texture flushing - maintain a mask of which mipmap levels are dirty (instead of one big flag) - only flush what was requested at a given point and not the whole resource (most often only one level and one layer has to be flushed) Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	df79eb5956	r600g: remove is_flush from DSA state we can just update the state when decompressing, there's no need to add additional info into the DSA state Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	43e3f19c76	r600g: set DISABLE in CB_COLOR_CONTROL if colormask is 0 this will be useful for in-place DB decompression, otherwise should be harmless Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	4fe74412cf	r600g: move CB_SHADER_MASK setup into cb_misc_state Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	a1a1ff5ec0	r600g: move MULTIWRITE setup into cb_misc_state for r6xx-r7xx Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	0ea76916e6	r600g: move CB_TARGET_MASK setup into new cb_misc_state to remove some overhead from draw_vbo. This is a derived state. BTW, I've got no idea how compute interacts with 3D here, but it should use cb_misc_state, so that 3D and compute don't conflict. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	5ba15d8d38	st/mesa: implement accelerated stencil blitting using shader stencil export Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	a7f3697eb8	st/mesa: set colormask to zero when blitting depth Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	5a74e17ab0	gallium/u_blit: remove useless memset calls the structure is calloc'd. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	24e0a26335	gallium/u_blit: drop not-very-useful wrapper around util_blit_pixels_writemask just rename it to util_blit_pixels Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	3f13b5da15	gallium/u_blit: don't do two copies for non-2D textures Because u_blit couldn't sample a 1D, 3D, CUBE and ARRAY texture, we created a 2D texture holding a copy of one slice of the source texture (even for 1D). Let's just do it right. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	2dca61bcb3	gallium/util: move pipe_tex_to_tgsi_tex helper function into u_inlines Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	bdaf0a085b	gallium/u_blitter: accelerate stencil-only copying This doesn't seem to be used by anything yet, but better safe than sorry. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	12fd81f9e7	gallium/u_blitter: accelerate depth-stencil copying using shader stencil export This fixes stencil buffer write transfers on r600g. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	76db2c121c	gallium: add util_format_stencil_only helper function used for stencil sampler views. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	a730838a42	gallium/u_blitter: minify depth0 when initializing last_layer Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	91cf9fe988	gallium/u_gen_mipmap: accelerate depth texture mipmap generation Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	13b0af721a	mesa: remove assertions that do not allow compressed 2D_ARRAY textures NOTE: This is a candidate for the 8.0 branch. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Paul Berry	33202b4876	i965/msaa: Enable CMS layout on Gen7 for the formats that support it. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:50 -07:00
Paul Berry	4ebbc76621	i965/msaa: Add CMS support to blorp. This patch updates the blorp engine to properly handle the case where the surface being textured from uses Gen7's CMS MSAA layout. The following changes were necessary: - Before reading color values from the surface, we need to read from the MCS buffer using the ld_mcs sampler message. This is done by the mcs_fetch() function, and the result is stored in the mcs_data register. This only needs to be done once per pixel, since the MCS value is shared between all samples belonging to a pixel. - When reading color values from the surface, we need to use the ld2dms sampler message instead of the ld2dss message, and we need to provide the value read from the MCS buffer as an argument. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	754953693d	i965/msaa: Add CMS-related sampler messages to brw_defines.h. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	7b3263af69	i965/msaa: Set SURFACE_STATE properly when CMS MSAA is in use. When a buffer using Gen7's CMS MSAA layout is bound to a texture or a render target, the SURFACE_STATE structure needs to point to the MCS buffer and to indicate its pitch. This patch updates the functions that emit SURFACE_STATE to handle CMS layout properly. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	0ba813506d	i965/msaa: Add CMS MSAA settings to brw_structs.h. Previously the DWORD used to control the CMS MSAA layout was just a pad value, because we didn't use it. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	ccae1b1cd7	i965/msaa: Allocate MCS buffer when CMS MSAA is in use. To implement Gen7's CMS MSAA layout, we need an extra buffer, the MCS (Multisample Control Surface) buffer. This patch introduces code for allocating and deallocating the buffer, and storing a pointer to it in the intel_mipmap_tree struct. No functional change, since the CMS layout is not enabled yet. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	1bd4d456cd	i965/msaa: Add an enum to describe MSAA layout. From the Ivy Bridge PRM, Vol 1 Part 1, p112: There are three types of multisampled surface layouts designated as follows: - IMS Interleaved Multisampled Surface - CMS Compressed Multisampled Surface - UMS Uncompressed Multisampled Surface Previously, the i965 driver only used IMS and UMS formats, and distinguished between them using the boolean intel_mipmap_tree::msaa_is_interleaved. To facilitate adding support for the CMS format, this patch replaces that boolean (and other booleans derived from it) with an enum INTEL_MSAA_LAYOUT_{IMS,CMS,UMS}. It also updates the terminology used in comments throughout the driver to match the IMS/CMS/UMS terminology used in the PRM. CMS layout is not yet used. The enum has a fourth possible value, INTEL_MSAA_LAYOUT_NONE, which is used for non-multisampled surfaces. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	67b0f7c7dd	i965/msaa: Move {rt,tex}_interleaved into blorp program key. On Gen6, MSAA buffers always use an interleaved layout and non-MSAA buffers always use a non-interleaved layout, so it is not strictly necessary to keep track of the layout of the texture and render target surfaces in the blorp program key. However, it is cleaner to do so, since (a) it makes the blorp compiler less dependent on implicit knowledge about how the GPU pipeline is configured, and (b) it paves the way for implementing compressed multisampled surfaces in Gen7. This patch won't cause any redundant compiles, because the layout of the texture and render target surfaces depends on other parameters that are already in the blorp program key. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Kristian Høgsberg	2adfce4a18	mapi: Move GL_NV_draw_buffers extension to es_EXT.xml We don't generate public entrypoints for GLES extensions, so move the GL_NV_draw_buffers definition from ARB_draw_buffers.xml to es_EXT.xml. When the extension is defined in ARB_draw_buffers.xml, we end up with a public entry point for it, but no prototype, which gives an error when compiled with --disable-asm and --disable-shared-glapi. Instead, just move the GLES extension to es_EXT.xml so this doesn't happen. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-11 15:28:36 -04:00
Kristian Høgsberg	e6a33570b7	egl: Add EGL_WAYLAND_PLANE_WL attribute This lets us specify which plane of a multiplanar wl_buffer to create the image for. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-11 15:28:36 -04:00
Kristian Høgsberg	1aaec8c609	wayland-drm: Add protocol to create planar buffers	2012-07-11 15:28:35 -04:00
Kristian Høgsberg	379eb47ea6	wayland-drm: Pass struct wl_drm_buffer to the driver We're going to extend this to support multi-plane buffers, so pass this to the driver so it can access the details.	2012-07-11 15:28:35 -04:00
Kristian Høgsberg	95bc0527e9	intel: Implement __DRIimage::createSubImage and bump supported version to 5 We use the new miptree offset to pick out the sub-image when we bind the EGLImage to a texture. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-11 15:28:35 -04:00
Kristian Høgsberg	02ebad900d	intel: Add offset field to miptree This lets us specify an offset into the bo where the miptree starts, which will let us set up a texture for a single plane in a planar buffer. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-11 15:28:35 -04:00
Kristian Høgsberg	44a2b57f93	intel: Add support for new __DRIimage formats	2012-07-11 15:28:34 -04:00
Kristian Høgsberg	c029834808	__DRIimage: version 5, add new formats and createSubImage The additions in version 5 enable creating EGLImages for different planes of a YUV buffer. createImageFromName is still used to create the containing __DRIimage, and createSubImage can then be used on that __DRIimage to create __DRIimages that correspond to the y, u, and v planes (__DRI_IMAGE_FORMAT_R8) or the uv planes (__DRI_IMAGE_FORMAT_RG88) for formats such as NV12 where the u and v components are interleaved. Packed formats such as YUYV don't require any special treatment; we just sample those as a regular ARGB texture. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-11 15:28:34 -04:00
Tom Stellard	c0f7fe7b79	r600g/compute: Disable growing the memory pool The code for growing the memory pool (which is used for storing all of the global buffers) wasn't working. There seem to be two separate issues with the memory pool code. The first was the way it was growing the pool. When the memory pool needed more space, it would: 1. Copy the data from the memory pool's backing texture to system memory. 2. Delete the memory pool's texture 3. Create a bigger backing texture for the memory pool. 4. Copy the data from system memory into the bigger texture. The copy operations didn't seem to be working, and I suspect that since they were using fragment shaders to do the copy, that there might have been a problem with the mixing of compute and 3D state. The other issue is that the size of 1D textures is limited, and I was having trouble getting 2D textures to work. I think these problems will be easier to solve once more code is shared between 3D and compute, which is why I decided to disable it for now rather than continue searching for a fix.	2012-07-11 17:53:54 +00:00
Tom Stellard	49ae102ee3	radeon/llvm: Use multiclasses for floating point loads The original strategy for handling floating point loads, which was to lower (f32 load) to (f32 bitcast (i32 load)) wasn't really working. The main problem was that the DAG legalizer couldn't handle replacing a node with two results (load) with a node with only one result (bitcast).	2012-07-11 17:47:20 +00:00
Tom Stellard	bbdf3af857	radeon/llvm: Don't set the IMM bit in SMRD instruction definitions. The IMM bit is already being set in SICodeEmitter.	2012-07-11 17:47:20 +00:00
Tom Stellard	d36499aa62	r600g/compute: Add more debugging output	2012-07-11 17:46:59 +00:00
Eric Anholt	f9b3e257d1	i965: Revert the VBOs-in-system-memory hack. It didn't change performance on Lightsmark or Nexuiz, which both used DYNAMIC_DRAW buffers, but it was killing performance (40% CPU wasted pwriting buffers) on a closed-source app we're looking at. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-11 09:20:21 -07:00
Eric Anholt	b5c037f6b1	Add emacs setup for the docs/devinfo.html comment wrapping recommendation. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-11 09:20:21 -07:00
Ian Romanick	a8724d85f8	glx/dri2: Add support for GLX_ARB_create_context_robustness Add the infrastructure required for this extension. There is no xserver support and no driver support yet. Drivers can enable this by advertising DRI2 version 4 and accepting the __DRI_CTX_FLAG_ROBUST_BUFFER_ACCESS flag and the __DRI_CTX_ATTRIB_RESET_STRATEGY attribute in create context. Some additional Mesa infrastructure is needed before drivers can do this. The GL_ARB_robustness spec, which all Mesa drivers already advertise, requires: "If the behavior is LOSE_CONTEXT_ON_RESET_ARB, a graphics reset will result in the loss of all context state, requiring the recreation of all associated objects." It is necessary to land this infrastructure now so that the related infrastructure can land in the xserver. The xserver has very long release schedules, and the remaining Mesa parts should land long, long before the next xserver merge window opens. v2: Expose robustness as a DRI2 extension rather than bumping __DRI_DRI2_VERSION. v3: Add a comment explaining why dri2->base.version >= 3 is also required for GLX_ARB_create_context_robustness. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-11 08:54:50 -07:00
Ian Romanick	de9ed51525	dri2: Hard-code the DRI2 version This allows revising the dri_interface.h separately from adding driver support. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-11 08:54:50 -07:00
Ian Romanick	2879f758b5	glapi: Apply Xorg indent rules to all files generated for the xserver Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-11 08:54:50 -07:00
Kenneth Graunke	a0698b000b	docs: Update GL3.txt. We neglected to list the deprecation model/forward compatible context support. inverse() has been done for a while. None of us know what "highp change" means; GLSL 1.30 already added the ability to recognize precision keywords, and it doesn't look like 1.40 has any new requirements there (precision keywords still have no meaning). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-10 16:53:49 -07:00
Chad Versace	551078bb62	mesa: Remove unneeded extern qualifiers Remove 'extern' from the functions declared in texcompress_etc.h. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-10 16:51:19 -07:00
Vadim Girlin	3770847960	r600g: improve flushed depth texture handling v2 Use r600_resource_texture::flushed_depth_texture for GPU access, and allocate it in the VRAM. For transfers we'll allocate texture in the GTT and store it in the r600_transfer::staging. Improves performance when flushed depth texture is frequently used by the GPU, e.g. in Lightsmark (~30%) Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-07-11 02:39:59 +04:00
Kenneth Graunke	860d5bdf98	i965: Add hardware context support. With fixes and updates from Ben Widawsky and comments from Paul Berry. v2: Use drm_intel_gem_context_destroy to destroy hardware context; remove useless initialization of hw_ctx, both suggested by Eric. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Acked-by: Paul Berry <stereotype441@gmail.com>	2012-07-10 15:09:58 -07:00
Ian Romanick	4fae5e32d5	mesa/test: Update name of GL_TIME_ELAPSED `4952caa` caused the _EXT to fall off the name of this enum. This is fine. Update the unit test to expect the new value. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=51956	2012-07-10 14:46:25 -07:00
Andreas Boll	40742fa686	docs/relnotes-8.0.4: fix html markup	2012-07-10 12:59:34 -07:00
Marek Olšák	67a8ee891b	gallium/docs: document interface changes for timestamp query the query type is already documented	2012-07-10 19:04:13 +02:00
Marek Olšák	a3fccafda9	identity: implement get_timestamp	2012-07-10 19:04:13 +02:00
Marek Olšák	e66d90ec6b	noop: implement get_timestamp	2012-07-10 19:04:13 +02:00
Marek Olšák	642539e3f9	trace: implement get_timestamp	2012-07-10 19:04:12 +02:00
Marek Olšák	a471d268ec	galahad: implement get_timestamp	2012-07-10 19:04:12 +02:00
Marek Olšák	768589e836	docs: update relnotes-8.1 and GL3 status	2012-07-10 19:04:12 +02:00
Marek Olšák	5ddcda060c	softpipe: implement get_timestamp and expose ARB_timer_query PIPE_QUERY_TIMESTAMP is already implemented and working.	2012-07-10 19:04:12 +02:00
Marek Olšák	21f78d2189	st/mesa: implement ARB_timer_query	2012-07-10 19:04:12 +02:00
Marek Olšák	bcc735aaca	gallium: add QUERY_TIMESTAMP cap and get_timestamp screen function	2012-07-10 19:04:12 +02:00
Marek Olšák	d5a7866902	mesa: implement glGet(GL_TIMESTAMP) v2 This adds a new driver function to retrieve the timestamp. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-10 19:04:12 +02:00
Marek Olšák	5094533040	mesa: add ARB_timer_query to the extension list Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-10 19:04:12 +02:00
Marek Olšák	204777c5dc	mesa: add QueryCounter display list support Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-10 19:04:12 +02:00
Marek Olšák	f601dcdf70	mesa: implement TIMESTAMP query and glQueryCounter Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-10 19:04:12 +02:00
Marek Olšák	4952caad2d	glapi: add ARB_timer_query Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-10 19:04:12 +02:00
Ian Romanick	25fec2e9ca	docs: Add 8.0.4 release notes Also add news story. Extra, extra! Read all about it! Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-10 09:05:39 -07:00
Eric Anholt	2d03f48a65	glsl: Add parsing for GLSL uniform blocks. This doesn't do anything with the uniform block declarations yet, so usage of those uniforms finds them to be undeclared. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-09 11:13:33 -07:00
Eric Anholt	912a429bc5	glsl: Don't hide the type of struct_declaration_list. I've been trying to derive from this for UBO support, and the slightly obfuscated types were putting me over the edge. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-09 11:12:18 -07:00
Kenneth Graunke	532e99cbf2	glcpp: Add built-in #define for GL_ARB_uniform_buffer_object. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-09 11:11:59 -07:00
Vincent Lejeune	7fabb2b593	glsl: Parser handles "#extension GL_ARB_uniform_buffer_object" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-09 11:11:38 -07:00
Eric Anholt	f4fb6bf088	glsl: Reduce a bit of extra code in the merging of layout qualifiers. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-09 11:05:33 -07:00
Eric Anholt	60a784d56e	glsl: Take advantage of the layout qualifier flags union to clean up parsing. The got_one variable was set iff one of the bits in flags.i was set. v2: Fix incorrect dropping of the ARB_conservative_depth warning. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-09 11:04:45 -07:00
Tom Stellard	9b00edc79a	r600g: Don't create a texture for the memory_pool during screen init This fixes a segfault in r600_screen_create() introduced by `eb065f5d9d` Reported by tilman on irc.	2012-07-09 12:14:07 -04:00
Tom Stellard	76b44034b9	radeon/llvm: Rename namespace from AMDIL to AMDGPU	2012-07-09 13:43:11 +00:00
Tom Stellard	39323e8f79	r600g: Update number of gprs when adding a vertex instruction	2012-07-09 13:42:24 +00:00
Tom Stellard	da9c8a73ec	r600g/compute: Use evergreen_cb() for binding RATs	2012-07-09 13:41:18 +00:00
Tom Stellard	960906d16b	r600g: Add support for RATs in evergreen_cb()	2012-07-09 13:41:18 +00:00
Tom Stellard	eb065f5d9d	r600g: Use a texture as the underlying resource for compute_memory_pool This is the first step towards being able to use evergreen_cb to bind RATs.	2012-07-09 13:41:18 +00:00
Tom Stellard	9d36441374	r600g: Add is_rat flag to r600_resource_texture	2012-07-09 13:41:18 +00:00
Tom Stellard	3d3194e93c	r600g: Add r600_context_pipe_state_emit() This function is used when dispatching compute shader in order to avoid mixing compute and 3D registers in the context's dirty list. This allows the compute code to reuse 3D functions like evergreen_cb, which return a struct r600_pipe_state and still have control over when and how the register writes are emitted.	2012-07-09 13:41:17 +00:00
Tom Stellard	e00e1586dd	r600g: Add pkt_flag parameter to r600_context_block_emit_dirty() This allows the shader type bit to be set in the pm4 header when emitting registers for compute shaders. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-09 13:41:17 +00:00
Tom Stellard	25145de03e	r600g/compute: Move LOOP_CONST initialization to start_compute_cs atom	2012-07-09 13:41:17 +00:00
Tom Stellard	5016fe2d47	r600g: Add start_compute_cs atom to struct r600_context The start_compute_cs atom initializes some config and context registers to the values needed for running compute shaders. When a compute shader is dispatched, this atom is emitted after the start_cs_cmd atom, which initializes registers that are common to both 3D and compute. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-09 13:41:17 +00:00
Tom Stellard	38be0966c7	r600g: Add pkt_flag member to struct r600_command_buffer Some packets require the shader type bit (bit 1) to be set when used for compute shaders. The pkt_flag will be initialized to RADEON_CP_PACKET3_COMPUTE_MODE for any struct r600_command_buffer used for dispatching compute shaders and it will be or'd against the result of the PKT3 macro when adding a new packet to a struct r600_command_buffer. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-09 13:41:17 +00:00
Tom Stellard	7d0c17fe74	r600g: Only emit start_cs_cmd atom once for compute command streams	2012-07-09 13:41:17 +00:00
Marek Olšák	0a21b561c7	r600g: fix stencil texturing with Z32_FLOAT_S8X24_UINT	2012-07-09 13:58:00 +02:00
Marek Olšák	a460df9299	r600g: add assertions after translate_colorswap/colorformat/dbformat/texformat	2012-07-09 13:57:59 +02:00
Marek Olšák	c1e8c845ea	r600g: inline r600_hw_copy_region	2012-07-09 13:57:59 +02:00
Marek Olšák	9974e9ac5d	r600g: enable dual src blending on r7xx No lockups here.	2012-07-09 13:57:59 +02:00
Marek Olšák	6657a7af61	r600g: use depth format from pipe_surface, not pipe_resource	2012-07-09 13:57:59 +02:00
Marek Olšák	b278aba423	r600g: use u_box_origin_2d helper function	2012-07-09 13:57:59 +02:00
Marek Olšák	1f50f463eb	gallium/u_blitter: consolidate some state changes	2012-07-09 13:57:59 +02:00
Marek Olšák	22d032707e	r600g: remove stray semicolon	2012-07-07 15:09:57 +02:00
Marek Olšák	461e9f99c7	docs: document ARB_blend_func_extended and EXT_texture_rg in relnotes-8.1 also sort the extensions	2012-07-07 15:09:57 +02:00
Eric Anholt	1e28f55ab7	i965/fs: Invalidate live intervals after copy propagation. For copy propagation, we've dropped the use of a GRF in favor of a (probably later) use of a different GRF. This definitely requires invalidating intervals. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-06 14:20:33 -07:00
Eric Anholt	2343fe9a5d	i965/fs: Invalidate live intervals in passes that remove an instruction. Since live intervals are based on ip, removing an instruction trashes the intervals unless we were to go do some surgery. These happen to usually remove a use of a grf, so it's time to recalculate, anyway. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> NOTE: This is a candidate for the 8.0 release branch.	2012-07-06 14:20:33 -07:00
Eric Anholt	25ca9cc823	i965/vs: Move the other two src_reg/dst_reg constructors to brw_vec4.cpp. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-06 14:20:33 -07:00
Eric Anholt	b2f5d4c3ec	i965/vs: Move class functions to brw_vec4.cpp. This has less impact than for the FS (4k savings), because it was partially done already, but makes things more consistent. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-06 14:20:32 -07:00
Eric Anholt	fe27916ddf	i965/fs: Move class functions from the header to .cpp files. Cuts compile time for brw_fs.h changes from 2.7s to .7s and reduces i965_dri.so size by 70k. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-06 14:20:32 -07:00
José Fonseca	8b1f1900d1	galahad: Check that texture format is supported.	2012-07-06 20:38:41 +01:00
José Fonseca	ff8ddf399a	galahad: More detailed resource checks.	2012-07-06 20:22:29 +01:00
José Fonseca	f8e13e6d69	galahad: Fix zealous warnings.	2012-07-06 20:12:56 +01:00
José Fonseca	7bd926af89	galahad: Enumerate all methods that are missing.	2012-07-06 19:13:44 +01:00
José Fonseca	3d2550be9c	galahad: Implement render_condition.	2012-07-06 18:45:14 +01:00
José Fonseca	5b45775e41	galahad: Don't implement context methods that are not implemented by the underlying pipe driver.	2012-07-06 18:38:51 +01:00
José Fonseca	3cb994afca	galahad: Use debug_printf. stderr is not visible on windows.	2012-07-06 18:38:39 +01:00
José Fonseca	1abb070633	galahad: Silence creation messages. Let galahad warnings be true warnings.	2012-07-06 18:37:48 +01:00
José Fonseca	d78dee1671	galahad: Use reference counting when destroying the wrapped objects. As the wrapped pipe driver may hold internal references.	2012-07-06 18:35:44 +01:00
José Fonseca	fe602da63f	galahad: Point to the galahad objects from the galahad sampler view. And not the wrapped driver's objects.	2012-07-06 18:35:32 +01:00
José Fonseca	04d29afb8b	galahad: Don't defer index buffer when it's NULL.	2012-07-06 17:02:39 +01:00
José Fonseca	232073b0d9	target-helpers: Enable debug helpers only on debug builds. Some of these helpers use debug_get_option, which also works on release builds.	2012-07-06 15:05:16 +01:00
Marek Olšák	c445b0f76d	st/mesa: only expose ARB_shader_bit_encoding with GLSL 1.3 I don't think it's possible or even useful to use the extension with GLSL 1.2. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-06 00:45:38 +02:00
Kristian Høgsberg	5f5746a692	egl_dri2: Reorganize the EGLImage constructors to share more code We factor out all the EGL book-keeping into dri2_create_image() and simplify the wayland case by using dupImage. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-05 14:22:07 -04:00
Kristian Høgsberg	1bb15c0a08	intel: Share common __DRIimage allocation code We have the same switch and allocation code in two places. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-05 14:22:07 -04:00
Kristian Høgsberg	454fc07dde	intel: Just look up image->internal_format using _mesa_get_format_base_format Signed-off-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-05 14:22:07 -04:00
Kristian Høgsberg	e408c17767	intel: Remove unused __DRIimage::data_type field Signed-off-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-05 14:22:06 -04:00
Brian Paul	bbe92dc608	svga: whitespace fixes	2012-07-05 08:07:26 -06:00
Brian Paul	76a6801240	Revert "mesa: #define fprintf to be __mingw_fprintf() on Mingw32" This reverts commit `cbffaf20e9`. Use the PRIx64 macro in the fprintf() call instead, as suggested by Dylan Noblesmith. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:07:26 -06:00
Brian Paul	df2d81ea59	mesa: use the PRIx64 macro for printing 64-bit hexadecimal values We'll revert the #define fprintf __mingw_fprintf change next. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:07:25 -06:00
Brian Paul	1ab37a2284	svga: implement TGSI_OPCODE_ROUND ROUND and TRUNC are implemented with one function to reduce code duplication. Note: ROUND isn't actually used yet, but probably will be soon. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:07:03 -06:00
Brian Paul	d594f72e16	svga: fix CMP translation for vertex shaders Converting CMP to SLT+LRP didn't work when src2 or src3 was Inf/NaN. That's the case for GLSL sqrt(0). sqrt(0) actually happens in many piglit auto-generated tests that use the distance() function. v2: remove debug/devel code, per Jose Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:03:19 -06:00
Brian Paul	30f8575fde	svga: properly implement TRUNC instruction Was previously implemented with FLOOR. Fixes quite a few piglit tests of float->int conversion, integer division, etc. v2: clean up left over debug/devel code, per Jose Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:03:19 -06:00
Brian Paul	0bd3a75de9	svga: fix register collision issue in emit_conditional() If the 'dst' register is the same as the 'pass' register we'll generate invalid code. Use a temporary register in that case. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:03:19 -06:00
Brian Paul	9b3d87b092	svga: emit some debug messages when shader compilation fails	2012-07-05 07:59:20 -06:00
Eric Anholt	33526a2ffe	intel: Fix a comment typo.	2012-07-04 13:59:14 -07:00
Gwenole Beauchesne	69f031cc19	mesa: add GL_EXT_texture_rg extension for OpenGL ES 2.x.	2012-07-04 15:26:22 -04:00
Kristian Høgsberg	3ed8d42853	GLES2: upgrade gl2ext.h to version 18099 Redo this commit, and remove the inclusion of gl2ext.h from src/mapi/glapi/glapi_priv.h. The include was added in `8f3be33985` to fix a missing prototype for glDrawBuffersNV and others, but it's not possible to include both glext.h and gl2ext.h from the same file. I don't see the missing prototype here (with or without shared glapi) so I'm just removing the offending #include. Also, since we're redoing this, update to the most recent gl2ext.h. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-04 15:26:22 -04:00
Olivier Galibert	e620f3e763	mesa/st: gl_ClipDistance must be interpolated in 3d space. That old bug was hidden by the clipper always interpolating in 3d space no matter what it should have been doing. Now that the interpolation has been fixed, the bug shows up. Fixes fdo 51364. Signed-off-by: Olivier Galibert <galibert@pobox.com> Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-07-04 10:47:14 +01:00
Stuart Abercrombie	95ce454c8c	gallium/util: Save and restore vertex buffer state in util_gen_mipmap. Calling glGenerateMipmap could overwrite vertex buffer state, leading to incorrect rendering or crashes depending on the Gallium driver. This was happening on WebGL Conformance test texture-size. Before `784dd51198` this was covered up by redundant vertex buffer validation. Reviewed-by: Stéphane Marchesin <marcheu@chromium.org> Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-07-04 03:48:29 +02:00
Marek Olšák	567fcd2eb9	Revert "GLES2: upgrade gl2ext.h to version 16994." This reverts commit `8818b88748`. I get a lot of errors like this one: In file included from ../../../src/mapi/glapi/glapi_priv.h:49:0, from glapi_dispatch.c:40: ../../../include/GLES2/gl2ext.h:1074:28: error: redefinition of typedef ‘PFNGLRENDERBUFFERSTORAGEMULTISAMPLEEXTPROC’ ../../../include/GL/glext.h:10237:25: note: previous declaration of ‘PFNGLRENDERBUFFERSTORAGEMULTISAMPLEEXTPROC’ was here This with a clean build (with git clean -fdX). I don't get the errors on my other machine. I didn't investigate why, a wild guess is that this depends on the version of gcc.	2012-07-04 01:40:05 +02:00
Marek Olšák	2668aaa557	Revert "mesa: add GL_EXT_texture_rg extension for OpenGL ES 2.x." This reverts commit `d1665388ce`.	2012-07-04 01:39:52 +02:00
Gwenole Beauchesne	d1665388ce	mesa: add GL_EXT_texture_rg extension for OpenGL ES 2.x.	2012-07-03 16:23:38 -04:00
Gwenole Beauchesne	8818b88748	GLES2: upgrade gl2ext.h to version 16994.	2012-07-03 16:23:38 -04:00
Eric Anholt	dd4282e38f	i965/fs: Allow copy propagation on uniforms. This is a big win for savage2, hon and yofrankie. 62 new programs for savage2/hon get 16-wide mode, along with one for humus demos and two for tropics. Even a few shaders from tropics see reductions of 15% or more. total instructions in shared programs: 216536 -> 207353 (-4.24%) instructions in affected programs: 123941 -> 114758 (-7.41%) In benchmarking Tropics, only a .040% +/- 034% performance improvement was observed (n=90). Rather disappointing, but I was primarily motivated to do this patch by a regression in the number of 16-wide shaders compiled after a GRF texturing on IVB patch I'm working on. Hopefully this helps avoid that regression. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-03 12:57:10 -07:00
Eric Anholt	0c4630bae0	i965/fs: Allow copy propagation with source modifiers. This shaves a few instructions off of a ton of programs. For 12 shaders from tropics and sanctuary, it's enough reduction in register pressure to get 16-wide mode. 7 shaders from heroes of newerth and savage2 are hurt by about 1.1%, where copy propagation of negates ends up preventing coalescing, but we could regain that by doing dataflow analysis in our copy propagation. No significant performance difference in tropics (n=11) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-03 12:57:04 -07:00
Eric Anholt	458f7f0141	i965/fs: Move copy propagation test out to a separate function. It's going to get more complicated in a moment. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-03 12:55:47 -07:00
Ian Romanick	5fb178ee43	glx/tests: Fix off-by-one error in allocating extension string buffer NOTE: This is a candidate for the 8.0 release branch. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50621 Bugzilla: https://bugs.gentoo.org/show_bug.cgi?id=418161 Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: Markus Oehme <oehme.markus@gmx.de>	2012-07-03 12:28:45 -07:00
Brian Paul	1853f467c6	glsl: fix unop/binop errors in comments	2012-07-03 09:42:59 -06:00
Paul Berry	f34764ea53	msaa: Make meta-ops save and restore state of GL_MULTISAMPLE. The meta-ops _mesa_meta_Clear() and _mesa_meta_glsl_Clear() need to ignore the state of GL_SAMPLE_ALPHA_TO_COVERAGE, GL_SAMPLE_ALPHA_TO_ONE, GL_SAMPLE_COVERAGE, GL_SAMPLE_COVERAGE_VALUE, and GL_SAMPLE_COVERAGE_INVERT when clearing multisampled buffers. The easiest way to accomplish this is to disable GL_MULTISAMPLE during the clear meta-ops. Note: this patch also causes GL_MULTISAMPLE to be disabled during _mesa_meta_GenerateMipmap() and _mesa_meta_GetTexImage() (since those two meta-ops use MESA_META_ALL). Arguably this isn't strictly necessary, since those meta-ops use their own non-MSAA fbo's, but it shouldn't do any harm. Fixes Piglit tests "EXT_framebuffer_multisample/clear {2,4} {color,stencil}" on i965. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-02 14:09:27 -07:00
Paul Berry	8313f44409	i965/msaa: Fix centroid interpolation of unlit pixels. From the Ivy Bridge PRM, Vol 2 Part 1 p280-281 (3DSTATE_WM: Barycentric Interpolation Mode): "Errata: When Centroid Barycentric mode is required, HW may produce incorrect interpolation results when a 2X2 pixels have unlit pixels." To work around this problem, after doing centroid interpolation, we replace the centroid-interpolated values for unlit pixels with non-centroid-interpolated values (which are interpolated at pixel centers). This produces correct rendering at the expense of a slight increase in shader execution time. I've conditioned the workaround with a runtime flag (brw->needs_unlit_centroid_workaround) in the hopes that we won't need it in future chip generations. Fixes piglit tests "EXT_framebuffer_multisample/interpolation {2,4} {centroid-deriv,centroid-deriv-disabled}". All MSAA interpolation tests pass now. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-02 13:27:36 -07:00
Paul Berry	3f929efa28	i965/fs: Add FS_OPCODE_MOV_DISPATCH_TO_FLAGS to fragment shader backend. In order to compute centroid varyings correctly, the fragment shader needs to be able to load the current pixel/sample mask into a flag register. This patch adds an opcode to the fragment shader back-end to do this; the opcode gets translated into the instruction mov(1) f0<1>UW g1.14<0,1,0>UW { align1 WE_all } Since this instruction clobbers f0, instruction scheduling has to treat it the same as instructions that have a conditional modifier. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-02 13:27:36 -07:00
Jordan Justen	8aa78c104a	i965: fix transform feedback with primitive restart When querying GL_PRIMITIVES_GENERATED, if primitive restart is also used, then take the software primitive restart path so GL_PRIMITIVES_GENERATED is returned correctly. GL_TRANSFORM_FEEDBACK_PRIMITIVES_WRITTEN is also updated since it is also affected by the same issue. As noted in brw_primitive_restart.c, with further work we should be able to move this situation back to a hardware handled path. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 11:42:48 -07:00
Kenneth Graunke	14311ef3f2	i965: Re-enable rendering to SNORM formats. Commit `d73f6375f5` fixed the cause of the Piglit failure with ARB_color_buffer_float fragment clamp modes. Now that it's fixed, there's no reason to leave snorm format rendering disabled. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 11:23:37 -07:00
Kenneth Graunke	b1802a2115	glsl: Remove unused ir_loop_jump::loop pointer. Commit `0c005bd7` intended to make ir_loop_jump::mode public, but also accidentally added a new pointer to the enclosing loop. Furthermore, it tried to initialize the new field by adding "this->loop = loop;" to the constructor, but since there is no loop parameter, this only initialized the field to itself---so it will likely be a garbage pointer. A lot of code, such as lower_jumps, allocates new loop jumps without setting this field appropriately, so any uses would probably just crash. Thankfully, there were none, so we can just delete the field. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=51574 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-02 11:08:59 -07:00
Kenneth Graunke	d73f6375f5	meta: Don't alter fragment color clamp in DrawPixels(). DrawPixels uses the MESA_META_CLAMP_FRAGMENT_COLOR flag to save/restore the fragment color clamp mode. This is unnecessary since it never alters it. It's also harmful: when the clamp mode is GL_FIXED_ONLY, setting this flag causes _mesa_meta_begin to force it to GL_FALSE, breaking clamping on SNORM formats. DrawPixels should use the user-specified clamp mode and not change it. Fixes Piglit's spec/ARB_color_buffer_float/GL_RGBA8_SNORM-drawpixels test on i965/Sandybridge (with SNORM render targets re-enabled). Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 11:08:48 -07:00
Marek Olšák	9f0f2f9512	mesa: use FLUSH_CURRENT and not FLUSH_VERTICES in _mesa_validate_* ASSERT_OUTSIDE_BEGIN_END_AND_FLUSH_WITH_RETVAL calls FLUSH_VERTICES, which is not what we want. This fixes a breakage in classic drivers, introduced in: `62b9716739` vbo: first ASSERT_OUTSIDE_BEGIN_END then FLUSH, not the other way around It should fix: https://bugs.freedesktop.org/show_bug.cgi?id=51629 https://bugs.freedesktop.org/show_bug.cgi?id=51642 Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-02 17:48:36 +02:00
Dylan Noblesmith	876889b355	mesa: point to Makefile.old in the srcdir Gets out-of-tree builds slightly closer to working. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 15:14:46 +00:00
Dylan Noblesmith	91ecba9d05	mesa: fix parser source gen for out-of-tree builds Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 15:14:39 +00:00
Dylan Noblesmith	261b1389eb	mesa: fix api source gen for out-of-tree builds Add $(srcdir) where needed. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 15:14:27 +00:00
Dylan Noblesmith	43bca86c1b	glapi/gen: fix out of tree build Add "-f $(srcdir)/gl_API.xml" to the arguments of all the scripts that by default look for gl_API.xml in the working directory when run with no arguments, and prepend $(srcdir) to those scripts that are already using an explicit -f argument. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 15:13:58 +00:00
José Fonseca	f5c41e16d7	gallium/tgsi: Don't declare temps individually when they are all similar. tgsi_ureg was recently enhanced to support local temporaries, and as a result temps are declared individually. This change avoids many TEMP register declarations on common shaders. (And fixes performance regression due to mismatches against performance sensitive shaders.) Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-02 12:14:53 +01:00
José Fonseca	e75fe7ba08	gallivm: Cleanup the 4 x float -> 16 ub special path in lp_build_conv. No behaviour change intended. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-02 12:13:52 +01:00
José Fonseca	63e0e4b8f5	gallium/util: Add ULL suffix to large constants. As suggested by Andy Furniss: it looks like some old gcc versions require it.	2012-07-02 12:12:42 +01:00
Tom Stellard	1d21bd057a	clover: Handle NULL devs argument in clBuildProgram If devs is NULL, then the kernel should be compiled for all devices associated with the program.	2012-07-01 15:45:24 +02:00
Francisco Jerez	c6bb41c28b	clover: Define non-templated copy constructor for clover::ref_ptr. The templated copy constructor doesn't prevent the compiler from emitting a default copy constructor, which leads to inconsistent memory handling and was reported to cause segfaults when doing event manipulation. Reported-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-01 15:37:30 +02:00
Brian Paul	db2b6ca504	llvmpipe: fix comment typo	2012-06-29 17:19:12 -06:00
Brian Paul	9dfe92019a	st/mesa: use DEBUG_INCOMPLETE_FBO debug flag	2012-06-29 17:19:12 -06:00
Brian Paul	b186a9df32	mesa: remove some unused gl_dlist_state fields	2012-06-29 17:19:12 -06:00
Tom Stellard	ca8fa02308	clover: Add a function internalizer pass before LTO v2 The function internalizer pass marks non-kernel functions as internal, which enables optimizations like function inlining and global dead-code elimination. v2: - Pass vector arguments by const reference	2012-06-29 18:46:18 +00:00
Tom Stellard	a31b2f7107	radeon/llvm: Enable vec4 loads on R600	2012-06-29 18:46:18 +00:00
Tom Stellard	e17c586d08	radeon/llvm: Enable floating point stores on R600	2012-06-29 18:46:18 +00:00
Tom Stellard	b66ef1f48c	radeon/llvm: Handle floating point loads on R600	2012-06-29 18:46:18 +00:00
Tom Stellard	c01199dfc0	radeon/llvm: Expand UDIV and UREM nodes	2012-06-29 18:46:18 +00:00
Tom Stellard	2c485cda20	radeon/llvm: Emit raw ISA for vertex fetch instructions	2012-06-29 18:46:18 +00:00
José Fonseca	16e0ebccb6	gallium/util: Truly disable INF/NAN tests on MSVC. Thanks to Brian for spotting this.	2012-06-29 14:49:23 +01:00
José Fonseca	c9bada497c	gallium/util: Disable INF/NAN tests on MSVC. Somehow they are not recognized as constants.	2012-06-29 13:39:07 +01:00
José Fonseca	fa8dcb848f	translate: Free elt8_func/elt16_func too. These were leaking. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-06-29 12:21:08 +01:00
James Benton	6dd8e6f9cb	util: Reimplement half <-> float conversions. Removed u_half.py used to generate the table for previous method. Previous implementation of float to half conversion was faulty for denormalised and NaNs and would require extra logic to fix, thus making the speedup of using tables irrelevant. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-29 12:21:02 +01:00
James Benton	c8d3481cdb	tests: Updated tests to properly handle NaN for half floats. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-29 12:20:59 +01:00
James Benton	60dca53833	util: Updated u_format_tests to rigidly test half-float boundary values. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-29 12:20:57 +01:00
James Benton	d069d8ef38	util: Added functions for checking NaN / Inf for double and half-floats. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-29 12:20:54 +01:00
James Benton	34075d4133	util: Added util_format_is_array. This function checks whether a format description is in a simple array format. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-29 12:20:37 +01:00
Marek Olšák	fcebb157f0	vbo: optimize validation for glMultiDrawElements Some parameters need to be checked only once. check_valid_to_render needs to be called only once. The validate function is based on the one for DrawElements. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-28 22:46:51 +02:00
Marek Olšák	62b9716739	vbo: first ASSERT_OUTSIDE_BEGIN_END then FLUSH, not the other way around Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-28 22:46:51 +02:00
Marek Olšák	d9eb1a1225	vbo: don't call twice _mesa_valid_to_render in DrawArraysInstancedBaseInstance It's called in _mesa_validate_DrawArraysInstanced already. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-28 22:46:51 +02:00
Marek Olšák	15ac66e331	mesa: rename MaxTransformFeedbackSeparateAttribs to MaxTransformFeedbackBuffers This is a cleanup for ARB_transform_feedback3, where GL_MAX_TRANSFORM_FEEDBACK_BUFFERS is introduced for interleaved attribs and has the same meaning as GL_MAX_.._SEPARATE_ATTRIBS for separate attribs. Also, the maximum number of TFB buffers is reduced from 32 to 4, which makes this patch useful even without the extension. I don't know of any hardware which can do more than 4. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-28 22:46:51 +02:00
José Fonseca	638779e445	gallivm: Refactor lp_build_broadcast(_scalar) to share code. Doesn't really change the generated assembly, but produces more compact IR, and of course, makes code more consistent. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-28 20:20:34 +01:00
Johannes Obermayr	bf679ce1dc	gallivm: Fix potential buffer overflowing in strncat. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-06-28 11:47:23 +01:00
Marcin Slusarz	1906d2b46b	nv50: dynamically allocate space for shader local storage Fixes 21 piglit tests: spec/glsl-1.10/execution/variable-indexing/ fs-temp-array-mat4-index-col-row-wr vs-temp-array-mat4-index-col-row-wr vs-temp-array-mat4-index-row-wr spec/glsl-1.20/execution/variable-indexing/ fs-temp-array-mat3-index-col-row-rd fs-temp-array-mat3-index-row-rd fs-temp-array-mat4-col-row-wr fs-temp-array-mat4-index-col-row-rd fs-temp-array-mat4-index-col-row-wr fs-temp-array-mat4-index-row-rd fs-temp-array-mat4-index-row-wr vs-temp-array-mat3-index-col-row-rd vs-temp-array-mat3-index-col-row-wr vs-temp-array-mat3-index-row-rd vs-temp-array-mat3-index-row-wr vs-temp-array-mat4-col-row-wr vs-temp-array-mat4-index-col-row-rd vs-temp-array-mat4-index-col-row-wr vs-temp-array-mat4-index-col-wr vs-temp-array-mat4-index-row-rd vs-temp-array-mat4-index-row-wr vs-temp-array-mat4-index-wr ... and prevents a lot of GPU lockups	2012-06-28 00:01:02 +02:00
Marcin Slusarz	0fceaee4fd	nv50: streamline screen_create error handling Remove macro which changes control flow (it's evil). Make all fail paths print (correct) error message.	2012-06-28 00:01:02 +02:00
Marcin Slusarz	96259b5128	nv50/ir: make colorful ir dump output optional	2012-06-28 00:01:02 +02:00
Brian Paul	9881bf6e69	mesa: more const qualifiers to match the latest glext.h For some reason regular gcc on Linux didn't catch these but the mingw compiler did (generated errors, not warnings). v2: include the changes in src/mapi/ too	2012-06-27 15:37:10 -06:00
Brian Paul	827bdee7d1	glapi: add const qualifier to glShaderSourceARB() parameter Fixes the es2 build with gcc. Note: in glext.h the prototypes for glShaderSource() and glShaderSourceARB() disagree: only the former has the extra const qualifier. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-06-27 15:37:10 -06:00
Jordan Justen	3588098ed8	i965: enable ARB_instanced_arrays extension Set the step_rate value when drawing to implement ARB_instanced_arrays for gen >= 4. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-27 10:35:14 -07:00
Brian Paul	8fb1e4a462	glsl: be more careful about counting varying vars in the linker Previously, we were counting gl_FrontFacing, gl_FragCoord and gl_PointCoord against the limit of varying variables. This prevented some valid shaders from linking. The other potential solution to this is to have the driver advertise more varying vars or set the GLSLSkipStrictMaxVaryingLimitCheck flag. But the above-mentioned variables aren't conventional varying attributes so it doesn't seem right to count them. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-27 11:31:16 -06:00
Andreas Boll	d9d84068e7	docs/helpwanted: add some useful todo lists Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-27 11:19:21 -06:00
Brian Paul	098aa5f9ab	softpipe: fix numFragsEmitted debug code	2012-06-27 07:50:57 -06:00
Brian Paul	81e2a238bc	gallium: minor whitespace, comment changes	2012-06-27 07:50:57 -06:00
Brian Paul	51b0a0b33c	mesa: update glext.h to version 81	2012-06-27 07:50:57 -06:00
Brian Paul	52dd8961eb	mesa: update glxext.h to version 33	2012-06-27 07:50:57 -06:00
Brian Paul	8459f4a63a	mesa: make _mesa_reference_array_object() an inline function As we do for texture objects, buffer objects, etc.	2012-06-27 07:50:57 -06:00
Brian Paul	dcf1dafa9e	mesa: look up enum name for glEnable/Disable errors	2012-06-27 07:50:56 -06:00
Brian Paul	86ccd9aaac	mesa: move TEXGEN defines closer to gl_texgen struct	2012-06-27 07:50:56 -06:00
Brian Paul	4cb3579e52	mesa: rename ColorMaterialBitmask to _ColorMaterialBitmask Since it's a derived field.	2012-06-27 07:50:56 -06:00
Brian Paul	b114ff3783	mesa: re-order, update comments on lighting-related structs	2012-06-27 07:50:56 -06:00
José Fonseca	d1c5ea9207	gallium/util: Fix parsing of options with underscore. For example GALLIVM_DEBUG=no_brilinear which was being parsed as two options, "no" and "brilinear".	2012-06-27 11:16:18 +01:00
James Benton	789436f1e0	gallivm: Added a generic lp_build_print_value which prints a LLVMValueRef. Updated lp_build_printf to share common code. Removed specific lp_build_print_vecX. Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-27 11:16:18 +01:00
Stéphane Marchesin	45fc069600	i915g: Implement sRGB textures Since we don't have them in hw we emulate them in the shader. Although not recommended by the spec it is legit. As a side effect we also get GL 2.1. I think this is as far as we can take the i915.	2012-06-26 23:18:15 -07:00
Brian Paul	3bc39414ab	svga: return 120 for PIPE_CAP_GLSL_FEATURE_LEVEL Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-06-26 17:03:33 -06:00
Brian Paul	ac8613c298	llvmpipe: return 120 for PIPE_CAP_GLSL_FEATURE_LEVEL Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-06-26 17:03:33 -06:00
Carl Worth	d8e61f8f86	glsl: glcpp: Extend testing of #line directives The most recent commit adds support for comments and macro expansion on #line directives. Add testing to verify the new features. Signed-off-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-26 15:23:55 -07:00
Carl Worth	aac78ce823	glsl: glcpp: Move handling of #line directives from lexer to parser. The GLSL specification requires that #line directives be interpreted after macro expansion. Our existing implementation of #line macros in the lexer prevents conformance on this point. Moving the handling of #line from the lexer to the parser gives us the macro expansion we need. An additional benefit is that the preprocessor also now supports comments on the same line as #line directives. Finally, the preprocessor now emits the (fully-macro-expanded) #line directives into the output. This allows the full GLSL compiler to also see and interpret these directives so it can also generate correct line numbers in error messages. Signed-off-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-26 15:23:49 -07:00
Carl Worth	39f8c46eaa	glsl: glcpp: Rename and document _glcpp_parser_expand_if This function is currently used only in the expansion of #if lines, but we will soon be using it more generally (for the expansion of (_glcpp_parser_expand_and_lex_from) and some more documentation. Signed-off-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-26 15:21:16 -07:00
Carl Worth	1db463ce2e	glsl: Consistently use length-based ralloc string functions for info_log. Commit `b823b99ec0` switched from using functions such as ralloc_asprintf and ralloc_strcat to ralloc_asprintf_rewrite_tail. This change maintains the string's length as a parameter that is updated by the ralloc functions (rather than recomputing it with strlen over and over). However, the change failed to update two locations (glcpp_error and glcpp_warning), with the result that the string's length wasn't updated by these calls. Then, subsequent calls to other ralloc_asprintf_rewrite_tail would overwrite the text appended by glcpp_error. This commit fixes the two missing updates, and restores line numbers to the output of glcpp error messages (as noticed by a glcpp unit test case that has been failing since the above-mentioned commit). Signed-off-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-26 15:20:53 -07:00
Carl Worth	c96b8302a3	glsl: glcpp: Allow "#if undefined-macro" to evaluate to false. A strict reading of the GLSL specification would have this be an error, but we've received reports from users who expect the preprocessor to interpret undefined macros as 0. This is the standard behavior of the preprocessor for C, and according to these user reports is also the behavior of other OpenGL implementations. So here's one of those cases where we can make our users happier by ignoring the specification. And it's hard to imagine users who really, really want to see an error for this case. The two affected test cases are updated to reflect the new behavior. Signed-off-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-26 15:20:03 -07:00
Jerome Glisse	b75f1d973c	r600g: enable DUAL_EXPORT mode when possible on r6xx/r7xx DUAL_EXPORT can be enabled on r6xx/r7xx when all CBs use 16-bit export and there is no depth/stencil export. Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-06-27 02:06:55 +04:00
Vadim Girlin	470d00c0e2	r600g: enable DUAL_EXPORT mode when possible It seems DUAL_EXPORT on evergreen may be enabled when all CBs use 16-bit export mode (EXPORT_4C_16BPC), also there should be at least one CB, and the PS shouldn't export depth/stencil. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-06-27 02:06:55 +04:00
Vadim Girlin	0c47d9dcab	r600g: avoid unnecessary shader exports v2 In some cases TGSI shader has more color outputs than the number of CBs, so it seems we need to limit the number of color exports. This requires different shader variants depending on the nr_cbufs, but on the other hand we are doing less exports, which are very costly. v2: fix various piglit regressions Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-06-27 02:06:55 +04:00
Vadim Girlin	4acf71f01e	r600g: cache shader variants instead of rebuilding v3 Shader variants are stored in the list, the key for lookup is based on the states that require different hw shaders - currently it's rctx->two_side (all gpus) and rctx->nr_cbufs (evergreen/cayman, when writes_all property is set). v2: - use simple list instead of keymap as suggested by Marek on irc - call r600_adjust_gprs from r600_bind_vs_shader for r6xx/r7xx (r600_shader_select isn't used for vertex shaders currently) v3: - fix call to r600_adjust_gprs - do it after updating current shader Improves performance for some apps, e.g. FlightGear - see https://bugs.freedesktop.org/show_bug.cgi?id=50360 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-06-27 02:06:55 +04:00
Brian Paul	55a89889ba	svga: handle missing PIPE_CAP_x queries And fix incorrect error message for a bad shader type/number. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-06-26 15:03:44 -06:00
Brian Paul	056e9b4511	llvmpipe: handle more PIPE_CAP_x queries As with the previous commit for softpipe. v2: remove 'default' case to get compile-time warning Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-06-26 15:03:44 -06:00
Brian Paul	7d23dcdacc	softpipe: handle more PIPE_CAP_x queries These all return zero. Add a debug_printf() to catch the default case so we don't accidentally mishandle something important in the future. v2: remove 'default' case to get compile-time warning Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-06-26 15:03:43 -06:00
Brian Paul	80efb524ee	svga: return 1 for PIPE_CAP_MIXED_COLORBUFFER_FORMATS This is actually required for GL_ARB_framebuffer_object, but the state tracker doesn't currently check it. Direct3D 9 allows mixed format color buffers with some restrictions. Setting this allows Unigine Heaven 2.5 and 3.0 to run. Tested both on GL and D3D hosts. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com>	2012-06-26 15:03:43 -06:00
Brian Paul	36b3ee2ffc	glsl: fix comment typo	2012-06-26 10:01:03 -06:00
Olivier Galibert	27e94ba4ea	u2f_emit: Fix type parameter in LLVM call. The type is the destination type (i.e. float vector) and not the source type. Fixes piglit fs-{in,de}crement-uint. Signed-off-by: Olivier Galibert <galibert@pobox.com> Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-06-26 16:55:40 +01:00
Paul Berry	6c355cca91	i965/msaa: Set KILL_ENABLE when GL_ALPHA_TO_COVERAGE enabled. i965 hardware needs to be informed of situations in which it's possible for pixels (or samples) to be discarded for reasons other than depth/stencil testing (e.g. due to an explicit "discard" in the fragment shader). One of these situations is when GL_ALPHA_TO_COVERAGE is enabled, since that can cause samples to be discarded by the color calculator when the pixel's alpha value is less than 1.0. Without this patch, GL_ALPHA_TO_COVERAGE does not take effect on depth buffers. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-06-26 07:45:54 -07:00
Paul Berry	bc53e14d98	i965/msaa: Implement GL_SAMPLE_ALPHA_TO_{COVERAGE,ONE}. This patch enables the multisampling parameters GL_SAMPLE_ALPHA_TO_COVERAGE and GL_SAMPLE_ALPHA_TO_ONE, which allow the fragment shader's alpha output to be converted into a sample coverage mask and ignored for blending. i965 supports these parameters through the BLEND_STATE structure. The GL spec allows, but does not require, the implementation to dither the conversion from alpha to a sample coverage mask, so that alpha values that aren't a multiple of 1/num_samples result in the correct proportion of samples being lit. A bit exists in the BLEND_STATE structure to enable this functionality, but according to the hardware docs it must be disabled on Sandy Bridge (see the Sandy Bridge PRM, Vol2, Part1, p379: AlphaToCoverage Dither Enable). So it is enabled for Gen7 only. Fixes piglit tests "EXT_framebuffer_multisample/sample-alpha-to-{coverage,one} {2,4}". Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-06-26 07:45:54 -07:00
Paul Berry	9ea60ce58f	i965/msaa: Implement glSampleCoverage. This patch enables glSampleCoverage() functionality, which allows the client program to specify that only a portion of the samples be lit up when performing multisampled rendering. i965 supports glSampleCoverage() through the 3DSTATE_SAMPLE_MASK command packet, which allows the driver to specify a bitfield indicating which samples to light up. Fixes piglit tests "EXT_framebuffer_multisample/sample-coverage {2,4} {inverted,non-inverted}". Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-06-26 07:45:54 -07:00
José Fonseca	4bde1ba7fb	st/wgl: Add a few more comments.	2012-06-26 10:15:36 +01:00
Marek Olšák	cc2cd8b356	r600g: don't disable streamout if it hasn't been started	2012-06-26 03:37:24 +02:00
Marek Olšák	496399d8e9	u_blitter: disable streamout before rendering This fixes piglit EXT_transform_feedback tests: - intervening-read output - intervening-read prims_written	2012-06-26 03:37:23 +02:00
Chad Versace	cf0bbb30f6	i965/fs: Fix conversions float->bool, int->bool Fixes gles2conform GL.equal.equal_bvec2_frag. This fixes brw_fs_visitor's translation of ir_unop_f2b. It used CMP to convert the float to one of 0 or ~0. However, the convention in the compiler is that true is represented by 1, not ~0. This patch adds an AND to convert ~0 to 1. By inspection, a similar problem existed with ir_unop_i2b, with a similar fix. [v2 kayden]: eliminate extra temporary register. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=49621 Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-06-25 15:56:40 -07:00
Brian Paul	345ee593e9	st/wgl: 80-column wrapping	2012-06-25 16:10:01 -06:00
Andreas Boll	19534579cf	docs/lists: add piglit mailing list Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	24eebf4f88	docs/helpwanted: update some info Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	f29f5e8695	docs/sourcetree: update some info Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	b347bb5dbc	docs/devinfo: update release info Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	398d8be3ab	docs/systems: add some useful driver links Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	297309ce23	docs: update some broken/old links Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	dae9b0f1d8	docs: whitespace cleanup Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	ddb0557868	docs: escape html special char Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	a5447aab96	docs: add missing target attribute target is needed for the frame based layout Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	d52419e0c3	docs/shading: use proper markup use dl instead of ul Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Brian Paul	75e62024c3	docs: document the GALLIUM_LOG_FILE env var	2012-06-25 16:10:01 -06:00
Brian Paul	9ccf5bffe3	mesa: new MESA_LOG_FILE env var to log errors, warnings, etc., to a file Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-25 16:10:01 -06:00
Marek Olšák	0f530d2dff	docs: update GL3.3 status	2012-06-25 23:53:49 +02:00
Marek Olšák	4891c5dc64	r600g: inline r600_blit_push_depth and use resource_copy_region We are going to have a separate resource for depth texturing and transfers and this is just a transfer thing.	2012-06-25 23:53:49 +02:00
Marek Olšák	da98bb6fc1	r600g: split flushed depth texture creation and flushing	2012-06-25 23:53:49 +02:00
Paul Berry	d1056541e2	i965/msaa: Add backend support for centroid interpolation. This patch causes the fragment shader to be configured correctly (and the correct code to be generated) for centroid interpolation. This required two changes: brw_compute_barycentric_interp_modes() needs to determine when centroid barycentric coordinates need to be included in the pixel shader thread payload, and fs_visitor::emit_general_interpolation() needs to interpolate using the correct set of barycentric coordinates. Fixes piglit tests "EXT_framebuffer_multisample/interpolation {2,4} centroid-edges" on i965. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-25 11:03:26 -07:00
Paul Berry	cf0e7aa9f8	i965/fs: Refactor interpolation code to prepare for adding centroid support. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-25 11:03:26 -07:00
Paul Berry	6d7ebb21f8	i965/msaa: Adapt clip setup for centroid noperspective interpolation. To save time, we only instruct the clip stage of the pipeline to compute noperspective barycentric coordinates if those coordinates are needed by the fragment shader. Previously, we would determine whether the coordinates were needed by seeing whether the fragment shader used the BRW_WM_NONPERSPECTIVE_PIXEL_BARYCENTRIC interpolation mode. However, with MSAA, it's possible that the fragment shader might use BRW_WM_NONPERSPECTIVE_CENTROID_BARYCENTRIC instead. In the future, when we support ARB_sample_shading, it might use BRW_WM_NONPERSPECTIVE_SAMPLE_BARYCENTRIC. This patch modifies the upload_clip_state() functions to check for all three possible noperspective interpolation modes. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-25 11:03:26 -07:00
Paul Berry	bebb043811	glsl: Add IsCentroid bitfield to gl_fragment_program. This bitfield tells the back-ends which of a fragment shader's inputs require centroid interpolation. It is only set for GLSL fragment shaders, since assembly fragment shaders don't support centroid interpolation. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-25 11:03:26 -07:00
Brian Paul	2a4af651e6	st/mesa: added some simple fbo debugging/helper code	2012-06-25 11:28:03 -06:00
Brian Paul	45df3eb1db	llvmpipe: fix the LP_NO_RAST debug option It was only no-oping the clear() function, not actual triangle rasterization. Move the no_rast field from lp_context down into lp_rasterizer so it's accessible where it's needed. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-25 08:14:33 -06:00
Vinson Lee	37d699a296	scons: Add glsl/glcpp to the include path. Fixes this build failure on Solaris. Compiling build/sunos-debug/glsl/glcpp/glcpp-lex.c ... "src/glsl/glcpp/glcpp-lex.l", line 30: cannot find include file: "glcpp-parse.h" Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-23 13:40:09 -07:00
Laurent Carlier	78ac9af580	automake: add missing inclusion of GL headers Building fails when GL headers are not installed in the system, so add inclusion of these headers. Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-22 17:24:37 -06:00
Brian Paul	cbffaf20e9	mesa: #define fprintf to be __mingw_fprintf() on Mingw32 So that formats such as "%llx" are understood. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-22 17:24:37 -06:00
Brian Paul	fe68af6e0d	svga: init pointer to NULL to silence MSVC warning	2012-06-22 17:24:37 -06:00
Tom Stellard	ea76f03310	clover: Add --with-clang-libdir option and verify CLANG_RESOURCE_DIR $CLANG_RESOURCE_DIR is the directory that contains all resources needed by clang to compile programs. When clover uses clang to compile kernels it needs to specify a resource dir, so that clang can find its internal headers (e.g. stddef.h). clang defines $CLANG_RESOURCE_DIR as $CLANG_LIBDIR/clang/$CLANG_VERSION This patch adds the --with-clang-libdir option in order to accommodate clang installs to non-standard locations, and it also adds a check to the configure script to verify that $CLANG_RESOURCE_DIR/include contains the necessary header files.	2012-06-22 16:59:24 -04:00
Paul Berry	82d25963a8	i965: Compute dFdy() correctly for FBOs. On i965, dFdx() and dFdy() are computed by taking advantage of the fact that each consecutive set of 4 pixels dispatched to the fragment shader always constitutes a contiguous 2x2 block of pixels in a fixed arrangement known as a "sub-span". So we calculate dFdx() by taking the difference between the values computed for the left and right halves of the sub-span, and we calculate dFdy() by taking the difference between the values computed for the top and bottom halves of the sub-span. However, there's a subtlety when FBOs are in use: since FBOs use a coordinate system where the origin is at the upper left, and window system framebuffers use a coordinate system where the origin is at the lower left, the computation of dFdy() needs to be negated for FBOs. This patch modifies the fragment shader back-ends to negate the value of dFdy() when an FBO is in use. It also modifies the code that populates the program key (brw_wm_populate_key() and brw_fs_precompile()) so that they always record in the program key whether we are rendering to an FBO or to a window system framebuffer; this ensures that the fragment shader will get recompiled when switching between FBO and non-FBO use. This will result in unnecessary recompiles of fragment shaders that don't use dFdy(). To fix that, we will need to adapt the GLSL and NV_fragment_program front-ends to record whether or not a given shader uses dFdy(). I plan to implement this in a future patch series; I've left FIXME comments in the code as a reminder. Fixes Piglit test "fbo-deriv". NOTE: This is a candidate for stable release branches. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-22 07:59:34 -07:00
Brian Paul	d988ea5e81	mesa: minor transform feedback comments	2012-06-22 08:48:45 -06:00
Brian Paul	09af5783b3	mesa: fix comments on UBO buffer binding functions The old comments were for transform feedback.	2012-06-22 08:44:00 -06:00
Olivier Galibert	b8068afafa	draw: Handle the case when there isn't a fragment shader. Signed-off-by: Olivier Galibert <galibert@pobox.com> Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-06-22 09:58:39 +01:00
Zack Rusin	af98c6b05b	mesa: update the emacs indent files dirvars package has been replaced by built-in functionality of dir-locals. preserve the settings in the new infrastructure	2012-06-21 17:29:11 -04:00
Tom Stellard	ff2b417245	r600g: Unify SURFACE_SYNC packet emission for 3D and compute Drop the compute specific evergreen_set_buffer_sync() function and instead use the r600_surface_sync_command atom for emitting SURFACE_SYNC packets.	2012-06-21 20:42:07 +00:00
Tom Stellard	ff08f1ec6f	r600g: Enable reusing of compute state	2012-06-21 20:42:07 +00:00
Tom Stellard	5cd6ce939d	r600g: Fix reading vtx instruction offset from bytestream	2012-06-21 20:42:07 +00:00
Tom Stellard	563a764110	radeon/llvm: Turn on the BitExtract peephole optimization The BitExtract optimization folds a mask and shift operation together into a single instruction (BFE_UINT).	2012-06-21 20:42:06 +00:00
Tom Stellard	c53c8d0555	radeon/llvm: Lower ROTL to BIT_ALIGN	2012-06-21 20:42:06 +00:00
Tom Stellard	cd287301ec	radeon/llvm: Use the VLIW Scheduler for R600->NI It's not optimal, but it's better than the register pressure scheduler that was previously being used. The VLIW scheduler currently ignores all the complicated instruction groups restrictions and just tries to fill the instruction groups with as many instructions as possible. Though, it does know enough not to put two trans only instructions in the same group. We are able to ignore the instruction group restrictions in the LLVM backend, because the finalizer in r600_asm.c will fix any illegal instruction groups the backend generates. Enabling the VLIW scheduler improved the run time for a sha1 compute shader by about 50%. I'm not sure what the impact will be for graphics shaders. I tested Lightsmark with the VLIW scheduler enabled and the framerate was about the same, but it might help apps that use really big shaders.	2012-06-21 20:42:06 +00:00
Brian Paul	b73cf49c91	mesa: set GL_ARB_uniform_buffer_object extension year to 2009	2012-06-21 13:08:34 -06:00
Eric Anholt	cb9f35d16f	mesa: Add a comment explaining my thoughts on glBindBufferBase(). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:58:18 -07:00
Eric Anholt	d103fead19	mesa: Add support for glGetIntegeri_v from GL_ARB_uniform_buffer_object. Fixes piglit ARB_uniform_buffer_object/getintegeri_v. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:58:10 -07:00
Eric Anholt	fb76ddc133	mesa: Add support for glBindBufferBase/Range on GL_UNIFORM_BUFFER. Fixes piglits: GL_ARB_uniform_buffer_object/bindbuffer-general-point. GL_ARB_uniform_buffer_object/negative-bindbuffer-buffer GL_ARB_uniform_buffer_object/negative-bindbuffer-index GL_ARB_uniform_buffer_object/negative-bindbuffer-target GL_ARB_uniform_buffer_object/negative-bindbufferrange-range Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:58:07 -07:00
Eric Anholt	b82c472156	mesa: Move glBindBufferBase and glBindBufferRange() to bufferobj. The rest of the TFB implementation remains in transformfeedback.c, and this will be shared with UBOs. v2: Move the size/offset checks shared with UBOs to common code as well. (Kenneth's review) Reviewed-by: Brian Paul <brianp@vmware.com> (v1) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:58:00 -07:00
Eric Anholt	9627660448	mesa: Move buffer object dispatch setup to bufferobj.c. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:57:58 -07:00
Eric Anholt	5527c2d220	mesa: Add indexed binding points for uniform buffer objects. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:57:56 -07:00
Eric Anholt	c5c696e7fb	mesa: Add support for the GL_UNIFORM_BUFFER general binding point. Fixes piglit ARB_uniform_buffer_object/buffer-targets. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:57:54 -07:00
Eric Anholt	5426b1ade9	mesa: Add state and getters for the GL_ARB_uniform_buffer_object maximums. Fixes piglit GL_ARB_uniform_buffer_object/minmax. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:57:51 -07:00
Vincent Lejeune	3e17d38457	glapi: Add uniform buffer object API v2: Fix a typo spotted by Eric Anholt. v3: Fix missing "GL" on types, fix style, fix Studly_Caps extension name, drop commented code duplicated with GL3x.xml [anholt] Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:57:45 -07:00
Eric Anholt	37c3cbe053	dricore: Turn it into a normal library. Our intention is still that it's not abi stable, so make the package version number get included in the library name. Now you can parallel install dricore-using drivers from multiple mesa versions. We can put it into lib now that we're following library versioning rules (assuming that ABIs don't change within a single Mesa point release). LD_LIBRARY_PATH still doesn't work with a non-/, non-/usr prefix because libtool uses rpath instead of runpath for nonstandard prefixes.	2012-06-21 10:10:46 -07:00
Eric Anholt	4113ac6a0f	automake: Convert Mesa built sources generation to automake.	2012-06-21 10:10:46 -07:00
Eric Anholt	2d51ac84fd	mesa: Move GL header installation to automake. This cuts some cruft related to osmesa where we were being careful to not install headers twice.	2012-06-21 10:10:46 -07:00
Eric Anholt	1bbd22ada0	automake: Move mesa subdirs processing to automake.	2012-06-21 10:10:46 -07:00
Eric Anholt	39785488e6	automake: Move .pc installation to automake.	2012-06-21 10:10:46 -07:00
Eric Anholt	417c1a6421	automake: Move the master Mesa makefile to Makefile.old. This will let me incrementally move stuff to automake without converting libmesa.a all at once.	2012-06-21 10:10:46 -07:00
Eric Anholt	bd18a236de	automake: Convert osmesa.pc to be generated by configure.	2012-06-21 10:10:43 -07:00
Eric Anholt	fa4cf4dc0c	mesa: Convert gl.pc to be generated by configure. This saves a step of mashing variables around in our Makefile.	2012-06-21 10:10:08 -07:00
Eric Anholt	2d4b77c7c6	automake: Convert src/mesa/drivers/x11/Makefile to automake. The weird versioning of the libGL where the package version was sort of expressed as a big integer is dropped. libtool didn't like the 0 prefix, and it didn't really make sense anyway -- if you interpret it as an integer version number, old Mesa 071200 was bigger than current Mesa 08100. Instead, just bump the minor version and drop the patchlevel.	2012-06-21 10:09:17 -07:00
Eric Anholt	2fb0f770a4	automake: Convert src/gallium/Makefile to automake.	2012-06-21 10:08:26 -07:00
Eric Anholt	27383cbb0b	automake: Convert src/mapi/glapi/gen to silent build.	2012-06-21 10:08:26 -07:00
Eric Anholt	3a70f7526a	automake: Convert src/mapi/glapi/gen/Makefile to automake.	2012-06-21 10:08:24 -07:00
Eric Anholt	d59149d3f4	automake: Convert src/mesa/drivers/Makefile to automake.	2012-06-21 10:07:38 -07:00
Eric Anholt	9ff2709ca5	automake: Directly generate configs/current instead of symlinking from it.	2012-06-21 10:07:38 -07:00
Eric Anholt	95836b46e7	automake: Convert gen_matypes building to automake.	2012-06-21 10:07:36 -07:00
Eric Anholt	acf27121a5	make: Drop HOST_CC and HOST_CFLAGS. Except for the deleted linux-cell target, these were just the target cc/cflags. The only usage was for gen_matypes, which wants the target's structure packing, not the host, anyway.	2012-06-21 09:58:12 -07:00
Eric Anholt	e426949cf1	make: Fold ASM_CFLAGS into DEFINES. Every place that uses ASM_FLAGS already uses DEFINES. Not including it in DEFINES is just a way to screw up potential users, as I've done several times while working on the build system.	2012-06-21 09:58:12 -07:00
Eric Anholt	07b28af5b5	automake: Convert src/egl/Makefile to automake.	2012-06-21 09:58:12 -07:00
Eric Anholt	a4ff3342d2	automake: Don't warn on gmake portability issues. Even pre-automake, we rely on gmake features for pattern substitutions, and replacing those with reams more make code is not interesting. This will let us turn the old Makefiles using pattern substitutions into automake without spewing warnings. Reviewed-by: Dan Nicholson <dbn.lists@gmail.com>	2012-06-21 09:57:52 -07:00
Marcin Slusarz	19fd04f5ea	nv50: fix buffer reuse issues 1) We need to insert a barrier between consecutive transform feedback calls. 2) VBO cache needs to be flushed when TFB output is used as VBO draw input. Fixes Piglit test EXT_transform_feedback/immediate-reuse. Thanks to Christoph Bumiller for pointing out bugs in previous versions of this patch.	2012-06-20 21:24:53 +02:00
Marcin Slusarz	7e63b613a5	st/mesa: fix transform feedback of unsubscripted gl_ClipDistance array gl_ClipDistance needs special treatment in form of lowering pass which transforms gl_ClipDistance representation from float[] to vec4[]. There are 2 implementations - at glsl linker level (enabled by LowerClipDistance option) and at glsl_to_tgsi level (enabled unconditionally for gallium drivers). Second implementation is incomplete - it does not take into account transform feedback (see commit `642e5b413e` "mesa: Fix transform feedback of unsubscripted gl_ClipDistance array" for details). There are 2 possible fixes: - adding transform feedback support into glsl_to_tgsi version - ripping gl_ClipDistance support from glsl_to_tgsi and enabling gl_ClipDistance lowering on glsl linker side This patch implements 2nd option. All it does is: - reverts most of the commit `59be691638` "st/mesa: add support for gl_ClipDistance" - changes LowerClipDistance to true Fixes Piglit tests "EXT_transform_feedback/builtin-varyings gl_ClipDistance[{2,3,4,5,6,7,8}]-no-subscript" at least on nv50 and evergreen cards.	2012-06-20 21:16:20 +02:00
Paul Berry	f2f05e50b1	glx/tests: Fix signed/unsigned comparison warnings.	2012-06-20 11:42:42 -07:00
Paul Berry	cde6544ad7	i965/msaa: Only do multisample rasterization if GL_MULTISAMPLE enabled. From the GL 3.0 spec (p.116): "Multisample rasterization is enabled or disabled by calling Enable or Disable with the symbolic constant MULTISAMPLE." Elsewhere in the spec, where multisample rasterization is described (sections 3.4.3, 3.5.4, and 3.6.6), the following text is consistently used: "If MULTISAMPLE is enabled, and the value of SAMPLE_BUFFERS is one, then..." So, in other words, disabling GL_MULTISAMPLE should prevent multisample rasterization from occurring, even if the draw framebuffer is multisampled. This patch implements that behaviour by setting the WM and SF stage's "multisample rasterization mode" to MSRAST_ON_PATTERN only when the draw framebuffer is multisampled and GL_MULTISAMPLE is enabled. Fixes piglit test spec/EXT_framebuffer_multisample/enable-flag. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-20 11:28:09 -07:00
Paul Berry	3b0279a693	i965/msaa: Disable unsupported formats. Due to hardware limitations, MSAA is unsupported on Gen6 for formats containing >64 bits of data per pixel. From the Sandy Bridge PRM, vol4 part1, p72 ("Surface Format"): If Number of Multisamples is set to a value other than MULTISAMPLECOUNT_1, this field cannot be set to the following formats: - any format with greater than 64 bits per element - any compressed texture format (BC) - any YCRCB format Gen7 has a similar, but less stringent limitation: formats with >64 bits of data per pixel only support 4x MSAA. This patch causes the unsupported formats to report GL_FRAMEBUFFER_UNSUPPORTED. Fixes piglit "multisample-formats" tests on Gen6. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-20 11:28:09 -07:00
Andreas Boll	3becf98424	mesa: remove obsolete confdiff.sh this script is obsolete since `0cc216676c`	2012-06-20 01:51:38 -07:00
Christian König	0f269c5e7b	st/vdpau: use template size as default for source_rect. Fixes alignment problems with flash player. Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-06-20 10:13:38 +02:00
Christian König	d37c3c6ebe	st/vdpau: clear Cb&Cr with 0.5f That makes the output black in case of decoding errors. Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-06-20 10:13:29 +02:00
Kenneth Graunke	2f8351a5ac	i965: Don't set brw_wm_prog_key::iz_lookup on Gen6+. Sandy Bridge and later don't use this field, so there's no point in setting it. It can only cause harmful state-based recompiles. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-19 17:36:48 -07:00
Olivier Galibert	c790c2c759	llvmpipe: Add vertex id support. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 14:40:44 -06:00
Olivier Galibert	46931ecf48	llvmpipe: Simplify and fix system variables fetch. The system array values concept doesn't really work because it expects the system values to be fixed per call, which is wrong for gl_VertexID and iffy for gl_SampleID. So this patch does two things: - kill the array, have emit_fetch_system_value directly pick the values it needs (only gl_InstanceID for now, as the previous code) - correctly handle the expected type in emit_fetch_system_value Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 14:40:44 -06:00
Olivier Galibert	4625a9b1ad	draw: fix flat shading and screen-space linear interpolation in clipper This includes: - picking up correctly which attributes are flatshaded and which are noperspective - copying the flatshaded attributes when needed, including the non-built-in ones - correctly interpolating the noperspective attributes in screen-space instead of in a 3d-correct fashion. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 14:40:44 -06:00
Olivier Galibert	cfc5b30941	softpipe: Offset is not to be applied to the layer parameter of array texture fetches. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 14:40:44 -06:00
Brian Paul	fc855ed5d9	st/mesa: clamp glDrawPixels size to max texture size	2012-06-19 14:40:44 -06:00
Brian Paul	7f4786ad29	st/mesa: move st_validate_state() call earlier in st_DrawPixels()	2012-06-19 14:40:44 -06:00
Jerome Glisse	b4f0ab0b22	r600g: fix z/stencil texture creation v2 z or stencil texture should not be created with the z/stencil flags for surface creation as they are intended to be bound as texture. v2: remove broken code Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-06-19 15:03:36 -04:00
Török Edwin	988ad7831c	radeon/llvm: Fix CR/LF in Processors.td Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-06-19 16:38:23 -04:00
Török Edwin	7c005d5687	radeon/llvm: Fix sin/cos codegen on R700 Based on https://bugs.freedesktop.org/show_bug.cgi?id=50317#c4 Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=50316 https://bugs.freedesktop.org/show_bug.cgi?id=50317 Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-06-19 16:38:13 -04:00
Fredrik Höglund	4e943c375b	docs: update GL3.txt for ARB_base_instance Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 07:57:22 -06:00
Fredrik Höglund	c4c8c7a8f9	st/mesa: Add support for GL_ARB_base_instance Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 07:57:22 -06:00
Fredrik Höglund	af372129e5	gallium: Add PIPE_CAP_START_INSTANCE Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 07:57:22 -06:00
Fredrik Höglund	ae5d7d5e89	mesa: Add support for GL_ARB_base_instance Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 07:57:22 -06:00
Vinson Lee	ee99647e02	scons: Do not build svga if using Solaris Studio C compiler. Solaris Studio C compiler does not support anonymous structs and anonymous unions. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-18 16:37:46 -07:00
Kenneth Graunke	5b83bdc154	i965: Fix brw_swap_cmod() for LE/GE comparisons. The idea here is to rewrite comparisons like 2 >= x with x <= 2; we want to simply exchange arguments, not negate the condition. If equality was part of the original comparison, it should remain part of the swapped version. This is the true cause of bug #50298. It didn't manifest itself on Sandybridge because we embed the conditional modifier in the IF instruction rather than emitting a CMP. All other platforms use CMP. It also didn't manifest itself on the master branch because commit `be5f27a84d` ("glsl: Refine the loop instruction counting.") papered over the problem. NOTE: This is a candidate for stable release branches. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50298 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-18 15:25:31 -07:00
Brian Paul	6f7834ad36	docs: start release notes file for 8.1	2012-06-18 12:39:34 -06:00
Tom Stellard	7fab4b648b	radeon/llvm: Update comment in AMDGPU.td	2012-06-18 18:30:36 -04:00
Tom Stellard	984ad0788c	radeon/llvm: Remove unused AMDIL TableGen definitions	2012-06-18 18:30:36 -04:00
Tom Stellard	34ff22b75f	radeon/llvm: Eliminate getRegClassFromType() function We can use TargetLowering::getRegClassFor() instead.	2012-06-18 18:30:36 -04:00
Tom Stellard	440ab9ea02	radeon/llvm: Remove dead code from AMDILISelLowering.cpp	2012-06-18 18:30:35 -04:00
Vinson Lee	cd62960a2e	gallium: Add support for Solaris Studio C++ compiler. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-18 10:01:26 -07:00
James Benton	f34e2f484b	llvmpipe: Implement cylindrical wrapping. Tested against mesa demos cylwrap and dx9 DCT address.exe which now passes 100%. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-06-18 17:55:05 +01:00
Vinson Lee	d1acae2bdc	st/glx: Do not undefine _R, _G, and _B. Fixes build error on Cygwin and Solaris. _R, _G, and _B are used in ctype.h on those platforms. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-18 09:42:08 -07:00
Brian Paul	8ae93c68ea	svga: fix synchronization bug between sampler views and surfaces This fixes a bug where a sampler view was using stale texture/resource data when the texture was modified through a surface (render to texture). Bumping the texture and layer ages triggers sampler view revalidation. Fixes piglit fbo-blit failure. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-06-18 10:22:59 -06:00
Kristian Høgsberg	2d7b2d7a87	gles2: Add GL_NV_read_buffer extension This lets us select the front buffer for reading under GLES2. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-18 11:53:18 -04:00
Kristian Høgsberg	e841a2426e	get.c: Rename EXTRA_VERSION_ES2 to EXTRA_API_ES2 This extra condition checks the API not the version of the API, so rename to reflect that. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-18 11:50:53 -04:00
Andreas Boll	1692d3ad94	docs/relnotes: comment out bug template Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-18 08:21:47 -06:00
Andreas Boll	fb918727ef	docs/relnotes: replace tbd with release date Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-18 08:21:47 -06:00
Andreas Boll	b9fad90350	docs/relnotes: fix validation errors Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-18 08:21:47 -06:00
Andreas Boll	207d52eb46	docs/relnotes: consolidate html header Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-18 08:21:47 -06:00
José Fonseca	e48d26bf40	draw: Ensure that the vertex_header type size matches expectation. This is failing sometimes, probably because TargetData keeps a structure layout cache, which can become bogus, ever since the InvalidateStructLayoutInfo API was removed in LLVM r135245. This change merely makes the problem easier to diagnose (an assertion failure instead of a random crash).	2012-06-18 12:06:23 +01:00
Marek Olšák	6e7756db14	r600g: enable streamout by default on r7xx and DRM 2.17.0 Now that it's in Linus's tree. Has anyone had a chance to test streamout on Cayman recently?	2012-06-17 18:28:32 +02:00
Marek Olšák	7c3786d780	st/mesa: properly allocate MSAA renderbuffers Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-16 14:20:27 +02:00
Marek Olšák	c760283159	st/mesa: make unsupported renderbuffer formats always fail as FBO incomplete instead of failing to allocate a renderbuffer. This also fixes piglit/get-renderbuffer-internalformat with non-renderable formats. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-16 14:20:27 +02:00
Marek Olšák	e4b2e6b527	st/mesa: separate sw renderbuffer allocation from hw one Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-16 14:20:26 +02:00
Marek Olšák	a82227ce4a	mesa: if AllocStorage doesn't choose a format, report FRAMEBUFFER_UNSUPPORTED This allows drivers not to do any allocation in AllocStorage if the storage cannot be allocated because of an unsupported internalformat + samples combo. The little ugliness is that AllocStorage is expected to return TRUE in this case. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-16 14:20:26 +02:00
Stéphane Marchesin	841eee5d44	i915g: More ops commute. This allows using the optimizations more broadly.	2012-06-15 20:22:26 -07:00
Marek Olšák	cb4d1d377d	r600g: fix lockups with streamout on r7xx This requires the latest streamout kernel patches. Streamout is disabled by default on r7xx, so this patch is safe for regular users. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-15 22:43:00 +02:00
Marek Olšák	f01594be0e	r600g: compute CS space for streamout correctly, add comments SET_CONTEXT_REG was not counted in. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-15 22:43:00 +02:00
Marek Olšák	bb07e25131	r600g: set SMX_ACTION_ENA to fix streamout cache flushes on some chipsets It helps on R7xx. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-15 22:42:59 +02:00
Alexey Shvetsov	f56f03428d	clover: Fix build with LLVM libs installed to non-standard directories Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-06-15 13:22:16 -04:00
Marek Olšák	5e7e7d96b3	st/mesa: don't do srgb->linear conversion in decompress_with_blit This fixes piglit/getteximage-formats on r600g. NOTE: This is a candidate for stable branches. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-15 18:54:14 +02:00
Paul Berry	4d9c3cbce9	glsl: Use ir_unop_f2u to convert floats to uints. Fixes piglit tests spec/glsl-1.30/execution/{vs,fs}-float-uint-conversion on i965. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	9d57d483cb	gallium: Add TGSI_OPCODE_F2U to gallivm backend. Note: for the moment TGSI_OPCODE_F2U is implemented using lp_build_itrunc() (the same function used to implement TGSI_OPCODE_F2I). In the long run, we should create an lp_build_utrunc() function to do the proper conversion. But this should allow us to limp along with mostly correct behaviour for now.	2012-06-15 08:58:55 -07:00
Paul Berry	1be7661110	gallium: Add support for ir_unop_f2u to tgsi backend. Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	fa584c50cf	ir_to_mesa: Add support for ir_unop_f2u to ir_to_mesa backend. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	11a7b93592	i965: Add support for ir_unop_f2u to i965 backend. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	613a8170ae	glsl: Add support for ir_unop_f2u to constant folding. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	8e31f961e6	glsl: Add unary operation ir_unop_f2u. Previously, we performed conversions from float->uint by a two step process: float->int->uint. However, on platforms that use saturating conversions (e.g. i965), this didn't work, because if the source value was larger than the maximum representable int (0x7fffffff), then converting it to an int would clamp it to 0x7fffffff. This patch just adds the new opcode; further patches will adapt optimization passes and back-ends to use it, and then finally the ast_to_hir logic will be modified to emit the new opcode. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	75f409d75c	i965/blorp: Implement source clipping. This patch modifies blorp blits (which are used for MSAA) to properly account for clipping of source coordinates. Previously, if we detected the possibility of source clipping, we would fall back to the blit meta-op, which doesn't support MSAA and is very slow for depth and stencil buffers. Fixes piglit tests "EXT_framebuffer_multisample/clip-and-scissor-blit" on i965/Gen6+. Also substantially speeds up the Humble Bundle V game "Psychonauts" on Gen6+ (without this patch, the game's depth buffer blits use the slow blit meta-op). Reviewed-by: Carl Worth <cworth@cworth.org>	2012-06-15 08:58:54 -07:00
Brian Paul	4d9f263d7c	scons: add st_atom_array.c to the build	2012-06-15 09:31:33 -06:00
Christian König	92af184690	winsys/radeon: enable IB submission to compute rings v2 This allows submitting things to the compute-only rings on cayman+. v2: rebased on current master and actually make use of the new flag in evergreen_compute.c Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-06-15 09:52:38 +02:00
Marek Olšák	b4753dafcc	st/mesa: atomize vertex array state This moves the state validation to where all the other states are validated.	2012-06-15 03:15:50 +02:00
Maarten Lankhorst	6bb0151f1f	winsys/radeon: Remove unnecessary pipe_thread_destroy in radeon_drm_cs_destroy Fixes crash bug introduced with `210ddf0819` fd.o #49198 pthread_detach after a pthread_join is unneeded. Signed-off-by: Maarten Lankhorst <m.b.lankhorst@gmail.com> Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-06-15 03:01:23 +02:00
Marcin Slusarz	fc782bcbf0	nv50,nvc0: fix stream output target buffer leak It manifests at exit as: "WARNING: destroying GPU memory cache with some buffers still in use"	2012-06-14 23:38:28 +02:00
Christoph Bumiller	169a0ae40a	nv50: disable stream output before reconfiguring it If we don't, the GPU will just throw an ILLEGAL_OPERATION error.	2012-06-14 23:30:49 +02:00
Christoph Bumiller	ef51ce522b	nv50/ir: handle NEG,ABS modifiers for short RCP encoding	2012-06-14 23:25:48 +02:00
Brian Paul	f677954e07	st/mesa: fix glDrawPixels(GL_DEPTH_COMPONENT) color output When drawing a depth image the fragment shader also needs to emit the current raster color. The new piglit drawpix-z test exercises this. NOTE: This is a candidate for the 8.0 branch.	2012-06-14 14:37:31 -06:00
Brian Paul	8031aa134e	docs: add info about shortlog_mesa.sh script	2012-06-14 14:37:31 -06:00
Paul Berry	4b7b4c46c5	glx/tests and mesa/tests: Update .gitignore files. This patch updates .gitignore files to account for the new build artifacts introduced by the following commits: `ae376f0` glx/tests: Rename test as glx-test `8fecdcc` mesa/tests: Add tests for _mesa_lookup_enum_by_{name,nr} functions `a29ad2b` mesa/tests: Add tests for the generated dispatch table	2012-06-14 10:08:57 -07:00
Christian König	eb024c7488	st/vdpau: fix YCbCr down/up-loads for buffers larger than requested When the video buffer turns out to be larger than requested by the application we shouldn't upload or download more data into / from it than originally requested. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=39309 Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-06-14 17:54:04 +02:00
Alexander von Gluck IV	cb3054c849	scons: Fix Haiku binary optimizations Haiku targets the Pentium or higher processor. To ensure compatibility we can do march 586 and mtune 686. Mesa will still use sse however if the cpu supports it (and the stack is properly aligned). These flags only affect the internal compiler optimizations.	2012-06-14 08:08:17 -07:00
Andreas Boll	c1dcf9665c	mesa: fix html in shortlog_mesa.sh script Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-14 08:25:42 -06:00
Brian Paul	51c9c67a2f	mesa: added Ian's shortlog_mesa.sh script in bin/	2012-06-14 08:22:54 -06:00
Brian Paul	5234b8902c	svga: make svga_surface_needs_propagation() surface const	2012-06-14 08:20:40 -06:00
Brian Paul	92b65637ab	svga: add svga_surface_const() cast wrapper	2012-06-14 08:20:40 -06:00
Brian Paul	bffb3997c3	svga: fix comment typo	2012-06-14 08:20:40 -06:00
Aaron Watry	fc3bac8a40	rbug: fix make process on Linux Mint 13 x64. Previously, rbug_.c would fail to compile with incomplete prototype errors when make was run from the command line on my machine. My IDE always built fine, and still does after this patch (Netbeans 7.1.2). Most of the includes from files in gallium/auxiliary/rbug/ were assuming an rbug/ subdirectory, while the headers are actually in the same directory as the .c files. The build error was also previously a problem for me on Ubuntu 11.10 and Mint 12. Fixes build for the following configuration: ./autogen.sh --enable-debug --enable-texture-float --with-gallium-drivers=r600 --with-dri-drivers=radeon --enable-r600-llvm-compiler Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-14 08:14:59 -06:00
José Fonseca	93a42d1314	windows/gdi: Remove GL_NV_register_combiners and GL_NV_vertex_array_range exports	2012-06-14 12:02:03 +01:00
Ian Romanick	4bfdc83135	glsl: Fix pi/2 constant in acos built-in function In single precision, 1.5707963 becomes 1.5707962513 which is too small. However, 1.5707964 becomes 1.5707963705 which is just right. The value 1.5707964 is already used in asin.ir. NOTE: This is a candidate for stable release branches. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Paul Berry <stereotype441@gmail.com>	2012-06-13 18:26:11 -07:00
Ian Romanick	f18d3fe0cb	glapi: Remove GL_NV_vertex_array_range from the dispatch table There is no GLX protocol for these functions. Open-source Linux drivers have not supported this extension for many years, and it seems unlikely at this point that this support will return. There's no reason to have slots for these functions in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:56 -07:00
Ian Romanick	69d1851757	glapi: Remove GL_NV_fence from the dispatch table There is no GLX protocol for these functions. No open-source Linux driver has ever supported this extension, and it seems unlikely at this point that one ever will. There's no reason to have slots for these functions in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:56 -07:00
Ian Romanick	6db7cf29b5	glapi: Remove GL_NV_register_combiners from the dispatch table There is no GLX protocol for these functions. No open-source Linux driver has ever supported this extension, and it seems unlikely at this point that one ever will. There's no reason to have slots for these functions in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:56 -07:00
Ian Romanick	a6002909a3	glapi: Remove GL_APPLE_texture_range from the dispatch table There is no GLX protocol for these functions, and no Linux driver has ever supported this extension. There's no reason to have slots for these functions in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:54 -07:00
Ian Romanick	e62c4c765c	glapi: Remove GL_SGIX_pixel_texture from the dispatch table There is no GLX protocol for this function. Open-source Linux drivers have not supported this extension for many years, and it seems unlikely at this point that this support will return. There's no reason to have slots for this function in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:54 -07:00
Ian Romanick	933714aabe	glapi: Remove GL_SGIS_pixel_texture from the dispatch table There is no GLX protocol for these functions, and no Linux driver has ever supported this extension. There's no reason to have slots for these functions in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:54 -07:00
Ian Romanick	a29ad2b421	mesa/tests: Add tests for the generated dispatch table Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:53 -07:00
Ian Romanick	8fecdcc587	mesa/tests: Add tests for _mesa_lookup_enum_by_{name,nr} functions Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:14:22 -07:00
Ian Romanick	e08f9080ff	glapi: Add missing GL_EXT_texture_sRGB_decode enums Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	1c25984b23	glapi: Add missing GL_EXT_framebuffer_sRGB enums Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	75c516c959	glapi: Add missing GL_EXT_packed_float enums Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	ffbccb8cef	glapi: Add missing framebuffer sRGB enum Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	2d8d85d7fb	glapi: Add uniform buffer object enums These are from OpenGL 3.1 and ARB_uniform_buffer_object. I only added them to 3.1 because that required the least work. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	c5071825b0	glapi: Add missing enums for GL_NV_fragment_program Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	2485a1332e	glapi: Add missing enums for GL_ARB_occlusion_query2 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	22cdd7d817	glapi: Remove extraneous GL_ from TEXTURE_IMMUTABLE_FORMAT Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	21af1e9a0e	glapi: Add missing enums for GL_ATI_fragment_shader Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	502449d71f	glapi: Add texture swizzle enums These are from OpenGL 3.3, ARB_texture_swizzle, and EXT_texture_swizzle (with different names). I only added them to 3.3 because that required the least work. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	a4a0c1f09d	glapi: Add a couple missing 3.0 enums Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	cc1e74bd19	glapi: Add missing _NV extension on COMBINE4 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	78b30938cc	glapi: Add missing enums for GL_EXT_vertex_array Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	8fcec14417	glapi: Add missing enums for GL_EXT_compiled_vertex_array Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	3c22f79412	glx/tests: Add unit tests for generated code in indirect_init.c Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:24 -07:00
Ian Romanick	4c270f9c6b	glx/tests: Add unit tests for generated code in indirect_size.c Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:24 -07:00
Ian Romanick	ae376f0567	glx/tests: Rename test as glx-test This matches the existing test in src/glsl/tests. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:24 -07:00
Ian Romanick	2e8c866f10	glx: Move tests from tests/glx to src/glx/tests This matches the organization of other unit tests in Mesa. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:24 -07:00
Brian Paul	f68ab0398b	util: add some comments, fix indentation	2012-06-13 08:52:40 -06:00
Matt Turner	ae419a0159	glsl: Transform dot product by a basis vector into a swizzle Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-12 18:51:25 -04:00
Matt Turner	9aa3fbcc2e	glsl: Add is_basis function Determines whether it's a basis vector, i.e., a vector with one element equal to 1 and all other elements equal to 0. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-12 18:51:25 -04:00
Matt Turner	d7bef19c7f	glsl: Check for zero vectors in ir_binop_dot Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-12 18:51:25 -04:00
Brian Paul	82ce93a8fd	mesa: move variable declaration out of loop to fix MSVC build	2012-06-12 16:31:36 -06:00
Stéphane Marchesin	a74c4fb89d	mesa: Fix bool-int mismatch Also include stdbool for windows.	2012-06-12 15:22:48 -07:00
Antoine Labour	3c9fab8822	mesa: Fix hash table leak When a value was replaced, the new key was strdup'd and leaked. To fix this, we modify the hash table implementation to return whether the value was replaced and free() the (now useless) duplicate string.	2012-06-12 14:42:22 -07:00
Antoine Labour	e2e9b4b10f	mesa: Free uniforms correctly. This is an array of uniforms, not a single one. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> NOTE: This is a candidate for the 8.0 branch.	2012-06-12 14:42:22 -07:00
Antoine Labour	53feb8ecdc	meta: Cleanup the resources we allocate. When we have multiple shared contexts, and one of them is long-running, this will lead to never freeing those resources since they are shared. Instead, free them right away on context destruction since we know the other context isn't using them. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> NOTE: This is a candidate for the 8.0 branch.	2012-06-12 14:42:22 -07:00
Stéphane Marchesin	0256edd709	glx: Handle a null reply in QueryVersion. Works around crashes when X connections break. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> NOTE: This is a candidate for the 8.0 branch.	2012-06-12 14:42:22 -07:00
Michel Dänzer	1657dec72d	radeonsi: Don't always re-compile shaders after they're bound.	2012-06-12 20:18:24 +02:00
Dave Airlie	6d289390ec	st/xorg: Fix crash on startup. Signed-off-by: Dave Airlie <airlied@redhat.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com>	2012-06-12 18:48:28 +02:00
Michel Dänzer	90c6eacdb4	radeonsi: Use linear instead of constant interpolation for now. Constant interpolation still hangs the GPU for some reason.	2012-06-12 18:48:28 +02:00
Thomas Stellard	4c418cf1a3	radeonsi: Handle SUB_f32. Signed-off-by: Thomas Stellard <tom.stellard@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>	2012-06-12 18:48:16 +02:00
Michel Dänzer	4c4ef9c29a	radeonsi: Only dump shaders with environment variable RADEON_DUMP_SHADERS=1.	2012-06-12 18:33:54 +02:00
Eric Anholt	7b11051a28	mesa: Build git_sha1.h before computing dependencies. Otherwise, version.c doesn't get a dependency on it in a clean build, and then it doesn't necessarily get generated before version.c is compiled. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50976 Reviewed-by: Jakob Bornecrantz <jakob@vmware.com>	2012-06-12 08:10:41 -07:00
Andreas Boll	fd64b39727	docs: whitespaces cleanup Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	5dc59455f9	docs: remove some superfluous <p> tags Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	8155ed37a1	docs: remove unused table styles Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	908f788503	docs: remove unused anchor links Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	210a27d8c3	docs: prefer lowercase html tags Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	cc4188895b	docs: use id instead of <a name> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	f85d23cea4	docs/subset-A.html: fix markup fixes tidy warnings: line 11 column 1 - Warning: <center> isn't allowed in <h1> elements line 10 column 1 - Info: <h1> previously mentioned line 11 column 34 - Warning: discarding unexpected </center> line 14 column 1 - Warning: <center> isn't allowed in <h2> elements line 13 column 1 - Info: <h2> previously mentioned line 13 column 1 - Warning: missing </h2> before <h3> line 18 column 1 - Warning: discarding unexpected </center> line 19 column 1 - Warning: discarding unexpected </h2> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	2d7f319a0a	docs/news.html: use proper markup fixes tidy warnings: line 1227 column 9 - Warning: missing <li> line 1228 column 17 - Warning: missing <li> line 1235 column 25 - Warning: missing <li> line 1259 column 17 - Warning: missing <li> line 1267 column 9 - Warning: missing <li> line 1359 column 9 - Warning: missing <li> line 1361 column 55 - Warning: discarding unexpected </i> line 1354 column 1 - Warning: trimming empty <p> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	df2be226d9	docs: fix html end/start tags for more well-formed html Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	703a662c15	docs: escape special html chars Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:29 -06:00
Andreas Boll	ecd5c7ceb8	docs: consolidate html header and footer: add doctype; add character encoding; add missing <head> tag; unify html header and footer. Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:29 -06:00
Kenneth Graunke	45c21f852e	mesa: Unbind GL_TEXTURE_BUFFER on DeleteBuffers. Fixes oglconform's tbo/basic.buffer.delete test. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-11 13:30:32 -07:00
Kenneth Graunke	bbb67c3efc	mesa: Make glPrimitiveRestartIndex execute immediately in display lists. From the GL_NV_primitive_restart spec: "PrimitiveRestartIndexNV is not compiled into display lists, but is executed immediately." Prior to this patch, calls to glPrimitiveRestartIndex would hit the noop dispatch stub. +2 oglconforms. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-11 13:28:23 -07:00
Kenneth Graunke	a75e704326	mesa: Check for a negative "size" parameter in glCopyBufferSubData(). From the GL_ARB_copy_buffer spec: "An INVALID_VALUE error is generated if any of readoffset, writeoffset, or size are negative [...]" Fixes oglconform's copybuffer/negative.CNNegativeValues test. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-06-11 13:27:36 -07:00
Kenneth Graunke	4a5d020ee3	automake: Add AM_PROG_AR before LT_INIT to silence a lot of warnings. The warnings appear to occur with newer automake (probably 1.12). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-11 13:27:36 -07:00
José Fonseca	ea606ee7b4	scons: Fix scons build.	2012-06-11 19:38:07 +01:00
Brad King	f3cdcb839f	configure.ac: Add --with-(gl|glu|osmesa)-lib-name options These allow one to mangle the library names, without also mangling the symbol names, to make them distinct from other GL libraries on the system. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	337d9c955b	glsl: Put a bunch of optimization visitors under anonymous namespaces. Because these classes are used entirely from their own source files and not from separate DSOs, the linker gets to produce massively less code. This cuts about 13k of text in the libdricore case. In the non-libdricore case, the additional linkage information allows the compiler to inline some code, so libglsl.a size actually increases by about 300 bytes. For a dricore build, improves shader_runner runtime on glsl-fs-copy-propagation-texcoords-1 by 0.21% +/- 0.03% (n=353574, outliers removed). No statistically significant difference with n=322 on glslparsertest on a yofrankie shader intended to test compiler performance. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	279efce8bb	automake: Merge the dricore libglsl build into libdricore. Now we have just one library of "all of Mesa core" instead of both libdricore and libglsl that drivers link against. I did this change in a sort of nonrecursive make fashion: the generated files are still produced in the non-automake build, like the rest of dricore, but the GLSL files are stuffed into libdricore without building a convenience library in src/glsl (even though we could now). This would make a bit more sense if glsl was just another dir under src/mesa, because right now I had to contort the prefix variable name to look another ../ level up.	2012-06-11 09:28:00 -07:00
Eric Anholt	446faee094	automake: Add a prefix variable for libglsl sources. See `e86c40a84d` for reasoning. In the process I did s/:=/=/ to shut up automake about nonportable make syntax.	2012-06-11 09:28:00 -07:00
Eric Anholt	7edbf4b323	automake: Convert src/Makefile to automake. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	07abd913b6	automake: Move top-level makefile to automake. This is part of a series to fix our build issues in the automake case by hooking up the automatic Makefile regeneration support. The extract_git_sha1 is moved into src/mesa/Makefile so that we get correct dependency generation. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	743e505315	automake: Globally add stub automake targets to the old Makefiles. I tried to update all the old Makefiles that included the default config to be sure they had a default target if they didn't previously have one, since this new all target will always point at it. Almost everything had one. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	4038dda6cd	mesa: Move the version information right into configure.ac. Nothing else called version.mk. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	0cc216676c	automake: Remove the old static configs system. With the incremental automake conversion, we'd broken those that included glx or egl. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Tapani Pälli	d5c1801a01	android: fix the build Some more of the files are now autogenerated, which caused build breakage; this patch adds generation of these missing files. The patch also changes the existing make rules so that the files are created as part of the local source (not in an intermediate directory, which causes several problems). Signed-off-by: Tapani Pälli <tapani.palli@intel.com>	2012-06-11 09:27:59 -07:00
Michael Karcher	e2c08e824b	i915g: Fix depth/stencil glClear This patch fixes a copy/paste error and masking of depth/stencil (stencil is in the top 8 bits), and makes glean/readPixSanity happy. Both the stencil and the depth buffer piglit test also pass if glClear(DEPTH | STENCIL) is executed instead of glClear(DEPTH)/glClear(STENCIL). Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Tested-by: Christopher Egert <cme3000@gmail.com> Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-06-10 16:33:42 +02:00
Kenneth Graunke	306c9f0c57	mesa: Fix "glCopyBuffserSubData" typos in error messages and comments. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-08 22:04:34 -07:00
Eric Anholt	a018747ac8	glsl: Clean up warnings about deleting classes without virtual destructors. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-08 12:42:38 -07:00
Marcin Slusarz	ea055e19c2	glsl: fix deref_hash memory leak in constant_expression_value Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-08 21:00:40 +02:00
Andreas Boll	ca9977d5c6	glcpp: .gitignore cleanup .o, .lo and *~ are already in toplevel .gitignore Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-08 11:18:55 -07:00
Andreas Boll	6224e90247	glapi: .gitignore cleanup remove archaic .cvsignore .pyo is already in toplevel .gitignore .pyc is already in toplevel .gitignore Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-08 11:18:38 -07:00
Roland Scheidegger	dfbb18bdb5	gallivm: Fix calculating rho for 3d textures for the single-quad case Discovered by accident, this looks like a very old typo bug. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-06-08 17:46:57 +01:00
Kenneth Graunke	529476b5e4	i965: Add forgotten bitcast operations in brw_fs_channel_expressions. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 11:22:11 -07:00
Paul Berry	9fd0e76a19	i965/blorp: allow all buffer formats provided src and dst match. Previously, blits using the "blorp" mechanism only worked for 8-bit RGBA color buffers, 24-bit depth buffers, and 8-bit stencil buffers. This was not enough, because the blorp mechanism must be used for blitting whenever MSAA is in use. This patch allows all formats to be used, provided the source and destination formats match. So far I have confirmed that the following formats work properly with MSAA: GL_RGB, GL_RGBA, GL_ALPHA, GL_ALPHA4, GL_ALPHA8, GL_R3_G3_B2, GL_RGB4, GL_RGB5, GL_RGB8, GL_RGB10, GL_RGB12, GL_RGB16, GL_RGBA2, GL_RGBA4, GL_RGB5_A1, GL_RGBA8, GL_RGB10_A2, GL_RGBA12, GL_RGBA16. Fixes piglit tests "EXT_framebuffer_multisample/formats {2,4}" on Sandy Bridge and Ivy Bridge. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 11:03:15 -07:00
Paul Berry	530bda2aac	i965/blorp: Implement logic for additional buffer formats. Previously the blorp engine only supported RGBA8 color buffers and 24-bit depth buffers. This patch adds support for any color buffer format that is supported as a render target, and for 16-bit and 32-bit depth buffers. This required threading the brw_context struct through into brw_blorp_surface_info::set() so that it can consult the brw->render_target_format array. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 11:03:15 -07:00
Paul Berry	9dbd0b6778	i965/blorp: De-virtualize brw_blorp_{mip,surface}_info::set() function. Even though brw_blorp_surface_info is derived from brw_blorp_mip_info, this function doesn't need to be virtual, because it is never accessed through a base class pointer. Making the function non-virtual will allow it to take additional parameters in the brw_blorp_surface_info case. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 11:03:15 -07:00
Paul Berry	040d015734	i965/blorp: Refactor surface format determination. This patch moves the responsibility for deciding on the format of the source and destination surfaces from the gen{6,7}_blorp_emit_surface_state() functions to brw_blorp_surface_info::set(), which is shared between Gen6 and Gen7. This will make it possible to add support for more surface formats without code duplication. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 11:03:15 -07:00
Kenneth Graunke	05790746df	i965: Enable the GL_ARB_shader_bit_encoding extension. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:17:21 -07:00
Olivier Galibert	a83be8b6d7	st/mesa: Finally activate the ARB_shader_bit_encoding extension. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:17:14 -07:00
Olivier Galibert	e16b0a51be	glsl: Bitwise conversion operator support in the software renderers. TGSI doesn't need an opcode, since registers are untyped (but beware once doubles come into the scene). Mesa IR doesn't handle native integers, so trying to handle them there is worthless, the case entries are only added for warning reasons. It was only tested with softpipe, since llvmpipe doesn't support glsl 1.3 yet. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:18 -07:00
Olivier Galibert	abe9767553	glsl: Bitwise conversion operator support in ir_constant_expression. A "test_out = floatBitsToUint(-1.0);" fired through the GLSL compiler gives a correct "(assign (x) (var_ref test_out) (constant uint (3212836864)))" Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:18 -07:00
Olivier Galibert	1b8a3aad09	glsl: Bitwise conversion operator support in ir_validate. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:18 -07:00
Olivier Galibert	4fab150559	glsl: Bitwise conversion operator support in ir_expression. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:17 -07:00
Olivier Galibert	500dcbb1aa	glsl: New unary opcodes for ARB_shader_bit_encoding support. The opcodes are bitcast_f2u, bitcast_f2i, bitcast_i2f and bitcast_u2f. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:17 -07:00
Olivier Galibert	199771bc32	glsl: Scaffolding for ARB_shader_bit_encoding. That adds support for activating the extension. It doesn't actually do anything yet, of course. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:00 -07:00
Kenneth Graunke	f8d40deea5	mesa: Return 8 bits for GL_TEXTURE_RED_SIZE on RGTC formats. From the issues section of the GL_ARB_texture_compression_rgtc extension: 15) What should glGetTexLevelParameter return for GL_TEXTURE_GREEN_SIZE and GL_TEXTURE_BLUE_SIZE for the RGTC1 formats? What should glGetTexLevelParameter return for GL_TEXTURE_BLUE_SIZE for the RGTC2 formats? RESOLVED: Zero bits. These formats always return 0.0 for these respective components and have no bits devoted to these components. Returning 8 bits for red size of RGTC1 and the red and green sizes of RGTC2 makes sense because that's the maximum potential precision for the uncompressed texels. Thus, we need to return 8 bits for GL_TEXTURE_RED_SIZE on all RGTC formats and 8 bits for GL_TEXTURE_GREEN_SIZE on RGTC2 formats. BLUE should be 0. Fixes oglconform/rgtc/advanced.texture_fetch.tex_param. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-07 00:01:40 -07:00
Kenneth Graunke	3603fdcebf	glsl: Hook up loop_variable_state destructor to plug a memory leak. While ~loop_state() is already freeing the loop_variable_state objects via ralloc_free(this->mem_ctx), the ~loop_variable_state() destructor was never getting called, so the hash table inside loop_variable_state was never getting destroyed. Fixes a memory leak in any shader with loops. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:01:40 -07:00
Tom Stellard	5f3f63b76d	radeon/llvm: Emulate RECIP_UINT instruction on Cayman	2012-06-06 20:51:00 -04:00
Tom Stellard	0c9f5f22d5	radeon/llvm: Remove some duplicate code in the R600 CodeEmitter	2012-06-06 20:51:00 -04:00
Tom Stellard	9c46cb2368	radeon/llvm: Fix MULLO* instructions on Cayman On Cayman, the MULLO* instructions must fill all slots in an instruction group.	2012-06-06 20:50:36 -04:00
Tom Stellard	0c4b19ac63	r600g: Compute support for Cayman	2012-06-06 10:49:36 -04:00
Dave Airlie	2bb2e6a6e3	xorg: port to new compat API. Signed-off-by: Dave Airlie <airlied@redhat.com>	2012-06-06 15:22:50 +01:00
Brian Paul	ec19bdd16c	mesa: consolidate internal glCompressedTexSubImage1/2/3D code Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-06 07:56:00 -06:00
Brian Paul	e8fdd0e0d5	mesa: consolidate internal glCompressedTexImage1/2/3D code Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-06 07:56:00 -06:00
Brian Paul	cd9ab2584f	mesa: consolidate internal glCopyTexSubImage1/2/3D code Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-06 07:56:00 -06:00
Brian Paul	e42d00b3f4	mesa: consolidate internal glTexSubImage1/2/3D code Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-06 07:55:59 -06:00
Brian Paul	8f5fffe75d	mesa: consolidate internal glTexImage1/2/3D code The functions for handling 1D, 2D and 3D texture images were nearly identical. This folds them all together. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-06 07:55:59 -06:00
Brian Paul	3a62e8bcac	translate_test: add support for half floats Fixes assertion reported in https://bugs.freedesktop.org/show_bug.cgi?id=44519 but there are still failing cases.	2012-06-06 07:55:59 -06:00
Brian Paul	adc58e96d0	docs: remove documentation of old Makefile system It's going away in the near future.	2012-06-06 07:55:59 -06:00
Tom Stellard	d4942eb9fa	radeon/llvm: Remove obsolete hooks for the ConvertToISA pass We can't remove this pass yet, because we need it to convert AMDIL registers in BRANCH* instructions, but we don't need it for instruction conversion any more.	2012-06-06 13:46:04 -04:00
Tom Stellard	edceed1b9a	radeon/llvm: Remove AMDIL MOVE* instructions	2012-06-06 13:46:04 -04:00
Tom Stellard	f81e4663a7	radeon/llvm: Add isMov() to AMDILInstrInfo This enables the CFGStructurizer to work without the AMDIL::MOV* instructions.	2012-06-06 13:46:04 -04:00
Tom Stellard	1777c99bff	radeon/llvm: Remove dead code from the AMDILISelLowering class	2012-06-06 13:46:03 -04:00
Tom Stellard	8cc9b463de	radeon/llvm: Don't lower RETURN to S_ENDPGM on SI Instead create an S_ENDPGM instruction in the CodeEmitter and emit it after all the other instructions.	2012-06-06 13:46:03 -04:00
Tom Stellard	de7366701d	radeon/llvm: Remove AMDIL VCREATE* instructions This obsoletes the AMDGPULowerInstruction pass.	2012-06-06 13:46:03 -04:00
Tom Stellard	8d53ddb375	radeon/llvm: Remove AMDIL LOADCONST* instructions This obsoletes the R600LowerInstruction and SIPropagateImmReads passes.	2012-06-06 13:46:03 -04:00
Marcin Slusarz	17e047242e	nouveau: fix scratch buffer leak ...and create common function for destroying nouveau_context	2012-06-05 23:58:43 +02:00
Marcin Slusarz	3232a86efe	nv50: fix nv50_stream_output_state leak	2012-06-05 23:58:43 +02:00
Marcin Slusarz	cfa7cb991c	nv50: fix symbol table memory leak	2012-06-05 23:58:43 +02:00
Kenneth Graunke	2f18698220	i965/fs: Fix user-defined FS outputs with less than four components. OpenGL allows you to declare user-defined fragment shader outputs with less than four components: out ivec2 color; This makes sense if you're rendering to an RG format render target. Previously, we assumed that all color outputs had four components (like the built-in gl_FragColor/gl_FragData variables). This caused us to call emit_color_write for invalid indices, incrementing the output virtual GRF's reg_offset beyond the size of the register. This caused cascading failures: split_virtual_grfs would allocate new size-1 registers based on the virtual GRF size, but then proceed to rewrite the out-of-bounds accesses assuming that it had allocated enough new (contiguously numbered) registers. This resulted in instructions that accessed size-1 GRFs with register numbers beyond virtual_grf_next (i.e. registers that were never allocated). Finally, this manifested as live variable analysis and instruction scheduling accessing their temporary array with an out of bounds index (as they're all sized based on virtual_grf_next), and the program would segfault. It looks like the hardware's Render Target Write message requires you to send four components, even for RT formats such as RG or RGB. This patch continues to use all four MRFs, but doesn't bother to fill any data for the last few, which should be unused. +2 oglconforms. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-05 14:41:34 -07:00
Kenneth Graunke	cb18472eca	i965/vs: Fix texelFetchOffset() on pre-Gen7. Commit `4650aea7a5` fixed texelFetchOffset() on Ivybridge, but didn't update the Ironlake/Sandybridge code. +18 piglits on Sandybridge. NOTE: This and `4650aea7a5` are both candidates for stable branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-05 14:41:34 -07:00
Kenneth Graunke	217b62bf00	i965/fs: Fix texelFetchOffset() on pre-Gen7. Commit `f41ecade7b` fixed texelFetchOffset() on Ivybridge, but didn't update the Ironlake/Sandybridge code. +15 piglits on Sandybridge. NOTE: This and `f41ecade7b` are both candidates for stable branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-05 14:41:34 -07:00
Kenneth Graunke	7fde071f04	meta: Fix GL_RENDERBUFFER binding in decompress_texture_image(). This isn't saved/restored by _mesa_meta_begin, so we need to do it manually (like we do for the read/draw framebuffers). Additionally, we neglected to re-bind before the glRenderbufferStorage call. +13 oglconforms. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-05 14:41:34 -07:00
Kenneth Graunke	3edd2ba22b	mesa: Unbind ARB_transform_feedback2 binding points on Delete too. DeleteBuffer needs to unbind from these binding points as well, based on the same rationale as the previous patch. +51 oglconforms (together with the last patch). NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-05 14:41:34 -07:00
Kenneth Graunke	05b086ce93	mesa: Support BindBuffer{Base,Offset,Range} with a buffer of 0. _mesa_lookup_bufferobj returns NULL for 0, which caused us to say "there's no such buffer object" and raise an error, rather than correctly binding the shared NullBufferObj. Now you can unbind your buffers. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-05 14:41:33 -07:00
Kenneth Graunke	cb8ed93dd0	mesa: Unbind ARB_copy_buffer and transform feedback buffers on delete. According to the GL 3.1 spec, section 2.9 ("Buffer Objects"): "If a buffer object is deleted while it is bound, all bindings to that object in the current context (i.e. in the thread that called DeleteBuffers) are reset to zero." The code already checked for a number of cases, but neglected these newer binding points. +21 oglconforms. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-05 14:41:33 -07:00
Kenneth Graunke	25edfbfccf	glsl/builtins: Fix textureGrad() for Array samplers. We were incorrectly assuming that the coordinate's dimensionality is equal to the gradient's dimensionality. For array types, the coordinate has one more component. Fixes 12 subcases of oglconform's glsl-bif-tex-grad test. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-05 14:41:33 -07:00
Kristian Høgsberg	2c4f6ceeb4	configure.ac: Fail if egl x11 platform dependencies are not available Currently, if you pass --with-egl-platforms=x11 but xcb-dri2 isn't available we just silently fail and disable building the EGL DRI2 driver. This commit cleans up the EGL platform checking and fails if a selected platform can't find its required dependencies. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-05 14:34:33 -04:00
Alex Deucher	75f9d24ac4	r600g: add new Trinity PCI ids Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-05 10:16:42 -04:00
Alex Deucher	6ce298f9ce	r600g: add new Sumo, Palm, BTC pci ids Note this is a candidate for the stable branch. Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-05 10:15:16 -04:00
Alex Deucher	01b7eb7c74	radeonsi: add new SI pci ids Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-05 10:12:21 -04:00
Paul Berry	555e00fdc3	Fix .gitignore for ralloc-test Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-04 18:11:43 -07:00
Vinson Lee	105f307d90	st/mesa: Fix uninitialized members in glsl_to_tgsi_visitor constructor. Fix uninitialized scalar field defects reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org>	2012-06-02 13:18:40 -07:00
Kenneth Graunke	adbfc4a09a	i965: Implement texture buffer objects on Gen6. Commit `a07cf3397e` added support for TBOs on Gen7, but missed Gen6. Passes piglit -t texture_buffer and oglconform's buffermapping basic.read.texture tests. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-02 12:02:42 -07:00
Kenneth Graunke	608c3d2083	mesa: Restore depth texture state on glPopAttrib(GL_TEXTURE_BIT). According to Table 6.17 in the GL 2.1 specification, DEPTH_TEXTURE_MODE, TEXTURE_COMPARE_MODE, and TEXTURE_COMPARE_FUNC need to be restored on glPopAttrib(GL_TEXTURE_BIT). Makes a number of oglconform tests happier. v2: Make restoration conditional on the ARB_shadow and ARB_depth_texture extensions, as suggested by Brian. I'm not sure that any implementations still remain that don't support those, but why not? NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-02 12:02:42 -07:00
Eric Anholt	775ba11dcd	automake: Connect the libdricore target to make clean. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50480 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-01 16:25:39 -07:00
Tapani Pälli	a9cfd95c24	automake: use -m32 in CCASFLAGS when using --enable-32-bit this fixes libdricore directory build with --enable-32-bit on a x86_64 system Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-01 16:25:39 -07:00
Tom Stellard	0ebf2318b3	radeon/llvm: Fix VTX_READ patterns The VTX_READ instructions were using the ADDRParam ComplexPattern which allows a load instruction's offset to be a register, but VTX_READ instructions can only handle an immediate offset. Also, the load_param pattern fragment had an erroneous return true; statement that was causing it to match the wrong load instructions.	2012-06-01 16:52:26 -04:00
Tom Stellard	c108831d44	radeon/llvm: Emit 2 bytes for vertex fetch offsets	2012-06-01 16:52:26 -04:00
Tom Stellard	85a68814ee	radeon/llvm: Only use indirect (vertex fetch) parameters for kernels Kernel parameters can only be retrieved via vertex fetches. Direct parameters (i.e. parameters stored in the constant buffer) are not supported yet.	2012-06-01 16:52:26 -04:00
Kenneth Graunke	fb79ecb62d	intel: Change vendor string to "Intel Open Source Technology Center". Tungsten Graphics has not existed for several years, and the majority of ongoing development and support is done by Intel. I chose to include "Open Source Technology Center" to distinguish it from, say, the closed source Windows OpenGL driver. The one downside to this patch is that applications that pattern match against "Intel" may start applying workarounds meant for the Windows driver. However, it does seem like the right thing to do. This does change oglconform behavior. Acked-by: Eric Anholt <eric@anholt.net> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Eugeni Dodonov <eugeni.dodonov@intel.com> Acked-by: Keith Packard <keithp@keithp.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-01 14:24:57 -07:00
Ian Romanick	adfe531841	glsl: Remove spurious printf messages These look like debug messages from the switch-statement development. NOTE: This is a candidate for the 8.0 release branch. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-06-01 12:27:04 -07:00
Tom Stellard	d6c2d3722d	radeon/llvm: Eliminate CFGStructurizer dependency on AMDIL instructions Add some hooks to the R600,SI InstrInfo and RegisterInfo classes, so that the CFGStructurizer pass can run without relying on any AMDIL instructions.	2012-06-01 11:28:11 -04:00
Tom Stellard	65917004d9	radeon/llvm: Change prefix on tablegen files to AMDGPU	2012-06-01 11:28:11 -04:00
Tom Stellard	afea59bf65	radeon/llvm: Remove dead code from the R600LowerInstructions pass	2012-06-01 11:28:10 -04:00
Tom Stellard	883a0af53a	radeon/llvm: Remove AMDIL GLOBALSTORE* instructions	2012-06-01 11:28:10 -04:00
Tom Stellard	f2781271c7	radeon/llvm: Remove AMDIL GLOBALLOAD* instructions	2012-06-01 11:28:10 -04:00
Adam Rak	6a829a1b72	r600g: compute support for evergreen Tom Stellard: - Updated for gallium interface changes - Fixed a few bugs: + Set the loop counter + Calculate the correct number of pipes - Added hooks into the LLVM compiler	2012-06-01 11:28:10 -04:00
Tom Stellard	46a13b3b11	clover: Add function for building a clover::module for non-TGSI targets v6 v2: -Separate IR type and LLVM triple -Do the OpenCL C->LLVM IR and linking steps for all PIPE_SHADER_IR types. v3: - Coding style fixes - Removed compatibility code for LLVM < 3.1 - Split build_module_llvm() into three functions: compile(), link(), and build_module_llvm() v4: - Use struct pipe_compute_program v5: - Don't malloc memory for struct pipe_llvm_program v6: - Fix serialization of llvm bytecode Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:28:10 -04:00
Tom Stellard	f2606413ec	gallium: Add struct pipe_llvm_program_header v3 This structure is used as a header that precedes LLVM bytecode programs that are passed to the drivers. v2: - s/pipe_compute_program/pipe_llvm_program/ v3: - Rename to struct pipe_llvm_program_header - Drop the char * prog member Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:28:10 -04:00
Tom Stellard	741463e18d	clover: Remove target argument from compile_program_tgsi() Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:28:10 -04:00
Tom Stellard	d724190bce	clover: Add constructors to some of the module classes v3 This is for the llvm code that can't use extended initializers. v2: - Use const references for vector arguments - Move constructor defs before data members - Initialize all values in the default constructors v3: - Fix typo	2012-06-01 11:28:09 -04:00
Tom Stellard	5cc08efe8f	clover: Add necessary flags to libclllvm_la_CXXFLAGS $(LLVM_CFLAGS) for LLVM defines -DLIBCLC_PATH for libclc path -DCLANG_RESOURCE_DIR for clang includes $(DEFINES) for -DHAVE_LLVM	2012-06-01 11:28:09 -04:00
Tom Stellard	7a6b5d42d8	clover: Link to the necessary LLVM and Clang libs	2012-06-01 11:28:09 -04:00
Tom Stellard	d416780f39	configure.ac: Add variables LLVM_CPPFLAGS and LLVM_LIBDIR	2012-06-01 11:28:09 -04:00
Tom Stellard	c79e7668b2	configure.ac: Add option for libclc path	2012-06-01 11:28:09 -04:00
Tom Stellard	613323b256	clover: Add a function for retrieving a device's preferred ir v3 A device now has two functions for getting information about the IR it needs to return. ir_format() => returns the preferred IR ir_target() => returns the triple for the target that is understood by clang/llvm. v2: - renamed ir_target() to ir_format() - renamed llvm_triple() to ir_target() v3: - Remove unnecessary include - Do proper conversion from std::vector<char> to std::string Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:28:09 -04:00
Francisco Jerez	c4c51153bc	gallium/compute: Add PIPE_COMPUTE_CAP_IR_TARGET v4 v2: Tom Stellard - Update CAP description v3: Tom Stellard - TGSI targets should pass an empty string for this CAP. v4: Tom Stellard - TGSI targets can ignore this CAP. Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:27:53 -04:00
Tom Stellard	1d118a2a76	gallium: Add PIPE_SHADER_IR_LLVM to enum pipe_shader_ir v2 v2: - s/PIPE_SHADER_IR_LLVM_R600/PIPE_SHADER_IR_LLVM/ Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:26:57 -04:00
Tom Stellard	d85e512374	configure.ac: Add HAVE_OPENCL AM_CONDITIONAL v2 v2: - Drop HAVE_OPENCL variable for non-automake builds - s/HAVE_OPENCL/HAVE_GALLIUM_COMPUTE Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:26:57 -04:00
Brian Paul	091a61a8d5	scons: generate the glapitable.h file too	2012-06-01 08:27:21 -06:00
Brian Paul	8009fca501	svga: fix saturated TEX instructions TEX instructions can't do saturation. Do the TEX into a temp reg w/out saturation, then do a MOV_SAT. Reviewed-by: Jakob Bornecrantz <jakob@vmware.com>	2012-05-31 12:54:04 -06:00
Brian Paul	dff36e900c	scons: add code to generate the various GL API files This fixes recent build breakage when we began building the generated API files from xml as part of the normal build process. Fixes http://bugs.freedesktop.org/show_bug.cgi?id=50475	2012-05-31 09:40:35 -06:00
Brian Paul	185ed21058	draw: simplify index buffer specification Replace draw_set_index_buffer() and draw_set_mapped_index_buffer() with draw_set_indexes() which simply takes a pointer and an index size.	2012-05-31 09:40:35 -06:00
Kenneth Graunke	151bf6e6cf	glsl/tests: Plumb $(PYTHON2) and $(PYTHON_FLAGS) into optimization-test. Some distributions (like Arch Linux) make /usr/bin/python Python 3, rather than Python 2. Since compare_ir uses /usr/bin/env python, such systems will fail to run optimization-test, causing 'make check' to always fail. Automake's TESTS_ENVIRONMENT variable provides a mechanism to run programs or set environment variables in the test environment. Ideally, I think we would want to use AM_TESTS_ENVIRONMENT, since TESTS_ENVIRONMENT is supposed to be user-overridable. However, it isn't supported using the default/serial test runner. Fixes 'make check' on Arch Linux and Gentoo. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Matt Turner <mattst88@gmail.com>	2012-05-30 21:49:41 -07:00
Kenneth Graunke	a44ccdc876	ralloc: Add some basic unit tests. I started writing unit tests for a new piece of code, and discovered they all failed due to a bug in ralloc. Clearly it needs a test suite. v2: Rename to 'ralloc-test' and fix copyright date. (idr review) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-30 21:49:41 -07:00
Kenneth Graunke	1559b2e2d7	ralloc: Fix ralloc_parent() of memory allocated out of the NULL context. If an object is allocated out of the NULL context, info->parent will be NULL. Using the PTR_FROM_HEADER macro would be incorrect: it would say that ralloc_parent(ralloc_context(NULL)) == sizeof(ralloc_header). Fixes the new "null_parent" unit test. NOTE: This is a candidate for the 7.9, 7.10, 7.11, and 8.0 branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-30 21:49:40 -07:00
Kenneth Graunke	2224fb6047	automake: Check for 'indent' and fall back to 'cat' if not found. The glapi generator code uses indent to produce more readable code. However, we don't want to make GNU indent a hard build dependency; check for it in configure.ac and fall back to 'cat' if it's not available. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50484 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Acked-by: Ben Widawsky <ben@bwidawsk.net>	2012-05-30 13:39:30 -07:00
Oliver McFadden	ff3eef1aff	mesa: don't compile integer clear shaders for unsupported APIs Discovered while running the Khronos conformance test suite and receiving "implementation error: meta program compile failed." This bug was recently introduced by the i965 clear patch set and would only be detected while using the ES2 API and only on gen6+ hardware. Signed-off-by: Oliver McFadden <oliver.mcfadden@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-30 15:20:34 +03:00
Paul Berry	47b64c9290	i965/blorp: Implement destination clipping and scissoring This patch implements clipping and scissoring of the destination rect for blits that use the blorp engine (e.g. MSAA blits).	2012-05-29 15:35:35 -07:00
Eric Anholt	6a15790632	mesa: Clean up some dricore-related detritus in the old Makefile. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:51 -07:00
Eric Anholt	f9d1562f35	automake: Convert dricore building to automake. This is performed in a subdirectory to avoid needing to convert all of src/mesa/Makefile in one go. I can now cherry-pick a commit containing glapi XML changes, do "(cd src/mapi/glapi/gen && make) && make", and get a working driver. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:48 -07:00
Eric Anholt	e86c40a84d	automake: Add a prefix variable to the common sources lists. In order to do the minimal change for libdricore conversion to automake, I need to put its Makefile.am in a subdirectory. Automake gets whiny/broken if you use GNU make features like "addprefix" or "$(FILES:%=../%)" to munge your *_SOURCES. So, use a plain old variable to be able to substitute in that "../" Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:45 -07:00
Eric Anholt	7d7fe1b037	automake: Rename variables in sources.mak to be automake compatible. *_SOURCES is reserved for files lists for particular automake targets. Also, "-" in the variable names is not allowed. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:42 -07:00
Eric Anholt	b284d4773b	mesa: Remove generated source files during make clean. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:40 -07:00
Eric Anholt	79273b1a7a	glapi: Enable silent rules for generation when used from automake. This variable won't be set when called from non-automake makefiles, but it cleans up shared-glapi's output. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:37 -07:00
Eric Anholt	559d592448	shared-glapi: Don't forget to clean our built file. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:33 -07:00
Eric Anholt	26eaee3245	mesa: Restore installing of libGL for non-dri builds. Reported-by: Sven Joachim <svenjoac@gmx.de> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:30 -07:00
Eric Anholt	0ce0f7c0c8	mesa: Remove the generated glapi from source control, and just build it. Mesa already always depends on python to build. The checked in changes are not reviewed (because any trivial change rewrites the world). We also have been pushing commits between xml change and regen where at-build-time xml-generated code disagrees with committed xml-generated code. And worst of all, sometimes we ("I") check in stale xml-generated code. Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-29 11:51:57 -07:00
Kurt Roeckx	f92b2e5e90	i830: Fix crash for GL_STENCIL_TEST in i830Enable() Commit `87f12bb2d9` tried to fix rb->mt being NULL, but changed this case incorrectly. NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Kurt Roeckx <kurt@roeckx.be> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 11:33:02 -07:00
Marcin Slusarz	8924133627	nv50: hook up forgotten short constant buffer upload method Fixes crash in xorg st.	2012-05-29 20:24:45 +02:00
Tom Stellard	83169900fb	radeon/llvm: Update and fix some comments	2012-05-29 11:59:01 -04:00
Tom Stellard	89ece086bc	radeonsi: Remove use.sgpr* intrinsics, use load instructions instead We now model loading user sgpr values with LLVM IR load instructions that use the USER_SGPR address space. The definition of the sgpr parameter to the use_sgpr() helper function in radeonsi_shader.c has changed so that you can pass raw sgpr values rather than having to divide the sgpr value you want to use by the dword width of the type you want to load.	2012-05-29 11:55:53 -04:00
Tom Stellard	467f51613e	radeonsi: Handle TGSI CONST registers We now emit LLVM load instructions for TGSI CONST register reads, which are lowered in the backend to S_LOAD_DWORD* instructions.	2012-05-29 11:55:52 -04:00
Tom Stellard	32b83e0366	radeon/llvm: Remove AMDILIntrinsicInfo::GetDeclaration function body This function was causing compile errors in the tablegen'd code for some intrinsic definitions. I don't think we really need this function, so I'm removing the function body just as a temporary solution. I'll look into removing the entire AMDILIntrinsicInfo class later.	2012-05-29 11:55:52 -04:00
Tom Stellard	49fb99bd13	radeon/llvm: Remove AMDILTargetMachine	2012-05-29 11:55:52 -04:00
Christoph Bumiller	94a25b216b	nouveau: unreference fences on resource destruction	2012-05-29 17:00:20 +02:00
Christoph Bumiller	1a21e36b68	nvc0: optimize blend cso by checking which by-RT data actually differs Can save about 200 bytes of command buffer space.	2012-05-29 17:00:18 +02:00
Christoph Bumiller	f09ee76c98	nvc0: don't upload UCPs if the shader doesn't use them	2012-05-29 17:00:15 +02:00
Christoph Bumiller	79eed0d224	nvc0/ir: allow 64-bit constant loads on nve4 Looks like only 128-bit access doesn't work.	2012-05-29 17:00:10 +02:00
Christoph Bumiller	40c224a573	nvc0/ir: fix texture barrier insertion to prevent WAW hazards Fixes, for instance, object highlighting in Diablo 3 (wine).	2012-05-29 15:01:41 +02:00
Christoph Bumiller	0d818cdacc	nvc0/ir: TEX doesn't support JOIN modifier either	2012-05-29 15:01:41 +02:00
Christoph Bumiller	f80c2874ec	gallium: add st_api feature mask to prevent advertising MS visuals v2: use a define for the maximum sample count v3: also test odd sample counts (r300 supports MS3) While multisample renderbuffers are supported by mesa, MS visuals are not, so we need a way to tell dri/st not to advertise them even if the gallium driver does support multisampled surfaces. Otherwise applications selecting these non-functional visuals would run into trouble ... Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-29 15:01:41 +02:00
Roy Spliet	6404095fba	nv30: Fix generic passing to fragment program in NV34.	2012-05-25 22:42:54 +02:00
Christoph Bumiller	384ef28cb3	nv30: handle user index buffers	2012-05-25 22:42:54 +02:00
Tom Stellard	704eac0916	radeon/llvm: Use a custom inserter for MASK_WRITE	2012-05-25 15:40:59 -04:00
Tom Stellard	4863477e22	radeon/llvm: Use tablegen pattern to lower bitconvert	2012-05-25 15:40:59 -04:00
Tom Stellard	667cdba211	radeon/llvm: Use a custom inserter to lower FNEG	2012-05-25 15:40:58 -04:00
Tom Stellard	d784bc7740	radeon/llvm: Use a custom inserter to lower CLAMP	2012-05-25 15:40:58 -04:00
Tom Stellard	17f8528923	radeon/llvm: Use a custom inserter to lower FABS	2012-05-25 15:40:58 -04:00
Kai Wasserbäch	2df2c31087	r600g: handle R16G16B16_FLOAT and R32G32B32_FLOAT in translate_colorswap Fixes https://bugs.freedesktop.org/show_bug.cgi?id=50318 Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org>	2012-05-25 20:41:01 +02:00
Brian Paul	1609efb418	draw: fix primitive restart bug by using the index buffer offset The code which scans the index buffer for restart indexes wasn't adding the index buffer offset so we were always starting at offset=0. The offset is usually zero so it wasn't noticed before. Fixes a failure in the piglit primitive-restart test when testing vertex data + index data in a single VBO. NOTE: This is a candidate for the 8.0 branch.	2012-05-25 10:02:22 -06:00
Brian Paul	93ea5cd80b	svga: remove the special zero-stride vertex array code This code actually hasn't been needed for some time now. We can just treat a zero-stride vertex array like any other non-zero-stride array.	2012-05-25 10:02:22 -06:00
Brian Paul	dcb4ec5ae1	gallium/docs: beef up the docs related to color clamping Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-05-25 10:02:22 -06:00
Brian Paul	9c85687439	util: add GALLIUM_LOG_FILE option for logging output to a file Useful for logging different runs to files and diffing, etc.	2012-05-25 10:02:21 -06:00
Paul Berry	ab014adaed	i965/msaa: Enable 4x MSAA on Gen7. Basic 4x MSAA support now works on Gen7. This patch enables it. As with Gen6, MSAA support is still fairly preliminary. In particular, the following are not yet supported: - 8x oversampling (Gen7 has hardware support for this, but we do not yet expose it). - Fully general blits between MSAA and non-MSAA buffers. - Formats other than RGBA8, DEPTH24, and STENCIL8. - Centroid interpolation. - Coverage parameters (glSampleCoverage, GL_SAMPLE_ALPHA_TO_COVERAGE, GL_SAMPLE_ALPHA_TO_ONE, GL_SAMPLE_COVERAGE, GL_SAMPLE_COVERAGE_VALUE, GL_SAMPLE_COVERAGE_INVERT). Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	4725ba03ca	i965/msaa: Implement manual blending operation for Gen7. On Gen6, the blending necessary to blit an MSAA surface to a non-MSAA surface could be accomplished with a single texturing operation. On Gen7, the WM program must fetch each sample and blend them together manually. From the Bspec (Shared Functions/Messages/Initiating Message/Message Types/sample): [DevIVB+]:Number of Multisamples on the associated surface must be MULTISAMPLECOUNT_1. This patch implements the manual blend operation. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	8b1f467cce	i965/msaa: Modify blorp code to account for Gen7 MSAA layouts. Since blorp uses color textures and render targets to do all its work (even when blitting stencil and depth data), it always has to configure the Gen7 GPU to use the new "sliced" MSAA layout. However, when blitting stencil or depth data, the actual MSAA layout is interleaved (as in Gen6). Therefore, blorp has to do extra coordinate transformation work to account for the interleaving manually. This patch causes blorp to perform the necessary extra coordinate transformations. It also modifies the blorp SURFACE_STATE setup code for Gen7, so that it does not try to correct the surface width and height to account for MSAA, since "sliced" MSAA layout doesn't affect the surface width or height. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	31f3dfd59b	i965/msaa: Validate Gen7 surface state constraints. When a Gen7 SURFACE_STATE is configured for MSAA, a number of additional constraints come into play. This patch adds a function gen7_check_surface_setup() which verifies that all of those constraints are met. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	455ac56272	i965/msaa: Properly handle sliced layout for Gen7. Starting in Gen7, there are two possible layouts for MSAA surfaces: - Interleaved, in which additional samples are accommodated by scaling up the width and height of the surface. This is the only layout available in Gen6. On Gen7 it is used for depth and stencil surfaces only. - Sliced, in which the surface is stored as a 2D array, with array slice n containing all pixel data for sample n. On Gen7 this layout is used for color surfaces. The "Sliced" layout has an additional requirement: it must be used in ARYSPC_LOD0 mode, which means that the surface doesn't leave any extra room between array slices for miplevels other than 0. This patch modifies the surface allocation functions to use the correct layout when allocating MSAA surfaces in Gen7, and to set the array offsets properly when using ARYSPC_LOD0 mode. It also modifies the code that populates SURFACE_STATE structures to ensure that ARYSPC_LOD0 mode is selected in the appropriate circumstances. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	0e11b2c5af	i965/msaa: Add defines for Gen7. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	b08545199a	i965/blorp: Enable blorp blits on Gen7. Gen7 support for blorp (blits using the render path) now works for non-MSAA purposes. This patch enables it. Since blorp operations re-use the logic for HiZ ops, this required adding a case to the switch statement in gen7_blorp_emit_wm_config(), to allow for the case where no HiZ op is being performed. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	1c73c705fa	i965/blorp: Implement proper texel fetch messages for Gen7. On Gen6, texel fetch is always accomplished using the SAMPLE_LD message, which accepts arguments (u, v, r, lod, si). On Gen7, there are two* texel fetch messages: SAMPLE_LD for non-MSAA surfaces, taking arguments (u, lod, v), and SAMPLE_LD2DSS for MSAA surfaces, taking arguments (si, u, v). *Technically, there are other texel fetch messages, but they are used for "compressed" MSAA surfaces, which we don't yet support. This patch adds the proper message types and argument orderings for Gen7. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	f2cdfa4c85	i965/blorp: Use 16 pixel dispatch on Gen7. Gen7 hardware requires us to enable at least one WM dispatch mode, even if there is no program being dispatched to. When this code was only used for HiZ operations (which don't use a WM program), we used 32-pixel dispatch, because it didn't matter. But blit programs are compiled for 16-pixel dispatch. So just enable 16-wide dispatch unconditionally. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> v2: Enable 16-wide dispatch unconditionally rather than add the unnecessary complication of using 32-wide dispatch when there is no WM program.	2012-05-25 08:45:11 -07:00
Paul Berry	f7df7917e0	i965/blorp: Allocate space for push constants on Gen7. On Gen7, push constants for shader programs are stored in the URB, so blorp code needs to set aside space for them. This was previously unnecessary because blorp code was based on HiZ operations, which don't require any shaders. This patch adds a call from gen7_blorp_exec() to gen7_allocate_push_constants(), to ensure that push constants are assigned the correct location in the URB. It also extracts a new function gen7_emit_urb_state() from gen7_upload_urb(), which is re-used by gen7_blorp_emit_urb_config() to ensure that the URB regions used by all the pipeline stages leave room for the push constants. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	de9752a4e5	i965/blorp: Set the dynamic state upper bound. We know from previous bug fixes (commits `c25e5300cb` and `b2ace06cbb`) that texture border color doesn't work if the dynamic state upper bound is set to 0. Although the blorp engine doesn't make use of texture borders, it seems like we ought to err on the safe side and set this value properly. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	f77959b2c9	i965/blorp: Factor gen6_blorp_emit_batch_head into separate functions. This patch separates out the portions of gen6_blorp_emit_batch_head() that emit 3DSTATE_MULTISAMPLE, 3DSTATE_SAMPLE_MASK, and STATE_BASE_ADDRESS. This paves the way for making the blorp code work on Gen7, where additional command packets (3DSTATE_PUSH_CONSTANT_ALLOC_VS and 3DSTATE_PUSH_CONSTANT_ALLOC_PS) need to be emitted before 3DSTATE_MULTISAMPLE. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	34a5f12e35	i965/blorp: Use MSDISPMODE_PERSAMPLE rendering when necessary This patch modifies the "blorp" WM program so that it can be run in MSDISPMODE_PERSAMPLE (which means that every single sample of a multisampled render target is dispatched to the WM program, not just every pixel). Previously we were using the ugly hack of configuring multisampled destination surfaces as single-sampled, and generating sample indices other than zero by swizzling the pixel coordinates in the WM program. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-25 08:45:10 -07:00
Paul Berry	233c207e9e	i965/blorp: Emit sample index in SAMPLE_LD message when necessary This patch modifies the function brw_blorp_blit_program::texel_fetch() to emit the SI (sample index) argument to the SAMPLE_LD message when reading from a sample index other than zero. Previously we were using the ugly hack of configuring multisampled source surfaces as single-sampled, and accessing sample indices other than zero by swizzling the texture coordinates in the WM program. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:10 -07:00
Paul Berry	665dc82bdc	i965/blorp: Generalize sampling code in preparation for Gen7 This patch generalizes the function brw_blorp_blit_program::texture_lookup() so that it prepares the arguments to the sampler message based on a caller-provided array rather than assuming the argument order is always (u, v). This paves the way for the messages we will need to use in Gen7, which use argument orders (u, lod, v) and (si, u, v) (si=sample index). It will also allow us to read from arbitrary sample indices on Gen6, by supplying the arguments (u, v, r, lod, si) to the SAMPLE_LD message instead of just (u, v). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:10 -07:00
Paul Berry	52fcc36f11	i965/msaa: Expand odd-sized MSAA surfaces to account for interleaving pattern. Gen6 MSAA buffers (and Gen7 MSAA depth/stencil buffers) interleave MSAA samples in a complex pattern that repeats every 2x2 pixel block. Therefore, when allocating an MSAA buffer, we need to make sure to allocate an integer number of 2x2 blocks; if we don't, then some of the samples in the last row and column will be cut off. Fixes piglit tests "EXT_framebuffer_multisample/unaligned-blit {2,4} color msaa" on i965/Gen6. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-25 08:45:10 -07:00
Thomas Gstädtner	93594f38be	gallium/targets: pass ldflags parameter to MKLIB Without passing the -ldflags parameter before $(LDFLAGS) in some cases flags will be passed to MKLIB which it does not understand. This might be -m64, -m32 or similar. NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Thomas Gstädtner <thomas@gstaedtner.net> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-05-25 09:36:24 -06:00
Vadim Girlin	a1a0974401	Revert "r600g: set round_mode to truncate and get rid of tgsi_f2i on evergreen" This reverts commit `60bf0f05b4`. It seems round_mode behaves differently in some cases depending on the instruction/slot. Reverting it for now. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=50232 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:28:08 +04:00
Vadim Girlin	1c5c4243c9	radeon/llvm: add FLT_TO_UINT, UINT_TO_FLT instructions Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:27:46 +04:00
Vadim Girlin	5a1b59b4e6	radeon/llvm: prepare to revert the round mode state to default Use TRUNC before FLT_TO_INT on evergreen/cayman. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:27:33 +04:00
Vadim Girlin	7fa7c608cb	radeon/llvm: fix sampler index in llvm_emit_tex Sampler index isn't a second source operand for some tgsi texture instructions. Let's assume it's always the last. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=50230 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:27:23 +04:00
Vadim Girlin	029776753b	radeon/llvm: fix opcode for RECIP_UINT_r600 Fixes https://bugs.freedesktop.org/show_bug.cgi?id=50312 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Tested-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:23:06 +04:00
Vadim Girlin	6806f81fb4	radeon/llvm/loader: convert hardcoded gpu name to option Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:22:38 +04:00
Vadim Girlin	482041a538	r600g: add RECIP_INT, PRED_SETE_INT to r600_bytecode_get_num_operands Fixes https://bugs.freedesktop.org/show_bug.cgi?id=50315 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Tested-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:22:12 +04:00
Vinson Lee	35f302d97e	i915g: Check for geometry shader earlier in i915_set_constant_buffer. Fix resource leak defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-24 18:50:07 -07:00
Vinson Lee	5cf693266f	scons: Fix SCons build infrastructure for FreeBSD. This patch gets the FreeBSD SCons build working again. The build still fails though. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-24 18:49:40 -07:00
Tom Stellard	33e7db9a1d	radeon/llvm: Lower UDIV using the Selection DAG	2012-05-24 14:12:32 -04:00
Tom Stellard	d088da917b	radeon/llvm: Remove auto-generated AMDIL->ISA conversion code	2012-05-24 14:12:32 -04:00
Tom Stellard	662ccbfc21	radeon/llvm: Remove AMDIL instructions MULHI, SMUL	2012-05-24 14:12:32 -04:00
Tom Stellard	177b420283	radeon/llvm: Remove AMDIL bitshift instructions (SHL, SHR, USHR)	2012-05-24 14:12:32 -04:00
Tom Stellard	9d41a401dc	radeon/llvm: Remove AMDIL FTOI and ITOF instructions	2012-05-24 14:12:32 -04:00
Tom Stellard	a8ba697c1e	radeon/llvm: Remove AMDIL EXP* instructions	2012-05-24 14:12:31 -04:00
Tom Stellard	dd9927eb36	radeon/llvm: Remove AMDIL ADD instructions	2012-05-24 14:12:31 -04:00
Tom Stellard	1404e6b9fc	radeon/llvm: Remove AMDIL binary instructions (OR, AND, XOR, NOT)	2012-05-24 14:12:31 -04:00
Tom Stellard	3059c075a7	radeon/llvm: Remove AMDILMachinePeephole pass	2012-05-24 14:12:31 -04:00
Tom Stellard	e9d8901a80	radeon/llvm: Remove AMDIL CMP instructions and associated lowering code	2012-05-24 14:12:31 -04:00
Tom Stellard	ea00632fe0	radeon/llvm: Remove AMDIL ROUND_NEAREST instruction	2012-05-24 14:12:31 -04:00
Tom Stellard	0bfa3b3e96	radeon/llvm: Remove AMDIL ROUND_POSINF instruction	2012-05-24 14:12:31 -04:00
Tom Stellard	d4984f3463	radeon/llvm: Add custom SDNode for FRACT	2012-05-24 14:12:30 -04:00
Tom Stellard	5523502ff9	radeon/llvm: Use -1 as true value for SET* integer instructions	2012-05-24 14:12:30 -04:00
Tom Stellard	86dfae1103	radeon/llvm: Handle SETGE_INT, SETGE_UINT, and SETGT_UINT opcodes Support for these was inadvertently dropped in commit `cee23ab246`	2012-05-24 14:12:30 -04:00
Tom Stellard	cc7a6d2691	radeon/llvm: Avoid error with SI in EmitInstrWithCustomInserter() We need to return immediately after inserting instructions that require S_WAITCNT so that the parent class' custom inserter won't try to insert them again.	2012-05-24 14:12:30 -04:00
Vinson Lee	0f6a3a7de3	tgsi: Initialize Padding struct fields. Fix uninitialized scalar variable defects reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-23 21:58:37 -07:00
Kenneth Graunke	88128516d4	i965: Gut the separate OpenGL ES extension enabling. We should just set the bits of functionality that we support; the GL/ES1/ES2 flags in extensions.c will take care of advertising the appropriate extensions for the current API. This enables the GL_EXT_texture_compression_dxt1 extension on ES1/ES2 when libtxc_dxtn is installed or the force_s3tc driconf option is set. The main extension code set this up properly, but the ES-specific code failed to do so. Otherwise, the extension strings reported by es1_info, es2_info, and glxinfo all remain the same. This patch manually disables the ARB_framebuffer_object bit on ES to preserve the behavior of `1c0f5d8324`. v2: Rebase, fix the i915 Makefile, and unconditionally set the OES_draw_texture bit as core Mesa will only apply it to ES1 now. Tested-by: Daniel Charles <daniel.charles@intel.com> [v1] Reviewed-by: Chad Versace <chad.versace@linux.intel.com> [v1] Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 17:17:54 -07:00
Kenneth Graunke	d4667516b6	mesa: Remove the OES_draw_texture extension from ES2. This extension appears to be written against ES 1.0. In ES 2.0, you really want to be using FBOs instead. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 17:03:35 -07:00
Jordan Justen	dc50145253	i965: use cut index to handle primitive restart when possible If the primitive restart index and the primitive type can be handled by the cut index feature, then use the hardware to handle the primitive restart feature. The VBO module's software handling of primitive restart is used as a fall back. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-23 15:19:09 -07:00
Jordan Justen	f9389fbfb2	i965: add flag to enable cut_index When brw->prim_restart.enable_cut_index is set, the cut index will be enabled when uploading index_buffer commands. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-23 15:19:09 -07:00
Jordan Justen	df7d1323de	i965: create code path to handle primitive restart in hardware For newer hardware we disable the VBO module's software handling of primitive restart. We now handle primitive restarts in brw_handle_primitive_restart. The initial version of brw_handle_primitive_restart simply calls vbo_sw_primitive_restart, and therefore still uses the VBO module software primitive restart support. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-23 15:19:09 -07:00
Paul Berry	9f6932cb83	glsl/tests: Add .gitignore for uniform initialization unit test. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-23 14:24:33 -07:00
Paul Berry	aa173e16a0	glsl/constant propagation: kill whole var if LHS involves array indexing. When considering which components of a variable were killed by an assignment, constant propagation would previously just use the write mask of the assignment. This worked if the LHS of the assignment was simple, e.g.: v.xy = ...; // (assign (xy) (var_ref v) ...) But it did the wrong thing if the LHS of the assignment involved an array indexing operator, since in this case the write mask is always (x): v[i] = ...; // (assign (x) (deref_array (var_ref v) (var_ref i)) ...) In general, we can't predict which vector component will be selected by array indexing, so the only safe thing to do in this case is to kill the entire variable. Fixes piglit tests {fs,vs}-vector-indexing-kills-all-channels.shader_test. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-05-23 14:21:48 -07:00
Ian Romanick	b45052b3f7	glsl/tests: Add test for uniform initialization by the linker v2: Put unit tests in src/glsl/tests rather than tests/glsl. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:42:08 -07:00
Ian Romanick	49da2590c2	mesa: Use initializers to configure samplers Now that the linker handles initializers of samplers just like any other uniform, a bunch of this annoying code is unnecessary. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:42:08 -07:00
Ian Romanick	75dac69262	ir_to_mesa: Don't set initial uniform values again This work is now done by the linker, so we don't need to keep doing it here. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:42:08 -07:00
Ian Romanick	c343b980d6	ir_to_mesa: Propagate initial values in _mesa_associate_uniform_storage The linker may have set initial values for uniforms. Propagate these values to the driver's backing storage when it is first associated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:42:08 -07:00
Ian Romanick	76027f5b5c	glsl: Propagate sampler uniform initializers to gl_shader_program::SamplerUnits Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:42:07 -07:00
Ian Romanick	b610881317	glsl: Initialize samplers to 0, propagate sampler values to the gl_program The spec requires that samplers be initialized to 0. Since this differs from the 1-to-1 mapping of samplers to texture units assumed by ARB assembly shaders (and the gl_program structure), be sure to propagate this data from the gl_shader_program to the gl_program. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> CC: Vadim Girlin <vadimgirlin@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=49088	2012-05-23 11:42:07 -07:00
Ian Romanick	a2e623054b	glsl: Set initial values for uniforms in the linker v2: Fix handling of arrays-of-structure. Thanks to Eric Anholt for pointing this out. v3: Minor comment change based on feedback from Ken. Fixes piglit glsl-1.20/execution/uniform-initializer/fs-structure-array and glsl-1.20/execution/uniform-initializer/vs-structure-array. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:41:49 -07:00
Eric Anholt	29362875f2	i965/gen6+: Add support for GL_ARB_blend_func_extended. v2: Add support for gen6, and don't turn it on if blending is disabled. (fixes GPU hang), and note it in docs/GL3.txt Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 10:46:15 -07:00
Eric Anholt	175ad8050e	mesa: Keep a computed value for dual source blend func with each buffer. The i965 driver needed this as well for hardware setup, so instead of duplicating the logic, just save it off. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Dave Airlie <airlied@redhat.com>	2012-05-23 10:45:43 -07:00
Eric Anholt	68216f3581	i965/gen6+: Add support for fast depth clears. Improves citybench high-res performance 3.0% +- 0.4%, n=10. Improves Lightsmark 1024x768 performance 0.74% +/- 0.20% (n=78). No significant difference on openarena (n=5, didn't fast clear) or nexuiz (n=3). Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:40:11 -07:00
Eric Anholt	5b248e5982	i965/gen6: Add CC viewport state setup to blorp code. While it doesn't have the same warning in the simulator as in gen7, let's emit it out of paranoia. We wouldn't want our resolves of some previous clear to get clamped to some current clamping value. Suggested-by: pretty much everyone	2012-05-23 10:39:45 -07:00
Eric Anholt	39a91be20d	i965/gen7: Add CC viewport setup to blorp code. When doing fast clears, a fulsim warning said that the batch was being emitted without the viewport set up. While the fast clear pass I was looking at doesn't use the clear value, the later resolves which also didn't set up the viewport would trigger the same. It's not obvious from the error message whether it meant "fast clear value gets clamped to something you haven't defined" or "fast clear value doesn't get clamped, and I saw it was out of the current (uninitialized) range, and you probably wanted it clamped to that (uninitialized) range". Be paranoid and assume the first case. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:27 -07:00
Eric Anholt	54308f78a2	i965: Drop a layer of indirection in doing HiZ resolves. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:27 -07:00
Eric Anholt	072634da4a	i965: Replace intel_need_resolve with the hiz ops it maps to. Having this enum separate caused us to need a bunch of helper functions to translate to the op to be executed. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:27 -07:00
Eric Anholt	5b226ad603	i965: Add an interface for doing hiz ops from C code. This required moving gen6_hiz_op, and I put it in intel_resolve_map.h for the next commit. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:27 -07:00
Eric Anholt	7da9795070	i965: Rename the clear function for this driver. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	3e1656567c	i965: Simplify the remaining clear logic by relying on the meta clear. The GLSL clear path doesn't need any buffer presence checks, since those are already handled in the normal drawing path code. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	7c3e88f1fc	i965: Switch blit color clears to tri clears on gen4/5. Our understanding is that the 3D engine is supposed to be faster anyway. We used to have more overhead in our tri clear path than we do today, which would have led to this choice. But given that we almost always see a depth clear along with a color clear, the path was hardly exercised anyway. Also, the color mask logic was broken in the presence of GL_EXT_draw_buffers2's per-buffer colormask. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	fa15b0f3f0	i965: Remove dead logic for non-tri depth/stencil clears. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	a3967ff441	i965: We always have GLSL, so always use it for tri clears. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	03c9044c2e	i915: Drop gen4+ code from the forked clear code. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	11892ea986	intel: Fork the intel_clear.c file between i915 and i965. This logic is wasted on i965 when we want to just always do GLSL tri clears. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Vadim Girlin	c91b4edff9	st/mesa: set stObj->lastLevel in guess_and_alloc_texture Fixes lockups/asserts with depthstencil-render-miplevels tests and r600g. Should also fix https://bugs.freedesktop.org/show_bug.cgi?id=50033 NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-23 06:07:00 +04:00
Paul Berry	ea8e854b2c	i965: Completely annotate the batch bo when aub dumping. Previously, when the environment variable INTEL_DEBUG=aub was set, mesa would simply instruct DRM to start dumping data to an .aub file, but we would not provide DRM with any information about the format of the data in various buffers. As a result, a lot of the data in the generated .aub file would be unannotated, making further data analysis difficult. This patch causes the entire contents of each batch buffer to be annotated using the data in brw->state_batch_list (which was previously used only to annotate the output of INTEL_DEBUG=bat). This includes data that was allocated by brw_state_batch, such as binding tables, surface and sampler states, depth/stencil state, and so on. The new annotation mechanism requires DRM version 2.4.34. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-05-22 15:19:00 -07:00
Paul Berry	1b87a93983	intel: When AUB dumping, flush before emitting final bitmap command. When we are generating an AUB dump, we make a final call to aub_dump_bmp() as the context is being destroyed, to ensure that any rendering performed before the application exits can be seen during a simulation run. However, we were doing this before flushing the batch buffer; as a result simulation runs would not always see the effect of all rendering commands. This patch flushes the batch buffer just before making the final call to aub_dump_bmp(), to ensure that all rendering is properly captured in the final bitmap.	2012-05-22 15:19:00 -07:00
José Fonseca	7a75e7d6e8	llvmpipe: Fix alpha testing precision on rgba8 formats. This is a long standing problem, that recently surfaced with the change to enable perspective correct color interpolation. A fix for all possible formats is left to the future. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-05-22 19:23:49 +01:00
Vinson Lee	e4fb332af1	scons: Do not build glx and egl on Cygwin. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-05-22 10:15:14 -07:00
Christoph Bumiller	89155ba71d	nv30: check for NULL vertex buffers in prevalidate_vbufs	2012-05-22 15:22:10 +02:00
Christoph Bumiller	a054fd8268	nv50: make unaligned index buffer offsets work again Messed up in `ef7bb28129`.	2012-05-22 12:50:12 +02:00
Christoph Bumiller	91fb5e0394	nvc0: don't set NEW_IDXBUF in nvc0_switch_pipe_context if none is bound	2012-05-22 12:45:19 +02:00
James Benton	8a933e36d1	llvmpipe: Added an error counter to lp_test_conv. Useful for keeping track of progress when fixing errors! Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:53 +01:00
James Benton	383c1b649b	llvmpipe: Changed known failures in lp_test_conv. To comply with the recent fixes to lp_bld_conv. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:51 +01:00
James Benton	4203a0b034	llvmpipe: Added fixed point types tests to lp_test_conv. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:49 +01:00
James Benton	a3d4af0c00	gallivm: Fixed erroneous optimisation in lp_build_min/max. Previously assumed normalised was 0 to 1, but it can be -1 to 1 if type is signed. Tested with lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:47 +01:00
James Benton	fdeb0394cb	gallivm: Compensate for lp_const_offset in lp_build_conv. Fixing a /FIXME/ to remove errors in integer conversion in lp_build_conv. Tested using lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:46 +01:00
James Benton	f89b1f4ba4	gallivm: Fixed overflow in lp_build_clamped_float_to_unsigned_norm. Tested with lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:44 +01:00
Brian Paul	c286278481	docs: add link to 8.0.3 release notes	2012-05-21 09:26:04 -06:00
Paul Seidler	a0dffe8701	tests: include mesa headers else they will fail for fresh installs Signed-off-by: Brian Paul <brianp@vmware.com>	2012-05-21 08:42:19 -06:00
Lukas Rössler	6178b653c7	glu: fix two Clang warnings This patch removes two Clang warnings in GLU: The first one seems to be an actual bug in mapdesc.cc: Clang complains that sizeof(dest) will return the size of REAL*[MAXCOORDS], instead of the intended REAL[MAXCOORDS][MAXCOORDS]. The second one is just cosmetic because Clang doesn't like extra parentheses. NOTE: This is a candidate for the 8.0 branch Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-21 08:29:23 -06:00
Homer Hsing	ed9d1bef81	docs: fix a typo Signed-off-by: Brian Paul <brianp@vmware.com>	2012-05-21 08:07:20 -06:00
ojab	3d2bf91cc1	Filter out -Wcovered-switch-default from LLVM_CFLAGS Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 08:37:06 +01:00
Tom Stellard	cee23ab246	radeon/llvm: Handle selectcc DAG node R600 can now select instructions from the selectcc DAG node, which is typically lowered to one of the SET* instructions.	2012-05-20 16:27:31 -04:00
Brian Paul	239792fb22	st/mesa: use pipe_sampler_view_release() in st_destroy_context_priv() Fixes another case of sampler views being created by one context, shared by another, then deleted by the first, leaving a dangling pipe context pointer. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-05-19 08:28:57 -06:00
Brian Paul	c9cb9cf050	mesa: use F_TO_I() instead of IROUND() Use it where performance matters more and the exact method of float->int conversion/rounding isn't terribly important. There should be no net change here since F_TO_I() is the new name of the old IROUND() function. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-05-19 08:28:57 -06:00
Brian Paul	699c1894ee	mesa: reimplement IROUND(), add F_TO_I() The different implementations of IROUND() behaved differently and, in the case of fistp, depended on the current x86 FPU rounding mode. This caused some tests like piglit roundmode-pixelstore and roundmode-getintegerv to fail on 32-bit x86 but pass on 64-bit x86. Now IROUND() always rounds to the nearest integer (away from zero). The new F_TO_I function converts a float to an int by whatever means is fastest. We'll use this where we're more concerned with performance and not too worried about how the conversion is done. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-05-19 08:28:57 -06:00
Brian Paul	31d59c78f0	mesa: fix Z32_FLOAT -> uint conversion functions The IROUND converted all arguments to 0 or 1. That's not what we wanted. NOTE: This is a candidate for the 8.0 branch. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-05-19 08:28:57 -06:00
Brian Paul	c3991e1c57	st/mesa: remove unused pipe variable	2012-05-19 08:28:57 -06:00
Brian Paul	bd302f36c4	svga: whitespace, comments, formatting clean-ups	2012-05-19 08:28:57 -06:00
Brian Paul	6792969cbc	st/mesa: added st_print_current_vertex_program(), for debugging	2012-05-19 08:28:56 -06:00
Brian Paul	2786343896	svga: return PIPE_OK instead of 0 And fix the emit_rss() function's return type.	2012-05-19 08:28:56 -06:00
Brian Paul	fc71e0b4a8	svga: fix zero-stride vertex array bug For zero-stride vertex arrays, the svga driver copies the value into the constant value and uses that value in the shader. The recent gallium-userbuf changes caused a regression in this. An example symptom was per-primitive glColor3f() calls getting ignored. Where we copied the vertex value from the vertex buffer to the constant buffer we neglected to take into account the pipe_vertex_buffer::buffer_offset field. Adding that value to the source offset fixes the problem. Actually, it looks like we should have been doing this all along, but it never was an issue before for some reason.	2012-05-19 08:28:56 -06:00
Brian Paul	0161691f35	mesa: add GLSL_REPORT_ERRORS debug flag If the MESA_GLSL env var contains "errors", GLSL compilation and link errors will be reported to stderr. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-19 08:28:56 -06:00
Brian Paul	1c333745f3	mesa: add some comments on shaderapi.c functions	2012-05-19 08:28:56 -06:00
Vinson Lee	315140969d	mesa: Remove undefinition of _P symbol. IRIX isn't used anymore. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-18 23:24:33 -07:00
Ian Romanick	0c6f4cd335	Import release notes for 8.0.3, add news item Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-18 16:27:17 -07:00
Jeremy Huddleston	27b821bc95	darwin: Address a build failure on Leopard and earlier OS versions <https://trac.macports.org/ticket/34499> Regression-from: `51691f0767` Signed-off-by: Jeremy Huddleston <jeremyhu@apple.com>	2012-05-18 11:32:40 -07:00
Michel Dänzer	d59b2c4b53	radeonsi: Only honour point related rasterizer state when rendering points. Avoids hangs when not rendering points.	2012-05-18 18:13:56 +02:00
Michel Dänzer	dd9d619459	radeonsi: Fix parameter cache offsets for fragment shader inputs.	2012-05-18 15:01:10 +02:00
Vinson Lee	e8a86d36f3	gallium/tgsi/text: Ensure ret is initialized in parse_immediate_data. Fix uninitialized scalar variable defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-17 21:59:08 -07:00
Tom Stellard	c20e741799	radeon/llvm: Fix segfault while lowering lrp intrinsic	2012-05-17 20:42:16 -04:00
Tom Stellard	7e3cd8df18	radeon/llvm: Add DAG nodes for MIN instructions Also, remove the AMDIL MIN* instruction defs.	2012-05-17 20:42:16 -04:00
José Fonseca	3f7a5ffac7	llvmpipe: Avoid adding floating point zero to flat inputs. Which could clobber integer inputs, if the addition is not optimized away (e.g., if optimizations are disabled for debugging purposes).	2012-05-18 01:03:13 +01:00
José Fonseca	00eb74b275	Fix fetching integer inputs.	2012-05-18 00:55:13 +01:00
Olivier Galibert	5d10d75727	llvmpipe: Implement TXQ. Piglit tests for fragment shaders pass, vertex shaders fail. The actual failure seems to be in the interpolators, and not the textureSize query. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: José Fonseca <jose.r.fonseca@gmail.com>	2012-05-18 00:27:28 +01:00
Olivier Galibert	1ec421823b	llvmpipe: Don't mess with the provoking vertex when inverting a triangle. Fixes a bunch of piglit tests related to flat interpolation of floats. Signed-off-by: Olivier Galibert <galibert@pobox.com> Signed-off-by: José Fonseca <jose.r.fonseca@gmail.com>	2012-05-18 00:07:18 +01:00
Tom Stellard	c6c8a05c50	radeon/llvm: Lower lrp intrinsic during ISel	2012-05-17 14:48:10 -04:00
Tom Stellard	ef8e66bc16	radeon/llvm: Remove AMDIL MAD instruction defs	2012-05-17 14:48:10 -04:00
Tom Stellard	d07473fcf4	radeon/llvm: Remove AMDIL MUL_IEEE* instructions	2012-05-17 14:48:10 -04:00
Tom Stellard	5187948bc2	r600g: Handle MUL_IEEE in r600_bytecode_get_num_operands	2012-05-17 14:48:09 -04:00
Tom Stellard	1fe70c6ae1	radeon/llvm: Expand fsub during ISel	2012-05-17 14:48:09 -04:00
Tom Stellard	9916f2d2af	radeon/llvm: Remove AMDIL floating-point ADD instruction defs	2012-05-17 14:48:09 -04:00
Tom Stellard	91484de22d	radeon/llvm: Remove AMDIL CMOVLOG* instruction defs	2012-05-17 14:48:09 -04:00
Tom Stellard	9a020092ae	radeon/llvm: Move lowering of ABS_i32 to ISel	2012-05-17 14:48:09 -04:00
Tom Stellard	89b945591b	radeon/llvm: Remove sub patterns from AMDILInstrPatterns.td	2012-05-17 14:48:09 -04:00
Tom Stellard	431bb79a41	radeon/llvm: Add custom SDNodes for MAX We now lower the various intrinsics for max to SDNodes and then use tablegen patterns to lower the SDNodes to instructions.	2012-05-17 14:48:09 -04:00
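
Commit 699c1894ee in the list above says the reimplemented IROUND() always rounds to the nearest integer, with ties going away from zero, regardless of the FPU rounding mode. The following is a minimal Python sketch of that rounding rule only. The name `iround` is hypothetical and this is not Mesa's C code; it may help when comparing against Python's built-in `round()`, which rounds ties to the nearest even integer instead.

```python
import math


def iround(f: float) -> int:
    """Round to the nearest integer, ties away from zero.

    Hypothetical illustration of the rule stated in the commit message,
    not a copy of Mesa's IROUND() implementation.
    """
    if f >= 0.0:
        # Non-negative values: shift up by one half, then drop the fraction.
        return math.floor(f + 0.5)
    # Negative values: shift down by one half, then drop the fraction.
    return math.ceil(f - 0.5)
```

For example, `iround(2.5)` yields 3 while `round(2.5)` yields 2, which is the kind of tie-handling difference the commit message attributes to the older, mode-dependent implementations.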

1391 changed files with 70453 additions and 173153 deletions

									
										11

.dir-locals.el
									
										Normal file
									
				@@ -0,0 +1,11 @@

				((nil

				  (indent-tabs-mode . nil)

				  (tab-width . 8)

				  (c-basic-offset . 3)

				  (c-file-style . "stroustrup")

				  (fill-column . 78)

				  (eval . (progn

					    (c-set-offset 'innamespace '0)

					    (c-set-offset 'inline-open '0)))

				  )

				 )

10

.emacs-dirvars

@@ -1,10 +0,0 @@
 ;; -*- emacs-lisp -*-
 ;;
 ;; This file is processed by the dirvars emacs package.  Each variable
 ;; setting below is performed when this dirvars file is loaded.
 ;;
 indent-tabs-mode: nil
 tab-width: 8
 c-basic-offset: 3
 kde-emacs-after-parent-string: ""
 evaluate: (c-set-offset 'inline-open '0)

1

.gitignore vendored

@@ -40,3 +40,4 @@ Makefile.in
 .dir-locals.el
 .deps/
 .libs/
 /Makefile

									
										4

Android.common.mk
									
				@@ -47,7 +47,9 @@ LOCAL_CFLAGS += \

				ifeq ($(strip $(MESA_ENABLE_ASM)),true)

				ifeq ($(TARGET_ARCH),x86)

				LOCAL_CFLAGS += \

					-DUSE_X86_ASM

					-DUSE_X86_ASM \

					-DHAVE_DLOPEN \

				endif

				endif

									
										271

Makefile
									
				@@ -1,271 +0,0 @@

				# Top-level Mesa makefile

				TOP = .

				SUBDIRS = src

				# The git command below generates an empty string when we're not

				# building in a GIT tree (i.e., building from a release tarball).

				default: $(TOP)/configs/current

					@$(TOP)/bin/extract_git_sha1

					@for dir in $(SUBDIRS) ; do \

						if [ -d $$dir ] ; then \

							(cd $$dir && $(MAKE)) || exit 1 ; \

						fi \

					done

				all: default

				doxygen:

					cd doxygen && $(MAKE)

				check:

					make -C src/glsl/tests check

					make -C tests check

				clean:

					-@touch $(TOP)/configs/current

					-@for dir in $(SUBDIRS) ; do \

						if [ -d $$dir ] ; then \

							(cd $$dir && $(MAKE) clean) ; \

						fi \

					done

					-@test -s $(TOP)/configs/current || rm -f $(TOP)/configs/current

				realclean: clean

					-rm -rf lib*

					-rm -f $(TOP)/configs/current

					-rm -f $(TOP)/configs/autoconf

					-rm -rf autom4te.cache

					-find . '(' -name '*.o' -o -name '*.a' -o -name '*.so' -o \

					  -name depend -o -name depend.bak ')' -exec rm -f '{}' ';'

				distclean: realclean

				install:

					@for dir in $(SUBDIRS) ; do \

						if [ -d $$dir ] ; then \

							(cd $$dir && $(MAKE) install) || exit 1 ; \

						fi \

					done

				.PHONY: default doxygen clean realclean distclean install check

				# If there's no current configuration file

				$(TOP)/configs/current:

					@echo

					@echo

					@echo "Please choose a configuration from the following list:"

					@ls -1 $(TOP)/configs | grep -v "current\|default\|CVS\|autoconf.*"

					@echo

					@echo "Then type 'make <config>' (ex: 'make linux-x86')"

					@echo

					@echo "Or, run './configure' then 'make'"

					@echo "See './configure --help' for details"

					@echo

					@echo "(ignore the following error message)"

					@exit 1

				# Rules to set/install a specific build configuration

				aix \

				aix-64 \

				aix-64-static \

				aix-gcc \

				aix-static \

				autoconf \

				bluegene-osmesa \

				bluegene-xlc-osmesa \

				catamount-osmesa-pgi \

				darwin \

				darwin-fat-32bit \

				darwin-fat-all \

				freebsd \

				freebsd-dri \

				freebsd-dri-amd64 \

				freebsd-dri-x86 \

				hpux10 \

				hpux10-gcc \

				hpux10-static \

				hpux11-32 \

				hpux11-32-static \

				hpux11-32-static-nothreads \

				hpux11-64 \

				hpux11-64-static \

				hpux11-ia64 \

				hpux11-ia64-static \

				hpux9 \

				hpux9-gcc \

				irix6-64 \

				irix6-64-static \

				irix6-n32 \

				irix6-n32-static \

				irix6-o32 \

				irix6-o32-static \

				linux \

				linux-i965 \

				linux-alpha \

				linux-alpha-static \

				linux-debug \

				linux-dri \

				linux-dri-debug \

				linux-dri-x86 \

				linux-dri-x86-64 \

				linux-dri-ppc \

				linux-dri-xcb \

				linux-egl \

				linux-indirect \

				linux-fbdev \

				linux-ia64-icc \

				linux-ia64-icc-static \

				linux-icc \

				linux-icc-static \

				linux-llvm \

				linux-llvm-debug \

				linux-opengl-es \

				linux-osmesa \

				linux-osmesa-static \

				linux-osmesa16 \

				linux-osmesa16-static \

				linux-osmesa32 \

				linux-ppc \

				linux-ppc-static \

				linux-profile \

				linux-sparc \

				linux-sparc5 \

				linux-static \

				linux-ultrasparc \

				linux-tcc \

				linux-x86 \

				linux-x86-debug \

				linux-x86-32 \

				linux-x86-64 \

				linux-x86-64-debug \

				linux-x86-64-profile \

				linux-x86-64-static \

				linux-x86-profile \

				linux-x86-static \

				netbsd \

				openbsd \

				osf1 \

				osf1-static \

				solaris-x86 \

				solaris-x86-gcc \

				solaris-x86-gcc-static \

				sunos4 \

				sunos4-gcc \

				sunos4-static \

				sunos5 \

				sunos5-gcc \

				sunos5-64-gcc \

				sunos5-smp \

				sunos5-v8 \

				sunos5-v8-static \

				sunos5-v9 \

				sunos5-v9-static \

				sunos5-v9-cc-g++ \

				ultrix-gcc:

					@ if test -f configs/current -o -L configs/current; then \

						if ! cmp configs/$@ configs/current > /dev/null; then \

							echo "Please run 'make realclean' before changing configs" ; \

							exit 1 ; \

						fi ; \

					else \

						cd configs && rm -f current && ln -s $@ current ; \

					fi

					$(MAKE) default

				# Rules for making release tarballs

				PACKAGE_VERSION=8.1-devel

				PACKAGE_DIR = Mesa-$(PACKAGE_VERSION)

				PACKAGE_NAME = MesaLib-$(PACKAGE_VERSION)

				EXTRA_FILES = \

					aclocal.m4					\

					configure					\

					tests/Makefile.in				\

					tests/glx/Makefile.in				\

					src/glsl/glsl_parser.cpp			\

					src/glsl/glsl_parser.h				\

					src/glsl/glsl_lexer.cpp				\

					src/glsl/glcpp/glcpp-lex.c			\

					src/glsl/glcpp/glcpp-parse.c			\

					src/glsl/glcpp/glcpp-parse.h			\

					src/mesa/main/api_exec_es1.c			\

					src/mesa/main/api_exec_es1_dispatch.h		\

					src/mesa/main/api_exec_es1_remap_helper.h	\

					src/mesa/main/api_exec_es2.c			\

					src/mesa/main/api_exec_es2_dispatch.h		\

					src/mesa/main/api_exec_es2_remap_helper.h	\

					src/mesa/program/lex.yy.c			\

					src/mesa/program/program_parse.tab.c		\

					src/mesa/program/program_parse.tab.h

				IGNORE_FILES = \

					-x autogen.sh

				parsers: configure

					-@touch $(TOP)/configs/current

					$(MAKE) -C src/glsl glsl_parser.cpp glsl_parser.h glsl_lexer.cpp

					$(MAKE) -C src/glsl/glcpp glcpp-lex.c glcpp-parse.c glcpp-parse.h

					$(MAKE) -C src/mesa program/lex.yy.c program/program_parse.tab.c program/program_parse.tab.h

				# Everything for new a Mesa release:

				ARCHIVES = $(PACKAGE_NAME).tar.gz \

					$(PACKAGE_NAME).tar.bz2 \

					$(PACKAGE_NAME).zip \

				tarballs: md5

					rm -f ../$(PACKAGE_DIR) $(PACKAGE_NAME).tar

				# Helper for autoconf builds

				ACLOCAL = aclocal

				ACLOCAL_FLAGS =

				AUTOCONF = autoconf

				AC_FLAGS =

				aclocal.m4: configure.ac acinclude.m4

					$(ACLOCAL) $(ACLOCAL_FLAGS)

				configure: configure.ac aclocal.m4 acinclude.m4

					$(AUTOCONF) $(AC_FLAGS)

				manifest.txt: .git

					( \

						ls -1 $(EXTRA_FILES) ; \

						git ls-files $(IGNORE_FILES) \

					) | sed -e '/^\(.*\/\)\?\./d' -e "s@^@$(PACKAGE_DIR)/@" > $@

				../$(PACKAGE_DIR):

					ln -s $(PWD) $@

				$(PACKAGE_NAME).tar: parsers ../$(PACKAGE_DIR) manifest.txt

					cd .. ; tar -cf $(PACKAGE_DIR)/$(PACKAGE_NAME).tar -T $(PACKAGE_DIR)/manifest.txt

				$(PACKAGE_NAME).tar.gz: $(PACKAGE_NAME).tar ../$(PACKAGE_DIR)

					gzip --stdout --best $(PACKAGE_NAME).tar > $(PACKAGE_NAME).tar.gz

				$(PACKAGE_NAME).tar.bz2: $(PACKAGE_NAME).tar

					bzip2 --stdout --best $(PACKAGE_NAME).tar > $(PACKAGE_NAME).tar.bz2

				$(PACKAGE_NAME).zip: parsers ../$(PACKAGE_DIR) manifest.txt

					rm -f $(PACKAGE_NAME).zip ; \

					cd .. ; \

					zip -q -@ $(PACKAGE_NAME).zip < $(PACKAGE_DIR)/manifest.txt ; \

					mv $(PACKAGE_NAME).zip $(PACKAGE_DIR)

				md5: $(ARCHIVES)

					@-md5sum $(PACKAGE_NAME).tar.gz

					@-md5sum $(PACKAGE_NAME).tar.bz2

					@-md5sum $(PACKAGE_NAME).zip

				am--refresh:

				.PHONY: tarballs md5 am--refresh

									
										125

Makefile.am
									
										Normal file
									
				@@ -0,0 +1,125 @@

				# Copyright © 2012 Intel Corporation

				#

				# Permission is hereby granted, free of charge, to any person obtaining a

				# copy of this software and associated documentation files (the "Software"),

				# to deal in the Software without restriction, including without limitation

				# the rights to use, copy, modify, merge, publish, distribute, sublicense,

				# and/or sell copies of the Software, and to permit persons to whom the

				# Software is furnished to do so, subject to the following conditions:

				#

				# The above copyright notice and this permission notice (including the next

				# paragraph) shall be included in all copies or substantial portions of the

				# Software.

				#

				# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR

				# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,

				# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL

				# THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER

				# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING

				# FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS

				# IN THE SOFTWARE.

				SUBDIRS = src

				doxygen:

					cd doxygen && $(MAKE)

				check-local:

					$(MAKE) -C src/mapi/glapi/tests check

					$(MAKE) -C src/mapi/shared-glapi/tests check

					$(MAKE) -C src/mesa/main/tests check

					$(MAKE) -C src/glsl/tests check

					$(MAKE) -C src/glx/tests check

				clean-local:

					-@touch $(top_builddir)/configs/current

					-@for dir in $(SUBDIRS) ; do \

						if [ -d $$dir ] ; then \

							(cd $$dir && $(MAKE) clean) ; \

						fi \

					done

					-@test -s $(top_builddir)/configs/current || rm -f $(top_builddir)/configs/current

				distclean-local:

					-rm -rf lib*

					-rm -f $(top_builddir)/configs/current

					-find . '(' -name '*.o' -o -name '*.a' -o -name '*.so' -o \

					  -name depend -o -name depend.bak ')' -exec rm -f '{}' ';'

				.PHONY: doxygen

				# Rules for making release tarballs

				PACKAGE_VERSION=8.1-devel

				PACKAGE_DIR = Mesa-$(PACKAGE_VERSION)

				PACKAGE_NAME = MesaLib-$(PACKAGE_VERSION)

				EXTRA_FILES = \

					aclocal.m4					\

					configure					\

					src/glsl/glsl_parser.cc				\

					src/glsl/glsl_parser.h				\

					src/glsl/glsl_lexer.cc				\

					src/glsl/glcpp/glcpp-lex.c			\

					src/glsl/glcpp/glcpp-parse.c			\

					src/glsl/glcpp/glcpp-parse.h			\

					src/mesa/main/api_exec_es1.c			\

					src/mesa/main/api_exec_es1_dispatch.h		\

					src/mesa/main/api_exec_es1_remap_helper.h	\

					src/mesa/main/api_exec_es2.c			\

					src/mesa/main/api_exec_es2_dispatch.h		\

					src/mesa/main/api_exec_es2_remap_helper.h	\

					src/mesa/program/lex.yy.c			\

					src/mesa/program/program_parse.tab.c		\

					src/mesa/program/program_parse.tab.h

				IGNORE_FILES = \

					-x autogen.sh

				parsers: configure

					-@touch $(top_builddir)/configs/current

					$(MAKE) -C src/glsl glsl_parser.cc glsl_parser.h glsl_lexer.cc

					$(MAKE) -C src/glsl/glcpp glcpp-lex.c glcpp-parse.c glcpp-parse.h

					$(MAKE) -C src/mesa program/lex.yy.c program/program_parse.tab.c program/program_parse.tab.h

				# Everything for new a Mesa release:

				ARCHIVES = $(PACKAGE_NAME).tar.gz \

					$(PACKAGE_NAME).tar.bz2 \

					$(PACKAGE_NAME).zip

				tarballs: md5

					rm -f ../$(PACKAGE_DIR) $(PACKAGE_NAME).tar

				manifest.txt: .git

					( \

						ls -1 $(EXTRA_FILES) ; \

						git ls-files $(IGNORE_FILES) \

					) | sed -e '/^\(.*\/\)\?\./d' -e "s@^@$(PACKAGE_DIR)/@" > $@

				../$(PACKAGE_DIR):

					ln -s $(PWD) $@

				$(PACKAGE_NAME).tar: parsers ../$(PACKAGE_DIR) manifest.txt

					cd .. ; tar -cf $(PACKAGE_DIR)/$(PACKAGE_NAME).tar -T $(PACKAGE_DIR)/manifest.txt

				$(PACKAGE_NAME).tar.gz: $(PACKAGE_NAME).tar ../$(PACKAGE_DIR)

					gzip --stdout --best $(PACKAGE_NAME).tar > $(PACKAGE_NAME).tar.gz

				$(PACKAGE_NAME).tar.bz2: $(PACKAGE_NAME).tar

					bzip2 --stdout --best $(PACKAGE_NAME).tar > $(PACKAGE_NAME).tar.bz2

				$(PACKAGE_NAME).zip: parsers ../$(PACKAGE_DIR) manifest.txt

					rm -f $(PACKAGE_NAME).zip ; \

					cd .. ; \

					zip -q -@ $(PACKAGE_NAME).zip < $(PACKAGE_DIR)/manifest.txt ; \

					mv $(PACKAGE_NAME).zip $(PACKAGE_DIR)

				md5: $(ARCHIVES)

					@-md5sum $(PACKAGE_NAME).tar.gz

					@-md5sum $(PACKAGE_NAME).tar.bz2

					@-md5sum $(PACKAGE_NAME).zip

				.PHONY: tarballs md5

									
										10

autogen.sh
									
				@@ -3,17 +3,11 @@

				srcdir=`dirname "$0"`

				test -z "$srcdir" && srcdir=.

				SRCDIR=`(cd "$srcdir" && pwd)`

				ORIGDIR=`pwd`

				if test "x$SRCDIR" != "x$ORIGDIR"; then

					echo "Mesa cannot be built when srcdir != builddir" 1>&2

					exit 1

				fi

				MAKEFLAGS=""

				cd "$srcdir"

				autoreconf -v --install || exit 1

				cd $ORIGDIR || exit $?

				if test -z "$NOCONFIGURE"; then

				    "$srcdir"/configure "$@"

1

bin/.gitignore vendored

@@ -5,3 +5,4 @@ install-sh
 /missing
 ylwrap
 compile
 ar-lib

									
										48

bin/confdiff.sh
									
				@@ -1,48 +0,0 @@

				#!/bin/bash -e

				usage()

				{

					echo "Usage: $0 <target1> <target2>"

					echo "Highlight differences between Mesa configs"

					echo "Example:"

					echo "  $0 linux linux-x86"

				}

				die()

				{

					echo "$@" >&2

					return 1

				}

				case "$1" in

				-h|--help) usage; exit 0;;

				esac

				[ $# -lt 2 ] && die 2 targets needed. See $0 --help

				target1=$1

				target2=$2

				topdir=$(cd "`dirname $0`"/..; pwd)

				cd "$topdir"

				[ -f "./configs/$target1" ] || die Missing configs/$target1

				[ -f "./configs/$target2" ] || die Missing configs/$target2

				trap 'rm -f "$t1" "$t2"' 0

				t1=$(mktemp)

				t2=$(mktemp)

				make -f- -n -p <<EOF | sed '/^# Not a target/,/^$/d' > $t1

				TOP = .

				include \$(TOP)/configs/$target1

				default:

				EOF

				make -f- -n -p <<EOF | sed '/^# Not a target/,/^$/d' > $t2

				TOP = .

				include \$(TOP)/configs/$target2

				default:

				EOF

				diff -pu -I'^#' $t1 $t2

20

bin/extract_git_sha1

@@ -1,20 +0,0 @@
 #!/bin/sh
 if [ ! -f src/mesa/main/git_sha1.h ]; then
 	touch src/mesa/main/git_sha1.h
 fi
 if [ ! -d .git ]; then
 	exit
 fi
 if which git > /dev/null; then
     # Extract the 7-digit "short" SHA1 for the current HEAD, convert
     # it to a string, and wrap it in a #define.  This is used in
     # src/mesa/main/version.c to put the GIT SHA1 in the GL_VERSION string.
     git log -n 1 --oneline |\
 	sed 's/^\([^ ]*\) .*/#define MESA_GIT_SHA1 "git-\1"/' \
 	> src/mesa/main/git_sha1.h.tmp
     if ! cmp -s src/mesa/main/git_sha1.h.tmp src/mesa/main/git_sha1.h; then
     	mv src/mesa/main/git_sha1.h.tmp src/mesa/main/git_sha1.h
     fi
 fi
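
The comments in the removed `bin/extract_git_sha1` script describe taking the short SHA1 from `git log -n 1 --oneline` and wrapping it in a `#define`. As a rough illustration of what the `sed` expression does to one line of that output, here is a Python sketch; the function name `sha1_define` is hypothetical, and the sample input in the note below is made up from a hash in the commit list rather than captured from a real run.

```python
import re


def sha1_define(oneline: str) -> str:
    """Mimic the sed substitution from the script on one `--oneline` entry.

    Everything before the first space is treated as the abbreviated SHA1;
    the rest of the line (the commit subject) is discarded.
    """
    return re.sub(r'^([^ ]*) .*', r'#define MESA_GIT_SHA1 "git-\1"', oneline)
```

Under these assumptions, `sha1_define('2d2f1fd docs: Add some missing features')` produces `#define MESA_GIT_SHA1 "git-2d2f1fd"`, the form the script wrote to `src/mesa/main/git_sha1.h`.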

									
										23

bin/shortlog_mesa.sh
									
										Executable file
									
				@@ -0,0 +1,23 @@

				#!/bin/bash

				# This script is used to generate the list of changes that

				# appears in the release notes files, with HTML formatting.

				typeset -i in_log=0

				git shortlog $* | while read l

				do

				    if [ $in_log -eq 0 ]; then

					echo '<p>'$l'</p>'

					echo '<ul>'

					in_log=1

				    elif echo "$l" | egrep -q '^$' ; then

					echo '</ul>'

					echo

					in_log=0

				    else

				        mesg=$(echo $l | sed 's/ (cherry picked from commit [0-9a-f]\+)//;s/\&/&amp;/g;s/</\&lt;/g;s/>/\&gt;/g')

					echo '  <li>'${mesg}'</li>'

				    fi

				done
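
The new `bin/shortlog_mesa.sh` turns each `git shortlog` entry into an HTML list item: it drops a trailing "cherry picked from commit" annotation and escapes `&`, `<` and `>`. The per-entry step can be sketched in Python as below. The name `format_entry` is hypothetical, and the sketch covers only the `sed` substitutions, not the shell loop that emits the `<p>` and `<ul>` wrappers or the whitespace handling done by `read` and `echo`.

```python
import re


def format_entry(line: str) -> str:
    """Approximate the sed step of shortlog_mesa.sh for one shortlog entry."""
    # Remove the first cherry-pick annotation, as the sed expression
    # (which has no 'g' flag on that substitution) would.
    mesg = re.sub(r' \(cherry picked from commit [0-9a-f]+\)', '', line, count=1)
    # Escape '&' first so the entities added for '<' and '>' are not re-escaped.
    mesg = mesg.replace('&', '&amp;').replace('<', '&lt;').replace('>', '&gt;')
    return '  <li>' + mesg + '</li>'
```

Escaping the ampersand before the angle brackets mirrors the order of the substitutions in the script and avoids turning `&lt;` into `&amp;lt;`.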

									
										17

bin/version.mk
									
				@@ -1,17 +0,0 @@

				#!/usr/bin/make -sf

				# Print the various Mesa version fields. This is mostly used to add the

				# version to configure.

				# This reflects that this script is usually called from the toplevel

				TOP = .

				include $(TOP)/configs/default

				version:

					@echo $(MESA_VERSION)

				major:

					@echo $(MESA_MAJOR)

				minor:

					@echo $(MESA_MINOR)

				tiny:

					@echo $(MESA_TINY)

									
										3

common.py
									
				@@ -89,7 +89,7 @@ def AddOptions(opts):

					opts.Add(EnumOption('machine', 'use machine-specific assembly code', default_machine,

															 allowed_values=('generic', 'ppc', 'x86', 'x86_64')))

					opts.Add(EnumOption('platform', 'target platform', host_platform,

															 allowed_values=('linux', 'windows', 'darwin', 'cygwin', 'sunos', 'freebsd8', 'haiku')))

															 allowed_values=('cygwin', 'darwin', 'freebsd', 'haiku', 'linux', 'sunos', 'windows')))

					opts.Add(BoolOption('embedded', 'embedded build', 'no'))

					opts.Add('toolchain', 'compiler toolchain', default_toolchain)

					opts.Add(BoolOption('gles', 'EXPERIMENTAL: enable OpenGL ES support', 'no'))

				@@ -98,5 +98,6 @@ def AddOptions(opts):

					opts.Add(BoolOption('debug', 'DEPRECATED: debug build', 'yes'))

					opts.Add(BoolOption('profile', 'DEPRECATED: profile build', 'no'))

					opts.Add(BoolOption('quiet', 'DEPRECATED: profile build', 'yes'))

					opts.Add(BoolOption('texture_float', 'enable floating-point textures and renderbuffers', 'no'))

					if host_platform == 'windows':

						opts.Add(EnumOption('MSVS_VERSION', 'MS Visual C++ version', None, allowed_values=('7.1', '8.0', '9.0')))

27

configs/aix

@@ -1,27 +0,0 @@
 # Configuration for AIX, dynamic libs
 include $(TOP)/configs/default
 CONFIG_NAME = aix
 # Compiler and flags
 CC = cc
 CXX = xlC
 CFLAGS = -O -DAIXV3 -DPTHREADS
 CXXFLAGS = -O -DAIXV3 -DPTHREADS
 # Misc tools and flags
 MKLIB_OPTIONS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 GL_LIB_DEPS = -lX11 -lXext -lpthread -lm
 GLU_LIB_DEPS = -L$(TOP)/lib -l$(GL_LIB) -lm -lC
 GLW_LIB_DEPS = -L$(TOP)/lib -l$(GL_LIB) -lXm -lXt -lX11
 OSMESA_LIB_DEPS = -L$(TOP)/lib -l$(GL_LIB)

24

configs/aix-64

@@ -1,24 +0,0 @@
 # Configuration for AIX 64-bit, dynamic libs
 include $(TOP)/configs/default
 CONFIG_NAME = aix-64
 # Compiler and flags
 CC = xlc
 CXX = xlC
 CFLAGS = -q64 -qmaxmem=16384 -O -DAIXV3 -DPTHREADS
 CXXFLAGS = -q64 -qmaxmem=16384 -O -DAIXV3 -DPTHREADS
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 GL_LIB_DEPS = -lX11 -lXext -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm -lC
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lXm -lXt -lX11

21

configs/aix-64-static

@@ -1,21 +0,0 @@
 # Configuration for AIX, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = aix-64-static
 # Compiler and flags
 CC = cc
 CXX = xlC
 CFLAGS = -q64 -O -DAIXV3 -DPTHREADS
 CXXFLAGS = -q64 -O -DAIXV3 -DPTHREADS
 MKLIB_OPTIONS = -static
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

21

configs/aix-gcc

@@ -1,21 +0,0 @@
 # Configuration for AIX with gcc
 include $(TOP)/configs/default
 CONFIG_NAME = aix-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O2 -DAIXV3
 CXXFLAGS = -O2 -DAIXV3
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 MKLIB_OPTIONS = -arch aix-gcc
 GL_LIB_DEPS = -lX11 -lXext -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm

20

configs/aix-static

@@ -1,20 +0,0 @@
 # Configuration for AIX, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = aix-static
 # Compiler and flags
 CC = cc
 CXX = xlC
 CFLAGS = -O -DAIXV3 -DPTHREADS
 CXXFLAGS = -O -DAIXV3 -DPTHREADS
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

31

configs/bluegene-osmesa

@@ -1,31 +0,0 @@
 # Configuration for building only libOSMesa on BlueGene, no Xlib driver
 # This doesn't really have a lot of dependencies, so it should be usable
 # on other (gcc-based) systems too.
 # It uses static linking and disables multithreading.
 include $(TOP)/configs/default
 CONFIG_NAME = bluegene-osmesa
 # Compiler and flags
 CC = /bgl/BlueLight/ppcfloor/blrts-gnu/bin/powerpc-bgl-blrts-gnu-gcc
 CXX = /bgl/BlueLight/ppcfloor/blrts-gnu/bin/powerpc-bgl-blrts-gnu-g++
 CFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 CXXFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURC
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 MKLIB_OPTIONS = -static
 OSMESA_LIB_NAME = libOSMesa.a
 # Directories
 SRC_DIRS = mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

27

configs/bluegene-xlc-osmesa

@@ -1,27 +0,0 @@
 # Configuration for building only libOSMesa on BlueGene using the IBM xlc compiler
 # This doesn't really have a lot of dependencies, so it should be usable
 # on similar systems too.
 # It uses static linking and disables multithreading.
 include $(TOP)/configs/default
 CONFIG_NAME = bluegene-osmesa
 # Compiler and flags
 CC = /opt/ibmcmp/vacpp/bg/8.0/bin/blrts_xlc
 CXX = /opt/ibmcmp/vacpp/bg/8.0/bin/blrts_xlC
 CFLAGS = -O3 -pedantic -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 CXXFLAGS = -O3 -pedantic -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 MKLIB_OPTIONS = -static
 OSMESA_LIB_NAME = libOSMesa.a
 # Directories
 SRC_DIRS = mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

30

configs/catamount-osmesa-pgi

@@ -1,30 +0,0 @@
 # Configuration for building only libOSMesa on Cray Xt3
 # for the compute nodes running Catamount using the
 # Portland Group compiler. The Portland Group toolchain has to be
 # enabled before using "module switch PrgEnv-gnu PrgEnv-pgi" .
 # This doesn't really have a lot of dependencies, so it should be usable
 # on other similar systems too.
 # It uses static linking and disables multithreading.
 include $(TOP)/configs/default
 CONFIG_NAME = catamount-osmesa-pgi
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -target=catamount -fastsse -O3 -Mnontemporal -Mprefetch=distance:8,nta   -fPIC -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 CXXFLAGS = -target=catamount -fastsse -O3 -Mnontemporal -Mprefetch=distance:8,nta -fPIC -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 MKLIB_OPTIONS = -static
 OSMESA_LIB_NAME = libOSMesa.a
 # Directories
 SRC_DIRS = mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

49

configs/autoconf.in → configs/current.in

@@ -9,21 +9,17 @@ CONFIG_NAME = autoconf
 # Compiler and flags
 CC = @CC@
 CXX = @CXX@
 OPT_FLAGS = @OPT_FLAGS@
 ARCH_FLAGS = @ARCH_FLAGS@
 ASM_FLAGS = @ASM_FLAGS@
 PIC_FLAGS = @PIC_FLAGS@
 DEFINES = @DEFINES@
 API_DEFINES = @API_DEFINES@
 SHARED_GLAPI = @SHARED_GLAPI@
 CFLAGS_NOVISIBILITY = @CPPFLAGS@ @CFLAGS@ \
 	$(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(ASM_FLAGS) $(DEFINES)
 	$(PIC_FLAGS) $(DEFINES)
 CXXFLAGS_NOVISIBILITY = @CPPFLAGS@ @CXXFLAGS@ \
 	$(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 	$(PIC_FLAGS) $(DEFINES)
 CFLAGS = $(CFLAGS_NOVISIBILITY) @VISIBILITY_CFLAGS@
 CXXFLAGS = $(CXXFLAGS_NOVISIBILITY) @VISIBILITY_CXXFLAGS@
 LDFLAGS = @LDFLAGS@
 EXTRA_LIB_PATH = @EXTRA_LIB_PATH@
 RADEON_CFLAGS = @RADEON_CFLAGS@
 RADEON_LIBS = @RADEON_LIBS@
 NOUVEAU_CFLAGS = @NOUVEAU_CFLAGS@
@@ -34,21 +30,20 @@ X11_LIBS = @X11_LIBS@
 X11_CFLAGS = @X11_CFLAGS@
 LLVM_BINDIR = @LLVM_BINDIR@
 LLVM_CFLAGS = @LLVM_CFLAGS@
 LLVM_CPPFLAGS = @LLVM_CPPFLAGS@
 LLVM_CXXFLAGS = @LLVM_CXXFLAGS@
 LLVM_LDFLAGS = @LLVM_LDFLAGS@
 LLVM_LIBDIR = @LLVM_LIBDIR@
 LLVM_LIBS = @LLVM_LIBS@
 LLVM_INCLUDEDIR = @LLVM_INCLUDEDIR@
 GLW_CFLAGS = @GLW_CFLAGS@
 GLX_TLS = @GLX_TLS@
 DRI_CFLAGS = @DRI_CFLAGS@
 DRI_CXXFLAGS = @DRI_CXXFLAGS@
 # dlopen
 DLOPEN_LIBS = @DLOPEN_LIBS@
 # Source selection
 MESA_ASM_SOURCES = @MESA_ASM_SOURCES@
 GLAPI_ASM_SOURCES = @GLAPI_ASM_SOURCES@
 MESA_ASM_FILES = @MESA_ASM_FILES@
 # Misc tools and flags
 MAKE = @MAKE@
@@ -64,6 +59,10 @@ NM = @NM@
 # Perl
 PERL = @PERL@
 # Indent (used for generating dispatch tables)
 INDENT = @INDENT@
 INDENT_FLAGS = @INDENT_FLAGS@
 # Python and flags (generally only needed by the developers)
 PYTHON2 = @PYTHON2@
 PYTHON_FLAGS = -t -O -O
@@ -97,7 +96,6 @@ GLAPI_LIB_NAME = @GLAPI_LIB_NAME@
 GL_LIB_GLOB = @GL_LIB_GLOB@
 GLU_LIB_GLOB = @GLU_LIB_GLOB@
 GLW_LIB_GLOB = @GLW_LIB_GLOB@
 OSMESA_LIB_GLOB = @OSMESA_LIB_GLOB@
 EGL_LIB_GLOB = @EGL_LIB_GLOB@
 GLESv1_CM_LIB_GLOB = @GLESv1_CM_LIB_GLOB@
 GLESv2_LIB_GLOB = @GLESv2_LIB_GLOB@
@@ -107,7 +105,6 @@ GLAPI_LIB_GLOB = @GLAPI_LIB_GLOB@
 # Directories to build
 LIB_DIR = @LIB_DIR@
 SRC_DIRS = @SRC_DIRS@
 GLU_DIRS = @GLU_DIRS@
 DRIVER_DIRS = @DRIVER_DIRS@
 GALLIUM_DIRS = @GALLIUM_DIRS@
 GALLIUM_DRIVERS_DIRS = @GALLIUM_DRIVERS_DIRS@
@@ -119,9 +116,6 @@ GALLIUM_DRIVERS = $(foreach DIR,$(GALLIUM_DRIVERS_DIRS),$(TOP)/src/gallium/drive
 # Driver specific build vars
 DRI_DIRS = @DRI_DIRS@
 DRICORE_GLSL_LIBS = @DRICORE_GLSL_LIBS@
 DRICORE_LIBS = @DRICORE_LIBS@
 DRICORE_LIB_DEPS = @DRICORE_LIB_DEPS@
 EGL_PLATFORMS = @EGL_PLATFORMS@
 EGL_CLIENT_APIS = @EGL_CLIENT_APIS@
@@ -133,22 +127,22 @@ GLW_SOURCES = @GLW_SOURCES@
 MOTIF_CFLAGS = @MOTIF_CFLAGS@
 # Library/program dependencies
 GL_LIB_DEPS = $(EXTRA_LIB_PATH) @GL_LIB_DEPS@
 GL_LIB_DEPS = @GL_LIB_DEPS@
 OSMESA_LIB_DEPS = -L$(TOP)/$(LIB_DIR) @OSMESA_MESA_DEPS@ \
 	$(EXTRA_LIB_PATH) @OSMESA_LIB_DEPS@
 EGL_LIB_DEPS = $(EXTRA_LIB_PATH) @EGL_LIB_DEPS@
 	@OSMESA_LIB_DEPS@
 EGL_LIB_DEPS = @EGL_LIB_DEPS@
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) @GLU_MESA_DEPS@ \
 	$(EXTRA_LIB_PATH) @GLU_LIB_DEPS@
 	@GLU_LIB_DEPS@
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) @GLW_MESA_DEPS@ \
 	$(EXTRA_LIB_PATH) @GLW_LIB_DEPS@
 GLESv1_CM_LIB_DEPS = $(EXTRA_LIB_PATH) @GLESv1_CM_LIB_DEPS@
 GLESv2_LIB_DEPS = $(EXTRA_LIB_PATH) @GLESv2_LIB_DEPS@
 VG_LIB_DEPS = $(EXTRA_LIB_PATH) @VG_LIB_DEPS@
 GLAPI_LIB_DEPS = $(EXTRA_LIB_PATH) @GLAPI_LIB_DEPS@
 	@GLW_LIB_DEPS@
 GLESv1_CM_LIB_DEPS = @GLESv1_CM_LIB_DEPS@
 GLESv2_LIB_DEPS = @GLESv2_LIB_DEPS@
 VG_LIB_DEPS = @VG_LIB_DEPS@
 GLAPI_LIB_DEPS = @GLAPI_LIB_DEPS@
 # DRI dependencies
 MESA_MODULES = @MESA_MODULES@
 DRI_LIB_DEPS = $(EXTRA_LIB_PATH) @DRI_LIB_DEPS@
 DRI_LIB_DEPS = @DRI_LIB_DEPS@
 GALLIUM_DRI_LIB_DEPS = @GALLIUM_DRI_LIB_DEPS@
 LIBDRM_CFLAGS = @LIBDRM_CFLAGS@
 LIBDRM_LIB = @LIBDRM_LIBS@
 DRI2PROTO_CFLAGS = @DRI2PROTO_CFLAGS@
@@ -187,6 +181,9 @@ VA_LIB_INSTALL_DIR=@VA_LIB_INSTALL_DIR@
 # Xorg driver install directory (for xorg state-tracker)
 XORG_DRIVER_INSTALL_DIR = @XORG_DRIVER_INSTALL_DIR@
 # Path to OpenCL C library libclc
 LIBCLC_PATH = @LIBCLC_PATH@
 # pkg-config substitutions
 GL_PC_REQ_PRIV = @GL_PC_REQ_PRIV@
 GL_PC_LIB_PRIV = @GL_PC_LIB_PRIV@

61

configs/darwin

@@ -1,61 +0,0 @@
 # Configuration for Darwin / MacOS X, making dynamic libs
 include $(TOP)/configs/default
 CONFIG_NAME = darwin
 INSTALL_DIR = /usr/X11
 X11_DIR = $(INSTALL_DIR)
 # Compiler and flags
 CC = $(shell xcrun -find cc)
 CXX = $(shell xcrun -find c++)
 PIC_FLAGS = -fPIC
 DEFINES =  -D_DARWIN_C_SOURCE -DPTHREADS -D_GNU_SOURCE \
 	   -DGLX_ALIAS_UNSUPPORTED \
 	   -DGLX_DIRECT_RENDERING -DGLX_USE_APPLEGL
 # -DGLX_INDIRECT_RENDERING \
 # -D_GNU_SOURCE          - for src/mesa/main ...
 # -DGLX_DIRECT_RENDERING - pulls in libdrm stuff in glx
 # -DGLX_USE_APPLEGL      - supposed to be used with GLX_DIRECT_RENDERING to use AGL rather than DRM, but doesn't compile
 # -DIN_DRI_DRIVER
 ARCH_FLAGS += $(RC_CFLAGS)
 INCLUDE_FLAGS = -I$(INSTALL_DIR)/include -I$(X11_DIR)/include
 OPT_FLAGS = -g3 -gdwarf-2 -Os -ffast-math -fno-strict-aliasing
 WARN_FLAGS = -Wall -Wmissing-prototypes
 CFLAGS = -std=c99 -fvisibility=hidden \
 	$(OPT_FLAGS) $(WARN_FLAGS) $(INCLUDE_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(ASM_FLAGS) $(DEFINES) $(EXTRA_CFLAGS)
 CXXFLAGS = -fvisibility=hidden \
 	$(OPT_FLAGS) $(WARN_FLAGS) $(INCLUDE_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(ASM_FLAGS) $(DEFINES) $(EXTRA_CFLAGS)
 # Library names (actual file names)
 GL_LIB_NAME = lib$(GL_LIB).dylib
 GLU_LIB_NAME = lib$(GLU_LIB).dylib
 GLW_LIB_NAME = lib$(GLW_LIB).dylib
 OSMESA_LIB_NAME = lib$(OSMESA_LIB).dylib
 VG_LIB_NAME = lib$(VG_LIB).dylib
 # globs used to install the lib and all symlinks
 GL_LIB_GLOB = lib$(GL_LIB).*dylib
 GLU_LIB_GLOB = lib$(GLU_LIB).*dylib
 GLW_LIB_GLOB = lib$(GLW_LIB).*dylib
 OSMESA_LIB_GLOB = lib$(OSMESA_LIB).*dylib
 VG_LIB_GLOB = lib$(VG_LIB).*dylib
 GL_LIB_DEPS = -L$(INSTALL_DIR)/$(LIB_DIR) -L$(X11_DIR)/$(LIB_DIR) -lX11-xcb -lxcb -lX11 -lXext $(EXTRA_LDFLAGS)
 OSMESA_LIB_DEPS = $(EXTRA_LDFLAGS)
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) $(EXTRA_LDFLAGS)
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -L$(INSTALL_DIR)/$(LIB_DIR) -L$(X11_DIR)/$(LIB_DIR) -lX11 -lXt $(EXTRA_LDFLAGS)
 SRC_DIRS = glsl mapi/glapi mapi/vgapi glx/apple mesa glu
 GLU_DIRS = sgi
 DRIVER_DIRS = osmesa
 #DRIVER_DIRS = dri
 DRI_DIRS = swrast
 #GALLIUM_DRIVERS_DIRS = softpipe trace rbug noop identity galahad
 #GALLIUM_DRIVERS_DIRS += llvmpipe

7

configs/darwin-fat-32bit

@@ -1,7 +0,0 @@
 # Configuration for Darwin / MacOS X, making 32bit fat dynamic libs
 RC_CFLAGS=-arch ppc -arch i386
 include $(TOP)/configs/darwin
 CONFIG_NAME = darwin-fat-32bit

7

configs/darwin-fat-all

@@ -1,7 +0,0 @@
 # Configuration for Darwin / MacOS X, making 32bit and 64bit fat dynamic libs
 RC_CFLAGS=-arch ppc -arch i386 -arch ppc64 -arch x86_64
 include $(TOP)/configs/darwin
 CONFIG_NAME = darwin-fat-all

7

configs/darwin-fat-intel

@@ -1,7 +0,0 @@
 # Configuration for Darwin / MacOS X, making 32bit and 64bit fat dynamic libs for intel
 RC_CFLAGS=-arch i386 -arch x86_64
 include $(TOP)/configs/darwin
 CONFIG_NAME = darwin-fat-intel

44

configs/default

@@ -8,8 +8,8 @@
 CONFIG_NAME = default
 # Version info
 MESA_MAJOR=8
 MESA_MINOR=1
 MESA_MAJOR=9
 MESA_MINOR=0
 MESA_TINY=0
 MESA_VERSION = $(MESA_MAJOR).$(MESA_MINOR).$(MESA_TINY)
@@ -19,11 +19,9 @@ DRM_SOURCE_PATH=$(TOP)/../drm
 # Compiler and flags
 CC = cc
 CXX = CC
 HOST_CC = $(CC)
 CFLAGS = -O
 CXXFLAGS = -O
 LDFLAGS =
 HOST_CFLAGS = $(CFLAGS)
 GLU_CFLAGS =
 GLX_TLS = no
@@ -78,18 +76,14 @@ GLAPI_LIB_NAME = lib$(GLAPI_LIB).so
 GL_LIB_GLOB = $(GL_LIB_NAME)*
 GLU_LIB_GLOB = $(GLU_LIB_NAME)*
 GLW_LIB_GLOB = $(GLW_LIB_NAME)*
 OSMESA_LIB_GLOB = $(OSMESA_LIB_NAME)*
 EGL_LIB_GLOB = $(EGL_LIB_NAME)*
 GLESv1_CM_LIB_GLOB = $(GLESv1_CM_LIB_NAME)*
 GLESv2_LIB_GLOB = $(GLESv2_LIB_NAME)*
 VG_LIB_GLOB = $(VG_LIB_NAME)*
 GLAPI_LIB_GLOB = $(GLAPI_LIB_NAME)*
 DRI_CFLAGS = $(CFLAGS)
 DRI_CXXFLAGS = $(CXXFLAGS)
 # Optional assembly language optimization files for libGL
 MESA_ASM_SOURCES =
 MESA_ASM_FILES =
 # GLw widget sources (Append "GLwMDrawA.c" here and add -lXm to GLW_LIB_DEPS in
 # order to build the Motif widget too)
@@ -101,7 +95,6 @@ MOTIF_CFLAGS = -I/usr/include/Motif1.2
 LIB_DIR = lib
 SRC_DIRS = glsl mapi/glapi mapi/vgapi mesa \
 	gallium egl gallium/winsys gallium/targets glu
 GLU_DIRS = sgi
 DRIVER_DIRS = x11 osmesa
 # Gallium directories and
@@ -119,15 +112,15 @@ EGL_CLIENT_APIS = $(GL_LIB)
 # Library dependencies
 #EXTRA_LIB_PATH ?=
 GL_LIB_DEPS     = $(EXTRA_LIB_PATH) -lX11 -lXext -lm -lpthread
 EGL_LIB_DEPS    = $(EXTRA_LIB_PATH) -ldl -lpthread
 OSMESA_LIB_DEPS = $(EXTRA_LIB_PATH) -L$(TOP)/$(LIB_DIR) -l$(GL_LIB)
 GLU_LIB_DEPS    = $(EXTRA_LIB_PATH) -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm
 GLW_LIB_DEPS    = $(EXTRA_LIB_PATH) -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lXt -lX11
 GLESv1_CM_LIB_DEPS = $(EXTRA_LIB_PATH) -lpthread
 GLESv2_LIB_DEPS = $(EXTRA_LIB_PATH) -lpthread
 VG_LIB_DEPS    = $(EXTRA_LIB_PATH) -lpthread
 GLAPI_LIB_DEPS = $(EXTRA_LIB_PATH) -lpthread
 GL_LIB_DEPS     = -lX11 -lXext -lm -lpthread
 EGL_LIB_DEPS    = -ldl -lpthread
 OSMESA_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB)
 GLU_LIB_DEPS    = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm
 GLW_LIB_DEPS    = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lXt -lX11
 GLESv1_CM_LIB_DEPS = -lpthread
 GLESv2_LIB_DEPS = -lpthread
 VG_LIB_DEPS    = -lpthread
 GLAPI_LIB_DEPS = -lpthread
 # Program dependencies - specific GL libraries added in Makefiles
 X11_LIBS = -lX11
@@ -172,3 +165,16 @@ GLESv2_PC_CFLAGS =
 VG_PC_REQ_PRIV =
 VG_PC_LIB_PRIV =
 VG_PC_CFLAGS =
 # default targets
 # this helps reduce the mismatch between our automake Makefiles and the old
 # custom Makefiles while we transition.
 all: default
 am--refresh:
 distclean: clean
 check:
 test:

29

configs/freebsd

@@ -1,29 +0,0 @@
 # Configuration for FreeBSD
 include $(TOP)/configs/default
 CONFIG_NAME = FreeBSD
 # Compiler and flags
 CC = cc
 CXX = c++
 MAKE = gmake
 OPT_FLAGS  = -O2
 PIC_FLAGS  = -fPIC
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_BSD_SOURCE -DUSE_XSHM \
 	-DHZ=100
 X11_INCLUDES = -I/usr/local/include
 CFLAGS += $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(DEFINES) $(X11_INCLUDES) -ffast-math -pedantic
 CXXFLAGS += $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(DEFINES) $(X11_INCLUDES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 EXTRA_LIB_PATH = -L/usr/local/lib

48

configs/freebsd-dri

@@ -1,48 +0,0 @@
 # -*-makefile-*-
 # Configuration for freebsd-dri: FreeBSD DRI hardware drivers
 include $(TOP)/configs/freebsd
 CONFIG_NAME = freebsd-dri
 # Compiler and flags
 CC = gcc
 CXX = g++
 WARN_FLAGS = -Wall
 OPT_FLAGS = -O -g
 EXPAT_INCLUDES = -I/usr/local/include
 X11_INCLUDES = -I/usr/local/include
 DEFINES = -DPTHREADS -DUSE_EXTERNAL_DXTN_LIB=1 -DIN_DRI_DRIVER \
 	-DGLX_DIRECT_RENDERING -DGLX_INDIRECT_RENDERING \
 	-DHAVE_ALIAS
 CFLAGS = $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) -Wmissing-prototypes -std=c99 -Wundef -ffast-math \
 	$(ASM_FLAGS) $(X11_INCLUDES) $(DEFINES)
 CXXFLAGS = $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(DEFINES) -Wall -ansi -pedantic $(ASM_FLAGS) $(X11_INCLUDES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 ASM_SOURCES =
 MESA_ASM_SOURCES =
 # Library/program dependencies
 MESA_MODULES  = $(TOP)/src/mesa/libmesa.a
 LIBDRM_CFLAGS = `$(PKG_CONFIG) --cflags libdrm`
 LIBDRM_LIB = `$(PKG_CONFIG) --libs libdrm`
 DRI_LIB_DEPS = $(MESA_MODULES) -L/usr/local/lib -lm -pthread -lexpat $(LIBDRM_LIB)
 GL_LIB_DEPS = -L/usr/local/lib -lX11 -lXext -lXxf86vm -lXdamage -lXfixes \
 	-lm -pthread $(LIBDRM_LIB)
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -L/usr/local/lib -lGL -lXt -lX11
 # Directories
 SRC_DIRS = glx gallium mesa glu
 DRIVER_DIRS = dri
 DRM_SOURCE_PATH=$(TOP)/../drm

10

configs/freebsd-dri-amd64

@@ -1,10 +0,0 @@
 # -*-makefile-*-
 # Configuration for freebsd-dri-amd64: FreeBSD DRI hardware drivers
 include $(TOP)/configs/freebsd-dri
 CONFIG_NAME = freebsd-dri-x86-64
 ASM_FLAGS = -DUSE_X86_64_ASM
 MESA_ASM_SOURCES = $(X86-64_SOURCES)
 GLAPI_ASM_SOURCES = $(X86-64_API)

13

configs/freebsd-dri-x86

@@ -1,13 +0,0 @@
 # -*-makefile-*-
 # Configuration for freebsd-dri: FreeBSD DRI hardware drivers
 include $(TOP)/configs/freebsd-dri
 CONFIG_NAME = freebsd-dri-x86
 # Unnecessary on x86, generally.
 PIC_FLAGS =
 ASM_FLAGS = -DUSE_X86_ASM -DUSE_MMX_ASM -DUSE_3DNOW_ASM -DUSE_SSE_ASM
 MESA_ASM_SOURCES = $(X86_SOURCES)
 GLAPI_ASM_SOURCES = $(X86_API)

13

configs/hpux10

@@ -1,13 +0,0 @@
 # Configuration for HPUX v10, shared libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux10
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DAportable +z -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM
 CXXFLAGS = -O +DAportable +Z -Ae -D_HPUX_SOURCE

18

configs/hpux10-gcc

@@ -1,18 +0,0 @@
 # Configuration for HPUX v10, with gcc
 include $(TOP)/configs/default
 CONFIG_NAME = hpux10-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -ansi -O3 -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include  -DUSE_XSHM
 CXXFLAGS = -ansi -O3 -D_HPUX_SOURCE
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing

26

configs/hpux10-static

@@ -1,26 +0,0 @@
 # Configuration for HPUX v10, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux10-static
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DAportable +z -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM
 CXXFLAGS = -O +DAportable +Z -Ae -D_HPUX_SOURCE
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies (static libs don't have dependencies)
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

27

configs/hpux11-32

@@ -1,27 +0,0 @@
 # Configuration for HPUX v11
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-32
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = +z -Ae -O +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = +z -Ae -O +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies
 GL_LIB_DEPS = -L/usr/lib/X11R6/ -L/usr/contrib/X11R6/lib/ -lXext -lXt -lXi -lX11 -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm -lCsup -lcl
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) $(GL_LIB_DEPS)

25

configs/hpux11-32-static

@@ -1,25 +0,0 @@
 # Configuration for HPUX v11, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-32-static
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DA2.0 -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -O +DA2.0 -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies

24

configs/hpux11-32-static-nothreads

@@ -1,24 +0,0 @@
 # Configuration for HPUX v11, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-32-static
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DA2.0 -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM
 CXXFLAGS = -O +DA2.0 -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies

28

configs/hpux11-64

@@ -1,28 +0,0 @@
 # Configuration for HPUX v11, 64-bit
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-64
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = +z -Ae +DD64 -O +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = +z -Ae +DD64 -O +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS =
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies
 GL_LIB_DEPS = -L/usr/lib/X11R6/pa20_64 -L/usr/contrib/X11R6/lib/pa20_64 -lXext -lXmu -lXt -lXi -lX11 -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm -lCsup -lcl
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) $(GL_LIB_DEPS)

25

configs/hpux11-64-static

@@ -1,25 +0,0 @@
 # Configuration for HPUX v11, 64-bit, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-64-static
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DA2.0W -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -O +DA2.0W -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS = -static
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies

28

configs/hpux11-ia64

@@ -1,28 +0,0 @@
 # Configuration for HPUX IA64 v11, 64-bit
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-ia64
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = +z -Ae +DD64 -O +DSmckinley -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = +z -Ae +DD64 -O +DSmckinley -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS =
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.so
 GLU_LIB_NAME = libGLU.so
 GLW_LIB_NAME = libGLw.so
 OSMESA_LIB_NAME = libOSMesa.so
 # Library/program dependencies
 GL_LIB_DEPS = -L/usr/lib/X11R6/ -L/usr/contrib/X11R6/lib/ -lXext -lXmu -lXt -lXi -lX11 -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm -lCsup -lcl
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) $(GL_LIB_DEPS)

25

configs/hpux11-ia64-static

@@ -1,25 +0,0 @@
 # Configuration for HPUX v11, 64-bit, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-ia64-static
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DD64 -Ae -D_HPUX_SOURCE +DSmckinley -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -O +DD64 -Ae -D_HPUX_SOURCE +DSmckinley -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS = -static
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies

15

configs/hpux9

@@ -1,15 +0,0 @@
 # Configuration for HPUX v9, shared libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux9
 # Compiler and flags
 CC = cc
 # XXX fix this
 CXX = c++
 CFLAGS = +z -O +Olibcalls +ESlit -Ae +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R5 -DUSE_XSHM
 CXXFLAGS = +z -O +Olibcalls +ESlit -Ae +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R5

13

configs/hpux9-gcc

@@ -1,13 +0,0 @@
 # Configuration for HPUX v10, shared libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux9-gcc
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DAportable +z -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM
 CXXFLAGS = -O +DAportable +Z -Ae -D_HPUX_SOURCE

16

configs/irix6-64

@@ -1,16 +0,0 @@
 # Configuration for IRIX 6.x, make n64 DSOs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-64
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -64 -O3 -ansi -woff 1068,1069,1174,1185,1209,1474,1552 -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -64 -O3 -ansi -woff 1174 -DPTHREADS
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib64

24

configs/irix6-64-static

@@ -1,24 +0,0 @@
 # Configuration for IRIX 6.x, make n64 static libs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-64-static
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -64 -O3 -ansi -woff 1068,1069,1174,1185,1209,1474,1552 -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -64 -O3 -ansi -woff 1174 -DPTHREADS
 MKLIB_OPTIONS = -static
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib64
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

16

configs/irix6-n32

@@ -1,16 +0,0 @@
 # Configuration for IRIX 6.x, make n32 DSOs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-n32
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -n32 -mips3 -O3 -ansi -woff 1174,1521,1552 -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -n32 -mips3 -O3 -ansi -woff 1174,1552 -DPTHREADS
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib32

23

configs/irix6-n32-static

@@ -1,23 +0,0 @@
 # Configuration for IRIX 6.x, make n32 static libs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-n32-static
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -n32 -mips2 -O2 -ansi -woff 1521,1552 -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -n32 -mips2 -O2 -ansi -woff 3262,3666 -DPTHREADS
 MKLIB_OPTIONS = -static
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib32
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

17

configs/irix6-o32

@@ -1,17 +0,0 @@
 # Configuration for IRIX 6.x, make o32 DSOs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-o32
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -32 -mips2 -O2 -ansi -woff 1521,1552 -DUSE_XSHM
 CXXFLAGS = -32 -mips2 -O2 -ansi -woff 3262,3666
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib32

23

configs/irix6-o32-static

@@ -1,23 +0,0 @@
 # Configuration for IRIX 6.x, make o32 static libs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-o32-static
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -32 -mips2 -O2 -ansi -woff 1521,1552 -DUSE_XSHM
 CXXFLAGS = -32 -mips2 -O2 -ansi -woff 3262,3666
 MKLIB_OPTIONS = -static
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib32
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

37

configs/linux

@@ -1,37 +0,0 @@
 # Configuration for generic Linux
 include $(TOP)/configs/default
 CONFIG_NAME = linux
 # Compiler and flags
 CC = gcc
 CXX = g++
 OPT_FLAGS  = -O3 -g
 PIC_FLAGS  = -fPIC
 # Add '-DGLX_USE_TLS' to ARCH_FLAGS to enable TLS support.  Add -m32
 # to build properly on 64-bit platforms.
 ARCH_FLAGS ?=
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE \
 	-D_BSD_SOURCE -D_GNU_SOURCE \
 	-DPTHREADS -DUSE_XSHM -DHAVE_POSIX_MEMALIGN
 X11_INCLUDES = -I/usr/X11R6/include
 CFLAGS = -Wall -Wmissing-prototypes -Wdeclaration-after-statement \
 	-Wpointer-arith $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) \
 	$(DEFINES) $(ASM_FLAGS) $(X11_INCLUDES) -std=c99 -ffast-math
 CXXFLAGS = -Wall -Wpointer-arith $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) \
 	$(DEFINES) $(X11_INCLUDES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 EXTRA_LIB_PATH = -L/usr/X11R6/lib

19

configs/linux-alpha

@@ -1,19 +0,0 @@
 # Configuration for Linux on Alpha
 include $(TOP)/configs/default
 CONFIG_NAME = linux-alpha
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -mcpu=ev5 -ansi -mieee -pedantic -fPIC -D_XOPEN_SOURCE -DUSE_XSHM
 CXXFLAGS = -O3 -mcpu=ev5 -ansi -mieee -pedantic -fPIC -D_XOPEN_SOURCE
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 GL_LIB_DEPS = -L/usr/X11R6/lib -lX11 -lXext -lm -lpthread
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -L/usr/X11R6/lib -lXt -lX11

27

configs/linux-alpha-static

@@ -1,27 +0,0 @@
 # Configuration for Linux on Alpha, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = linux-alpha-static
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -mcpu=ev5 -ansi -mieee -pedantic -D_XOPEN_SOURCE -DUSE_XSHM
 CXXFLAGS = -O3 -mcpu=ev5 -ansi -mieee -pedantic -D_XOPEN_SOURCE
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 GL_LIB_DEPS = -L/usr/X11R6/lib -lX11 -lXext -lm -lpthread
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -L/usr/X11R6/lib -lXt -lX11

9

configs/linux-debug

@@ -1,9 +0,0 @@
 # Configuration for debugging on Linux
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-debug
 OPT_FLAGS = -g
 #CFLAGS += -pedantic
 DEFINES += -DDEBUG -DDEBUG_MATH

72

configs/linux-dri

@@ -1,72 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/default
 CONFIG_NAME = linux-dri
 # Compiler and flags
 CC = gcc
 CXX = g++
 #MKDEP = /usr/X11R6/bin/makedepend
 #MKDEP = gcc -M
 #MKDEP_OPTIONS = -MF depend
 OPT_FLAGS  = -O2 -g
 PIC_FLAGS  = -fPIC
 # Add '-DGLX_USE_TLS' to ARCH_FLAGS to enable TLS support.
 ARCH_FLAGS ?=
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE \
 	-D_BSD_SOURCE -D_GNU_SOURCE \
 	-DPTHREADS -DUSE_EXTERNAL_DXTN_LIB=1 -DIN_DRI_DRIVER \
 	-DGLX_DIRECT_RENDERING -DGLX_INDIRECT_RENDERING \
 	-DHAVE_ALIAS -DHAVE_POSIX_MEMALIGN
 X11_INCLUDES = -I/usr/X11R6/include
 CFLAGS = -Wall -Wmissing-prototypes -std=c99 -ffast-math \
 	$(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES) $(ASM_FLAGS)
 CXXFLAGS = -Wall $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 MESA_ASM_SOURCES =
 # Library/program dependencies
 EXTRA_LIB_PATH=-L/usr/X11R6/lib
 MESA_MODULES  = $(TOP)/src/mesa/libmesa.a
 LIBDRM_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm)
 LIBDRM_LIB = $(shell $(PKG_CONFIG) --libs libdrm)
 DRI_LIB_DEPS  = $(MESA_MODULES) $(EXTRA_LIB_PATH) -lm -lpthread -lexpat -ldl $(LIBDRM_LIB)
 GL_LIB_DEPS   = $(EXTRA_LIB_PATH) -lX11 -lXext -lXxf86vm -lXdamage -lXfixes \
 		-lm -lpthread -ldl $(LIBDRM_LIB)
 # Directories
 SRC_DIRS := glx egl $(SRC_DIRS)
 DRIVER_DIRS = dri
 GALLIUM_WINSYS_DIRS = sw sw/xlib drm/vmware drm/intel svga/drm
 GALLIUM_TARGET_DIRS = dri-vmwgfx
 GALLIUM_STATE_TRACKERS_DIRS = egl dri
 DRI_DIRS = swrast
 INTEL_LIBS = $(shell $(PKG_CONFIG) --libs libdrm_intel)
 INTEL_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm_intel)
 NOUVEAU_LIBS = $(shell $(PKG_CONFIG) --libs libdrm_nouveau)
 NOUVEAU_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm_nouveau)
 RADEON_LIBS = $(shell $(PKG_CONFIG) --libs libdrm_radeon)
 RADEON_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm_radeon)
 RADEON_LDFLAGS = $(LIBDRM_RADEON_LIBS)

8

configs/linux-dri-debug

@@ -1,8 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri-debug: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/linux-dri
 CONFIG_NAME = linux-dri-debug
 OPT_FLAGS  = -O0 -g
 ARCH_FLAGS = -DDEBUG

9

configs/linux-dri-ppc

@@ -1,9 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/linux-dri
 CONFIG_NAME = linux-dri-ppc
 OPT_FLAGS = -Os -mcpu=603
 PIC_FLAGS = -fPIC

13

configs/linux-dri-x86

@@ -1,13 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/linux-dri
 CONFIG_NAME = linux-dri-x86
 ARCH_FLAGS = -m32 -mmmx -msse -msse2
 ASM_FLAGS = -DUSE_X86_ASM -DUSE_MMX_ASM -DUSE_3DNOW_ASM -DUSE_SSE_ASM
 MESA_ASM_SOURCES = $(X86_SOURCES)
 GLAPI_ASM_SOURCES = $(X86_API)

17

configs/linux-dri-x86-64

@@ -1,17 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/linux-dri
 CONFIG_NAME = linux-dri-x86-64
 ARCH_FLAGS = -m64
 ASM_FLAGS = -DUSE_X86_64_ASM
 MESA_ASM_SOURCES = $(X86-64_SOURCES)
 GLAPI_ASM_SOURCES = $(X86-64_API)
 LIB_DIR = lib64
 # Library/program dependencies
 EXTRA_LIB_PATH=-L/usr/X11R6/lib64

54

configs/linux-dri-xcb

@@ -1,54 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/default
 CONFIG_NAME = linux-dri-xcb
 # Compiler and flags
 CC = gcc
 CXX = g++
 #MKDEP = /usr/X11R6/bin/makedepend
 #MKDEP = gcc -M
 #MKDEP_OPTIONS = -MF depend
 OPT_FLAGS  = -g
 PIC_FLAGS  = -fPIC
 # Add '-DGLX_USE_TLS' to ARCH_FLAGS to enable TLS support.
 ARCH_FLAGS ?=
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE \
 	-D_BSD_SOURCE -D_GNU_SOURCE \
 	-DPTHREADS -DUSE_EXTERNAL_DXTN_LIB=1 -DIN_DRI_DRIVER \
 	-DGLX_DIRECT_RENDERING -DGLX_INDIRECT_RENDERING \
         -DHAVE_ALIAS -DUSE_XCB -DHAVE_POSIX_MEMALIGN
 X11_INCLUDES = $(shell $(PKG_CONFIG) --cflags-only-I x11) $(shell $(PKG_CONFIG) --cflags-only-I xcb) $(shell $(PKG_CONFIG) --cflags-only-I x11-xcb) $(shell $(PKG_CONFIG) --cflags-only-I xcb-glx)
 CFLAGS = -Wall -Wmissing-prototypes $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) \
 	$(DEFINES) $(ASM_FLAGS) -std=c99 -ffast-math
 CXXFLAGS = -Wall $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 MESA_ASM_SOURCES =
 # Library/program dependencies
 EXTRA_LIB_PATH=$(shell $(PKG_CONFIG) --libs-only-L x11)
 MESA_MODULES  = $(TOP)/src/mesa/libmesa.a
 LIBDRM_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm)
 LIBDRM_LIB = $(shell $(PKG_CONFIG) --libs libdrm)
 DRI_LIB_DEPS  = $(MESA_MODULES) $(EXTRA_LIB_PATH) -lm -lpthread -lexpat -ldl $(LIBDRM_LIB)
 GL_LIB_DEPS   = $(EXTRA_LIB_PATH) -lX11 -lXext -lXxf86vm -lm -lpthread -ldl \
                 $(LIBDRM_LIB) $(shell $(PKG_CONFIG) --libs xcb) $(shell $(PKG_CONFIG) --libs x11-xcb) $(shell $(PKG_CONFIG) --libs xcb-glx)
 SRC_DIRS = glx gallium mesa glu
 DRIVER_DIRS = dri

58

configs/linux-egl

@@ -1,58 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/default
 CONFIG_NAME = linux-dri
 # Compiler and flags
 CC = gcc
 CXX = g++
 #MKDEP = /usr/X11R6/bin/makedepend
 #MKDEP = gcc -M
 #MKDEP_OPTIONS = -MF depend
 OPT_FLAGS  = -O -g
 PIC_FLAGS  = -fPIC
 # Add '-DGLX_USE_TLS' to ARCH_FLAGS to enable TLS support.
 ARCH_FLAGS ?=
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE \
 	-D_BSD_SOURCE -D_GNU_SOURCE \
 	-DPTHREADS -DUSE_EXTERNAL_DXTN_LIB=1 -DIN_DRI_DRIVER \
 	-DGLX_DIRECT_RENDERING -DGLX_INDIRECT_RENDERING \
 	-DHAVE_ALIAS -DHAVE_POSIX_MEMALIGN
 X11_INCLUDES = -I/usr/X11R6/include
 CFLAGS = -Wall -Wmissing-prototypes -std=c99 -ffast-math \
 	$(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES) $(ASM_FLAGS)
 CXXFLAGS = -Wall $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 MESA_ASM_SOURCES =
 # Library/program dependencies
 EXTRA_LIB_PATH=-L/usr/X11R6/lib
 MESA_MODULES  = $(TOP)/src/mesa/libmesa.a
 LIBDRM_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm)
 LIBDRM_LIB = $(shell $(PKG_CONFIG) --libs libdrm)
 DRI_LIB_DEPS  = $(MESA_MODULES) $(EXTRA_LIB_PATH) -lm -lpthread -lexpat -ldl $(LIBDRM_LIB)
 GL_LIB_DEPS   = $(EXTRA_LIB_PATH) -lX11 -lXext -lXxf86vm -lXdamage -lXfixes \
 		-lm -lpthread -ldl \
                 $(LIBDRM_LIB)
 # Directories
 SRC_DIRS = gallium mesa gallium/winsys gallium/targets glu egl
 DRIVER_DIRS = dri
 GALLIUM_WINSYS_DIRS = egl_drm
 GALLIUM_TARGET_DIRS =
 DRI_DIRS = intel

18

configs/linux-ia64-icc

@@ -1,18 +0,0 @@
 # Configuration for Linux with Intel C compiler
 include $(TOP)/configs/default
 CONFIG_NAME = linux-icc
 # Compiler and flags
 CC = icc
 CXX = icpc
 CFLAGS = -O3 -ansi -KPIC -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include
 CXXFLAGS = -O3 -ansi -KPIC -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include
 MKLIB_OPTIONS = -arch icc-istatic
 GL_LIB_DEPS = -L/usr/X11R6/lib -lX11 -lXext -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB)
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) $(GL_LIB_DEPS)

23

configs/linux-ia64-icc-static

@@ -1,23 +0,0 @@
 # Configuration for Linux with Intel C compiler, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = linux-icc-static
 # Compiler and flags
 CC = icc
 CXX = icpc
 CFLAGS = -O3 -ansi -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include
 CXXFLAGS = -O3 -ansi -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include
 MKLIB_OPTIONS = -static -arch icc-istatic
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 GL_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

19

configs/linux-icc

@@ -1,19 +0,0 @@
 # Configuration for Linux with Intel C compiler
 include $(TOP)/configs/default
 CONFIG_NAME = linux-icc
 # Compiler and flags
 CC = icc
 CXX = g++
 CFLAGS = -O3 -tpp6 -axK -KPIC -D_GCC_LIMITS_H_ -D__GNUC__ -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DUSE_X86_ASM -DUSE_MMX_ASM -DUSE_3DNOW_ASM -DUSE_SSE_ASM -DPTHREADS -I/usr/X11R6/include
 CXXFLAGS = -O3
 MKLIB_OPTIONS = -arch icc
 GL_LIB_DEPS = -L/usr/X11R6/lib -lX11 -lXext -lm -lpthread
 MESA_ASM_SOURCES = $(X86_SOURCES)
 GLAPI_ASM_SOURCES = $(X86_API)

23

configs/linux-icc-static

@@ -1,23 +0,0 @@
 # Configuration for Linux with Intel C compiler, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = linux-icc-static
 # Compiler and flags
 CC = icc
 CXX = icpc
 CFLAGS = -O3 -tpp6 -axK -D_GCC_LIMITS_H_ -D__GNUC__ -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DUSE_X86_ASM -DUSE_MMX_ASM -DUSE_3DNOW_ASM -DUSE_SSE_ASM -DPTHREADS -I/usr/X11R6/include
 CXXFLAGS = -O3 -tpp6 -axK -DPTHREADS
 MKLIB_OPTIONS = -static -arch icc
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 GL_LIB_DEPS =
 MESA_ASM_SOURCES = $(X86_SOURCES)
 GLAPI_ASM_SOURCES = $(X86_API)

52

configs/linux-indirect

@@ -1,52 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-indirect: Builds a libGL capable of indirect
 # rendering, but *NOT* capable of direct rendering.
 include $(TOP)/configs/default
 CONFIG_NAME = linux-dri
 # Compiler and flags
 CC = gcc
 CXX = g++
 #MKDEP = /usr/X11R6/bin/makedepend
 #MKDEP = gcc -M
 #MKDEP_OPTIONS = -MF depend
 WARN_FLAGS = -Wall
 OPT_FLAGS  = -O -g
 PIC_FLAGS  = -fPIC
 # Add '-DGLX_USE_TLS' to ARCH_FLAGS to enable TLS support.
 ARCH_FLAGS ?=
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE \
 	-D_BSD_SOURCE -D_GNU_SOURCE \
 	-DGLX_INDIRECT_RENDERING \
 	-DPTHREADS -DHAVE_ALIAS -DHAVE_POSIX_MEMALIGN
 X11_INCLUDES = -I/usr/X11R6/include
 CFLAGS   = $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES) \
 	$(ASM_FLAGS) -std=c99 -ffast-math
 CXXFLAGS = $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 MESA_ASM_SOURCES =
 # Library/program dependencies
 EXTRA_LIB_PATH=-L/usr/X11R6/lib
 MESA_MODULES  = $(TOP)/src/mesa/libmesa.a
 DRI_LIB_DEPS  = $(MESA_MODULES) $(EXTRA_LIB_PATH) -lm -lpthread -lexpat -ldl
 GL_LIB_DEPS   = $(EXTRA_LIB_PATH) -lX11 -lXext -lXxf86vm -lm -lpthread -ldl
 # Directories
 SRC_DIRS = glx glu
 DRIVER_DIRS =

47

configs/linux-llvm

@@ -1,47 +0,0 @@
 # -*-makefile-*-
 # Configuration for Linux and LLVM with optimizations
 # Builds the llvmpipe gallium driver
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-llvm
 # Add llvmpipe driver
 GALLIUM_DRIVERS_DIRS += llvmpipe
 OPT_FLAGS = -O3 -ansi -pedantic
 ARCH_FLAGS = -mmmx -msse -msse2 -mstackrealign
 DEFINES += -DNDEBUG -DGALLIUM_LLVMPIPE
 # override -std=c99
 CFLAGS += -std=gnu99
 LLVM_VERSION := $(shell llvm-config --version)
 ifeq ($(LLVM_VERSION),)
   $(warning Could not find LLVM! Make Sure 'llvm-config' is in the path)
   MESA_LLVM=0
 else
   MESA_LLVM=1
   HAVE_LLVM := 0x0$(subst .,0,$(LLVM_VERSION:svn=))
   DEFINES += -DHAVE_LLVM=$(HAVE_LLVM)
 #  $(info Using LLVM version: $(LLVM_VERSION))
 endif
 ifeq ($(MESA_LLVM),1)
   LLVM_CFLAGS=`llvm-config --cppflags|sed 's/-DNDEBUG\>//g'`
   LLVM_CXXFLAGS=`llvm-config --cxxflags` -Wno-long-long
   LLVM_LDFLAGS = $(shell llvm-config --ldflags)
   LLVM_LIBS = $(shell llvm-config --libs)
   MKLIB_OPTIONS=-cplusplus
 else
   LLVM_CFLAGS=
   LLVM_CXXFLAGS=
 endif
 LD = g++
 GL_LIB_DEPS = $(LLVM_LDFLAGS) $(LLVM_LIBS) $(EXTRA_LIB_PATH) -lX11 -lXext -lm -lpthread -lstdc++
 # to allow the NV drivers to compile
 LIBDRM_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm)

12

configs/linux-llvm-debug

@@ -1,12 +0,0 @@
 # -*-makefile-*-
 # Configuration for Linux and LLVM with debugging info
 # Builds the llvmpipe gallium driver
 include $(TOP)/configs/linux-llvm
 CONFIG_NAME = linux-llvm-debug
 OPT_FLAGS = -g -ansi -pedantic
 DEFINES += -DDEBUG -UNDEBUG

27

configs/linux-opengl-es

@@ -1,27 +0,0 @@
 # Configuration for OpenGL ES on Linux
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-opengl-es
 # Directories to build
 LIB_DIR = lib
 SRC_DIRS = egl glsl mapi/es1api mapi/es2api mesa/es \
 	gallium gallium/winsys gallium/targets
 # egl st needs this
 DEFINES += -DGLX_DIRECT_RENDERING
 # no mesa or egl drivers
 DRIVER_DIRS =
 GALLIUM_DRIVERS_DIRS = softpipe
 # build libGLES*.so
 GALLIUM_STATE_TRACKERS_DIRS = es
 # build egl_x11_{swrast,i915}.so
 GALLIUM_DRIVERS_DIRS += trace rbug i915
 GALLIUM_STATE_TRACKERS_DIRS += egl
 GALLIUM_WINSYS_DIRS += drm/intel
 GALLIUM_TARGET_DIRS += egl-swrast egl-i915

26

configs/linux-osmesa

@@ -1,26 +0,0 @@
 # Configuration for building only libOSMesa on Linux, no Xlib driver
 # This doesn't really have any Linux dependencies, so it should be usable
 # on other (gcc-based) systems.
 include $(TOP)/configs/default
 CONFIG_NAME = linux-osmesa
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -g -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -D_GNU_SOURCE -DPTHREADS
 CXXFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Directories
 SRC_DIRS = mapi/glapi glsl mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm -lpthread -ldl
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

32

configs/linux-osmesa-static

@@ -1,32 +0,0 @@
 # Configuration for building static libOSMesa.a on Linux, no Xlib driver
 # This doesn't really have any Linux dependencies, so it should be usable
 # on other (gcc-based) systems.
 include $(TOP)/configs/default
 CONFIG_NAME = linux-osmesa
 # Compiler and flags
 CC = gcc -m32
 CXX = g++ -m32
 CFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DPTHREADS
 CXXFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Directories
 SRC_DIRS = mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

29

configs/linux-osmesa16

@@ -1,29 +0,0 @@
 # Configuration for 16 bits/channel OSMesa library on Linux
 include $(TOP)/configs/default
 CONFIG_NAME = linux-osmesa16
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include -DCHAN_BITS=16 -DDEFAULT_SOFTWARE_DEPTH_BITS=31
 CXXFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Library names
 OSMESA_LIB = OSMesa16
 OSMESA_LIB_NAME = libOSMesa16.so
 # Directories
 SRC_DIRS = mapi/glapi glsl mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

30

configs/linux-osmesa16-static

@@ -1,30 +0,0 @@
 # Configuration for 16 bits/channel OSMesa library on Linux
 include $(TOP)/configs/default
 CONFIG_NAME = linux-osmesa16-static
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -ansi -pedantic -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include -DCHAN_BITS=16 -DDEFAULT_SOFTWARE_DEPTH_BITS=31
 CXXFLAGS = -O3 -ansi -pedantic -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Library names
 OSMESA_LIB = OSMesa16
 OSMESA_LIB_NAME = libOSMesa16.a
 # Directories
 SRC_DIRS = gallium mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm -lpthread

29

configs/linux-osmesa32

@@ -1,29 +0,0 @@
 # Configuration for 32 bits/channel OSMesa library on Linux
 include $(TOP)/configs/default
 CONFIG_NAME = linux-osmesa32
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include -DCHAN_BITS=32 -DDEFAULT_SOFTWARE_DEPTH_BITS=31
 CXXFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Library names
 OSMESA_LIB = OSMesa32
 OSMESA_LIB_NAME = libOSMesa32.so
 # Directories
 SRC_DIRS = mapi/glapi glsl mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

9

configs/linux-ppc

@@ -1,9 +0,0 @@
 # Configuration for Linux on PPC
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-ppc
 OPT_FLAGS = -O3 -mcpu=603 -fsigned-char -funroll-loops
 # FIXME: Use of PowerPC assembly should be enabled here.

14

configs/linux-ppc-static

@@ -1,14 +0,0 @@
 # Configuration for Linux on PPC, static libs
 include $(TOP)/configs/linux-ppc
 CONFIG_NAME = linux-ppc-static
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

8

configs/linux-profile

@@ -1,8 +0,0 @@
 # Configuration for profiling on Linux with gprof
 include $(TOP)/configs/linux-static
 CONFIG_NAME = linux-profile
 OPT_FLAGS = -pg -g -O2
 DEFINES += -DNDEBUG

9

configs/linux-sparc

@@ -1,9 +0,0 @@
 # Configuration for Linux on Sparc
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-sparc
 #ASM_FLAGS = -DUSE_SPARC_ASM
 #MESA_ASM_SOURCES = $(SPARC_SOURCES)
 #GLAPI_ASM_SOURCES = $(SPARC_API)

7

configs/linux-sparc5

@@ -1,7 +0,0 @@
 # Configuration for Linux on Sparc5
 include $(TOP)/configs/linux-sparc
 CONFIG_NAME = linux-sparc5
 ARCH_FLAGS += -mcpu=ultrasparc

23

configs/linux-static

@@ -1,23 +0,0 @@
 # Configuration for generic Linux, making static libs
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-static
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies (static libs don't have dependencies)
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =
 # Need to specify all libraries we may need
 	-l$(GL_LIB) -lm -L/usr/X11R6/lib/ -lX11 -lXext -lXmu -lXi -lpthread

7

configs/linux-ultrasparc

@@ -1,7 +0,0 @@
 # Configuration for Linux on UltraSparc
 include $(TOP)/configs/linux-sparc
 CONFIG_NAME = linux-ultrasparc
 ARCH_FLAGS += -mv8 -mtune=ultrasparc

11

configs/linux-x86

@@ -1,11 +0,0 @@
 # Configuration for Linux with x86 optimizations
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-x86
 ARCH_FLAGS = -m32 -mmmx -msse -msse2
 ASM_FLAGS = -DUSE_X86_ASM -DUSE_MMX_ASM -DUSE_3DNOW_ASM -DUSE_SSE_ASM
 MESA_ASM_SOURCES = $(X86_SOURCES)
 GLAPI_ASM_SOURCES = $(X86_API)

7

configs/linux-x86-32

@@ -1,7 +0,0 @@
 # To build Linux x86 32-bit in an x86-64 environment
 include $(TOP)/configs/linux-x86
 CONFIG_NAME = linux-x86-32
 ARCH_FLAGS += -m32

14

configs/linux-x86-64

@@ -1,14 +0,0 @@
 # Configuration for Linux for 64-bit X86 (Opteron)
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-x86-64
 ARCH_FLAGS = -m64
 MESA_ASM_SOURCES = $(X86-64_SOURCES)
 GLAPI_ASM_SOURCES = $(X86-64_API)
 ASM_FLAGS = -DUSE_X86_64_ASM
 LIB_DIR = lib64
 EXTRA_LIB_PATH = -L/usr/X11R6/lib64

8

configs/linux-x86-64-debug

@@ -1,8 +0,0 @@
 # Configuration for Linux for 64-bit X86 (Opteron)
 include $(TOP)/configs/linux-x86-64
 CONFIG_NAME = linux-x86-64-debug
 OPT_FLAGS = -g
 DEFINES += -DDEBUG -DDEBUG_MATH

8

configs/linux-x86-64-profile

@@ -1,8 +0,0 @@
 # Configuration for profiling on Linux for 64-bit X86 (Opteron) with gprof
 include $(TOP)/configs/linux-x86-64-static
 CONFIG_NAME = linux-x86-64-profile
 OPT_FLAGS = -pg -g -O2
 DEFINES += -DNDEBUG

21

configs/linux-x86-64-static

@@ -1,21 +0,0 @@
 # Configuration for Linux for 64-bit X86 (Opteron), static libs
 include $(TOP)/configs/linux-x86-64
 CONFIG_NAME = linux-x86-64-static
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies (static libs don't have dependencies)
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

9

configs/linux-x86-debug

@@ -1,9 +0,0 @@
 # Configuration for Linux with x86 code, but no gcc optimizations and
 # debugging enabled.
 include $(TOP)/configs/linux-x86
 CONFIG_NAME = linux-x86-debug
 OPT_FLAGS = -g
 DEFINES += -DDEBUG -DDEBUG_MATH

8

configs/linux-x86-profile

@@ -1,8 +0,0 @@
 # Configuration for profiling on Linux with x86 optimizations with gprof
 include $(TOP)/configs/linux-x86-static
 CONFIG_NAME = linux-x86-profile
 OPT_FLAGS = -pg -g -O2
 DEFINES += -DNDEBUG

21

configs/linux-x86-static

@@ -1,21 +0,0 @@
 # Configuration for Linux with x86 optimizations, static libs
 include $(TOP)/configs/linux-x86
 CONFIG_NAME = linux-x86-static
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies (static libs don't have dependencies)
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

15

configs/netbsd

@@ -1,15 +0,0 @@
 # Configuration for NetBSD
 include $(TOP)/configs/default
 CONFIG_NAME = netbsd
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O2 -fPIC -DUSE_XSHM -I/usr/X11R6/include -DHZ=100
 CXXFLAGS = -O2 -fPIC
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing

20

configs/openbsd

@@ -1,20 +0,0 @@
 # Configuration for OpenBSD
 include $(TOP)/configs/default
 CONFIG_NAME = openbsd
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O2 -fPIC -I/usr/X11R6/include -DUSE_XSHM -DHZ=100
 CXXFLAGS = -O2 -fPIC -I/usr/X11R6/include -DHZ=100
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 GL_LIB_DEPS = -L/usr/X11R6/lib -lX11 -lXext -lm
 OSMESA_LIB_DEPS = -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB)

14

configs/osf1

@@ -1,14 +0,0 @@
 # Configuration for OSF/1
 include $(TOP)/configs/default
 CONFIG_NAME = osf1
 # Compiler and flags
 CC = cc
 CXX = cxx
 CFLAGS = -O0 -std1 -ieee_with_no_inexact -DUSE_XSHM -DPTHREADS -D_REENTRANT
 CXXFLAGS = -O2 -std ansi -ieee -DPTHREADS -D_REENTRANT
 GL_LIB_DEPS = -lX11 -lXext -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm

15

configs/osf1-static

@@ -1,15 +0,0 @@
 # Configuration for OSF/1
 include $(TOP)/configs/default
 CONFIG_NAME = osf1
 # Compiler and flags
 CC = cc
 CXX = cxx
 CFLAGS = -O2 -std1 -ieee_with_no_inexact -DUSE_XSHM -DPTHREADS -D_REENTRANT
 CXXFLAGS = -O2 -std ansi -ieee -DPTHREADS -D_REENTRANT
 MKLIB_OPTIONS = -static
 GL_LIB_DEPS =
 GLU_LIB_DEPS =

16

configs/solaris-x86

@@ -1,16 +0,0 @@
 # Configuration for Solaris on x86
 include $(TOP)/configs/default
 CONFIG_NAME = solaris-x86
 # Compiler and flags
 CC = cc
 CFLAGS = -Xa -xO3 -xpentium -KPIC -I/usr/openwin/include -DUSE_XSHM
 MKLIB_OPTIONS = -static
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

18

configs/solaris-x86-gcc

@@ -1,18 +0,0 @@
 # Configuration for Solaris on x86 with gcc, dynamic libs
 include $(TOP)/configs/default
 CONFIG_NAME = solaris-x86-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -march=i486 -fPIC -I/usr/openwin/include -DUSE_XSHM
 CXXFLAGS = -O3 -march=i486 -fPIC
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 GL_LIB_DEPS = -L/usr/openwin/lib -lX11 -lXext -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm

24

configs/solaris-x86-gcc-static

@@ -1,24 +0,0 @@
 # Configuration for Solaris on x86 with gcc, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = solaris-x86-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -march=i486 -fPIC -I/usr/openwin/include -DUSE_XSHM
 CXXFLAGS = -O3 -march=i486 -fPIC
 MKLIB_OPTIONS = -static
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 GL_LIB_DEPS = -L/usr/openwin/lib -lX11 -lXext -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

11

configs/sunos4

@@ -1,11 +0,0 @@
 # Configuration for SunOS 4, shared libs
 include $(TOP)/configs/default
 CONFIG_NAME = sunos4
 # Compiler and flags
 CC = acc
 CFLAGS = -Kpic -O -I/usr/include/X11R5 -DUSE_XSHM -DSUNOS4

17

configs/sunos4-gcc

@@ -1,17 +0,0 @@
 # Configuration for SunOS 4, with gcc, shared libs
 include $(TOP)/configs/default
 CONFIG_NAME = sunos4-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -fPIC -O3 -I/usr/openwin/include -I/usr/include/X11R5 -I/usr/include/X11R5 -DUSE_XSHM -DSUNOS4
 CXXFLAGS = -fPIC -O3 -I/usr/openwin/include -DSUNOS4
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing

22

configs/sunos4-static

@@ -1,22 +0,0 @@
 # Configuration for SunOS 4, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = sunos4-static
 # Compiler and flags
 CC = acc
 CFLAGS = -O -DUSE_XSHM -DSUNOS4
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies (static libs don't have dependencies)
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

15

configs/sunos5

@@ -1,15 +0,0 @@
 # Configuration for SunOS 5
 include $(TOP)/configs/default
 CONFIG_NAME = sunos5
 # Compiler and flags
 CC = cc
 CXX = c++
 CFLAGS = -KPIC -Xa -O -I/usr/openwin/include -I/usr/dt/include -DUSE_XSHM
 CXXFLAGS = -KPIC -Xa -O -I/usr/openwin/include -I/usr/dt/include
 GL_LIB_DEPS = -L/usr/openwin/lib -L/usr/dt/lib -lX11 -lXext -lXmu -lXi -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -L/usr/openwin/lib -lXt -lX11

Compare commits

1774 Commits i965-primi ... 9.0-branch

11 .dir-locals.el Normal file
10 .emacs-dirvars
1 .gitignore vendored
4 Android.common.mk
271 Makefile
125 Makefile.am Normal file
10 autogen.sh
1 bin/.gitignore vendored
48 bin/confdiff.sh
20 bin/extract_git_sha1
23 bin/shortlog_mesa.sh Executable file
17 bin/version.mk
3 common.py
27 configs/aix
24 configs/aix-64
21 configs/aix-64-static
21 configs/aix-gcc
20 configs/aix-static
31 configs/bluegene-osmesa
27 configs/bluegene-xlc-osmesa
30 configs/catamount-osmesa-pgi
49 configs/autoconf.in → configs/current.in
61 configs/darwin
7 configs/darwin-fat-32bit
7 configs/darwin-fat-all
7 configs/darwin-fat-intel
44 configs/default
29 configs/freebsd
48 configs/freebsd-dri
10 configs/freebsd-dri-amd64
13 configs/freebsd-dri-x86
13 configs/hpux10
18 configs/hpux10-gcc
26 configs/hpux10-static
27 configs/hpux11-32
25 configs/hpux11-32-static
24 configs/hpux11-32-static-nothreads
28 configs/hpux11-64
25 configs/hpux11-64-static
28 configs/hpux11-ia64
25 configs/hpux11-ia64-static
15 configs/hpux9
13 configs/hpux9-gcc
16 configs/irix6-64
24 configs/irix6-64-static
16 configs/irix6-n32
23 configs/irix6-n32-static
17 configs/irix6-o32
23 configs/irix6-o32-static
37 configs/linux
19 configs/linux-alpha
27 configs/linux-alpha-static
9 configs/linux-debug
72 configs/linux-dri
8 configs/linux-dri-debug
9 configs/linux-dri-ppc
13 configs/linux-dri-x86
17 configs/linux-dri-x86-64
54 configs/linux-dri-xcb
58 configs/linux-egl
18 configs/linux-ia64-icc
23 configs/linux-ia64-icc-static
19 configs/linux-icc
23 configs/linux-icc-static
52 configs/linux-indirect
47 configs/linux-llvm
12 configs/linux-llvm-debug
27 configs/linux-opengl-es
26 configs/linux-osmesa
32 configs/linux-osmesa-static
29 configs/linux-osmesa16
30 configs/linux-osmesa16-static
29 configs/linux-osmesa32
9 configs/linux-ppc
14 configs/linux-ppc-static
8 configs/linux-profile
9 configs/linux-sparc
7 configs/linux-sparc5
