Comparing 602913192d...f3bc8c7fbc - mesa

fran/mesa

Author	SHA1	Message	Date
Jordan Justen	85e97b18e0	mesa: don't enable legacy GL functions when using API_OPENGL_CORE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2012-07-24 15:41:59 -07:00
Jordan Justen	f2c8a8f550	intel: add support for using API_OPENGL_CORE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2012-07-24 15:41:59 -07:00
Jordan Justen	631566bd77	meta: add support for using API_OPENGL_CORE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2012-07-24 15:41:59 -07:00
Jordan Justen	7027b53956	glsl: add support for using API_OPENGL_CORE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2012-07-24 15:41:59 -07:00
Jordan Justen	b0396f5d7b	mesa: add support for using API_OPENGL_CORE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2012-07-24 15:41:59 -07:00
Jordan Justen	cbc6974330	mesa: add api check macros These macros make it easier to check for multiple API types. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2012-07-24 15:41:59 -07:00
Jordan Justen	f7a395f970	mesa: add API_OPENGL_CORE api Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2012-07-24 15:41:58 -07:00
Paul Berry	497bf5dd2b	i965/msaa: Switch on 8x MSAA for Gen7. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:59 -07:00
Paul Berry	7285612713	i965/msaa: Adjust MCS buffer allocation for 8x MSAA. MCS buffers use 32 bits per pixel in 8x MSAA, and 8 bits per pixel in 4x MSAA. This patch adjusts the format we use to allocate the buffer so that enough memory is set aside for 8x MSAA. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	304be9db14	i965/msaa: Remove assertion in 3DSTATE_SAMPLE_MASK to allow 8x MSAA. The code to emit 3DSTATE_SAMPLE_MASK was already correct for 8x MSAA--this patch just removes an assertion that would have prevented it from being used for 8x MSAA. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	2a9ab29ed9	i965/msaa: Adjust 3DSTATE_MULTISAMPLE packet for 8x MSAA. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	7fae97c98b	i965/blorp: Encode and decode IMS format for 8x MSAA correctly. This patch updates the blorp functions encode_msaa() and decode_msaa() to properly handle the encoding of IMS MSAA buffers when num_samples=8. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	619471dc32	i965/blorp: Compute sample number correctly for 8x MSAA. When operating in persample dispatch mode, the blorp engine would previously assume that subspan N always represented sample N (this is correct assuming 4x MSAA and a 16-wide dispatch). In order to support 8x MSAA, we must compute which sample is associated with each subspan, using the "Starting Sample Pair Index" field in the thread payload. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	082874e389	i965/blorp: Properly adjust primitive size for 8x MSAA. When rendering to an IMS MSAA surface on Gen7, blorp sets up the rendering pipeline as though it were rendering to a single-sampled surface; accordingly it must adjust the size of the primitive it sends down the pipeline to account for the interleaving of samples in an IMS surface. This patch modifies the size adjustment code to properly handle 8x MSAA, which makes room for the extra samples by using an interleaving pattern that is twice as wide as 4x MSAA. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	17eae9762c	i965/blorp: Parameterize manual_blend() by num_samples. This patch adds a num_samples argument to the blorp function manual_blend(), allowing it to be told how many samples need to be blended together. Previously it assumed 4x MSAA, since that was all we supported. We also bump up LOG2_MAX_BLEND_SAMPLES from 2 to 3, so that manual_blend() will be able to handle 8x MSAA. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-24 14:52:58 -07:00
Paul Berry	4afee38a2f	i965/msaa: Remove comment about falsely claiming to support MSAA. Gen6+ hardware now supports MSAA properly. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:58 -07:00
Paul Berry	ff9313fac7	i965/blorp: Handle DrawBuffers properly. When the client program uses glDrawBuffer() or glDrawBuffers() to select more than one color buffer for drawing into, and then performs a blit, we need to blit into every single enabled draw buffer. +2 oglconforms. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50407 Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	fa1d267beb	i965/blorp: Rearrange order of blit validation and preparation steps. This patch rearranges the order of steps performed by a blorp blit from this: - Sync up state of window system buffers. - Find buffers. - Find miptrees. - Make sure buffer formats match. - Handle mirroring. - Make sure width and height match. - Handle clipping/scissoring. - Account for window system origin conventions. - Do depth resolves, if applicable. - Do the blit. - Record the need for a future HiZ resolve, if applicable. To this: - Sync up state of window system buffers. - Handle mirroring. - Make sure width and height match. - Handle clipping/scissoring. - Account for window system origin conventions. - Find buffers. - Make sure buffer formats match. - Find miptrees. - Do depth resolves, if applicable. - Do the blit. - Record the need for a future HiZ resolve, if applicable. The steps are the same, but they are now performed in an order that will make it possible to implement correct DrawBuffers support. Note that the last four steps are now in a separate function (do_blorp_blit), since they will need to be executed repeatedly when DrawBuffers support is added. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	eac4f1a707	i965/blorp: Don't fall back to swrast when miptrees absent. Previously, the blorp engine would fall back to swrast if the source or destination of a blit had no associated miptree. This was unnecessary, since _mesa_BlitFramebufferEXT() already takes care of making the blit silently succeed if there are no buffers bound, so the fallback paths could never actually happen in practice. Removing these fallback paths will simplify the implementation of correct DrawBuffers support in blorp. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	0dbec6ae07	i965/blorp: Fixup scissoring of blits to window system buffers. This patch modifies the order of operations in the blorp engine so that clipping and scissoring are performed before adjusting the coordinates to account for the difference in origin convention between window system buffers and framebuffer objects. Previously, we would do clipping and scissoring after adjusting for origin conventions, so we would get scissoring wrong in window system buffers. Fixes Piglit test "fbo-scissor-blit window". Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	da54d2e576	i965/blorp: Simplify check that src/dst width/height match. When checking that the source and destination dimensions match, we don't need to store the width and height in variables; doing so just risks confusion since right after the check, we do clipping and scissoring, which may alter the width and height. No functional change. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	bac43b8bb7	i965/msaa: Work around problems with null render targets on Gen6. On Gen6, multisampled null render targets don't seem to work properly--they cause the GPU to hang. So, as a workaround, we render into a dummy color buffer. Fortunately this situation (multisampled rendering without a color buffer) is rare, and we don't have to waste too much memory, because we can give the workaround buffer a very small pitch. Fixes piglit test "EXT_framebuffer_multisample/no-color {2,4} depth-computed *" on Gen6. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	0aeb87023e	i965: Set width, height, and tiling properly for null render targets. The HW docs say that the width and height of null render targets need to match the width and height of the corresponding depth and/or stencil buffers, and that they need to be marked as Y-tiled. Although leaving these values at 0 doesn't seem to cause any ill effects, it seems wise to follow the documented requirements. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	691c55f356	i965/msaa: Control multisampling behaviour via the visual. Previously, we used the number of samples in draw buffer 0 to determine whether to set up the 3D pipeline for multisampling. Using the visual is cleaner, and has the benefit of working properly when there is no color buffer. Fixes all piglit tests "EXT_framebuffer_multisample/no-color" on Gen7. On Gen6, the "depth-computed" variants of these tests still fail; this will be addresed in a later patch. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:57 -07:00
Paul Berry	48fdfbcb58	msaa: Compute visual samples/sampleBuffers from all buffers. This patch ensures that Visual.samples and Visual.sampleBuffers are set correctly even in the case where there is no color buffer. Previously, these values would retain their default value of 0 in this circumstance, even if the depth or stencil buffer was multisampled. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-24 14:52:56 -07:00
Anthony G. Basile	f35e380dd2	Fix compile time errors when building against uclibc Mesa misses a few checks when compiling on a uclibc system which cause it to fall back on glibc-ism. This patch addresses those issues. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Anthony G. Basile <blueness@gentoo.org>	2012-07-24 13:00:47 -07:00
Jerome Glisse	1ffac44e83	r600g: enable streamout only on 2.14 or latter kernel The kernel streamout support was supposed to get into 3.3 along the tiling change and thus use the same kernel version bump of 2.13 to report userspace that streamout register were supported. This is not what happen. So as streamout kernel support did not bump the kernel driver version, rely on kernel 2.14 version bump to know if streamout is enabled or not. Which means you need at least 3.4 kernel. Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-07-24 15:08:31 -04:00
Jordan Justen	881bb4ac72	intel: move error on create context to proper path The error was being set on the non-error path, rather than the error path. NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-24 11:59:19 -07:00
Jordan Justen	01168df4d9	mesa context: generate an error for uninstalled context functions For 'non-legacy' contexts we will want to generate an error if an uninstalled function is called. The effect of this change will be that we can avoid installing legacy functions, and they will then generate an error as needed for deprecated functions in GL >= 3.1. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-24 11:50:35 -07:00
Brian Paul	1f9239ec8d	nouveau: include glformats.h to get missing prototype Fixes http://bugs.freedesktop.org/show_bug.cgi?id=52449	2012-07-24 10:33:20 -06:00
Brian Paul	a271a0c9f6	mesa: improve comment in build_tnl_program()	2012-07-24 09:54:50 -06:00
Brian Paul	8f2a13c5e3	docs: the legacy makefile system is removed in Mesa 8.1	2012-07-24 08:49:02 -06:00
Brian Paul	7e18a039ee	mesa: move _mesa_error_check_format_and_type() to glformats.c Now all the format/type-related helper functions are in glformats.c and image.c is just image-related functions.	2012-07-24 08:37:29 -06:00
Brian Paul	a1287f549a	mesa: move more format helper functions to glformats.c	2012-07-24 08:37:29 -06:00
Brian Paul	8b762ebd72	mesa: move some format helper functions to glformats.c	2012-07-24 08:37:29 -06:00
Christian König	de3335dba8	radeonsi: remove old state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	9b213c871a	radeonsi: move everything else into the new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	53d47889e6	radeonsi: move format handling into si_state.c Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	73dd906ba0	radeonsi: move remaining sampler state into si_state.c Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	ca9cf611b6	radeonsi: move draw state into new handling Split it out into si_state_draw.c Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	0d6b0b512a	radeonsi: move constants to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	baf2039756	radeonsi: move sampler states into new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	3c09f11e5c	radeonsi: move shaders to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	bd2a5cf328	radeonsi: move spi into new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	840f05da6b	radeonsi: move init state to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	e4e6f954ae	radeonsi: move draw_info to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	76660dfcce	radeonsi: move CB_TARGET_MASK into fb/blend state Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	e6937211da	radeonsi: move stencil_ref to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:30 +02:00
Christian König	b41b3eb989	radeonsi: move dsa state to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	bd18a316e1	radeonsi: move infeered fb/rs state to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	f67fae0e43	radeonsi: move rasterizer state into new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	835098a529	radeonsi: move framebuffer to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	7e011d92c9	radeonsi: move viewport to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	43f414f7b7	radeonsi: move scissor state to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	9cbbe0d4e6	radeonsi: move clip state to new handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	0a091a4824	radeonsi: move blend color to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	63636ae52a	radeonsi: move blender to new state handling Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Christian König	bf7302a6e1	radeonsi: rework state handling v2 Add a complete new state handling for SI. v2: fix spelling error Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-07-24 12:29:29 +02:00
Brad King	27382c0f7b	automake: Honor GL_LIB for mangled/custom lib names Commit `2d4b77c7` (automake: Convert src/mesa/drivers/x11/Makefile to automake, 2012-06-12) dropped the old Makefile, which used GL_LIB, and replaced it with a Makefile.am hard-coding the name "GL". This broke handling of --enable-mangling and --with-gl-lib-name options which depend on GL_LIB to specify the GL library name. Use "@GL_LIB@" in src/mesa/drivers/x11/Makefile.am to configure the library name. Also use this approach to simplify src/glx/Makefile.am and drop the HAVE_MANGLED_GL conditional. While at it, fix the compatibility link we create in "lib" for the software-only driver to use version GL_MAJOR instead of hard-coding "1". Reviewed-by: Dan Nicholson <dbn.lists@gmail.com>	2012-07-23 22:34:13 -07:00
Marek Olšák	82fc813ca8	st/mesa: fix DDY opcode for FBOs This fixes piglit/fbo-deriv. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-23 19:23:53 +02:00
Marek Olšák	f40b5723f0	st/mesa: set the centroid qualifier in fragment shader inputs This fixes some centroid tests in the EXT_framebuffer_multisample piglit group. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-23 19:23:53 +02:00
Marek Olšák	162b3ad94d	st/mesa: flush the glBitmap cache before changing framebuffer state This fixes the piglit EXT_framebuffer_multisample/bitmap tests. Note that we must not rely on ctx->DrawBuffer when flushing the cache, because that's already updated with a new framebuffer. We want to draw into the old framebuffer where glBitmap was called. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-23 19:23:53 +02:00
Marek Olšák	07b9b3c37b	st/mesa: set the correct window renderbuffer internal format The multisample-resolve blit relies on this being correct. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-23 19:23:52 +02:00
Marek Olšák	5927227576	mesa: fix format checking when doing a multisample resolve v2: make it more bullet-proof Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-23 19:23:52 +02:00
José Fonseca	c30bf68946	gallivm: Prefer the standard JIT engine whenever possible. Testing shows that the standard JIT engine retrofited with AVX support is quite stable and as capable to handle AVX instructions as MC-JIT is. And the old JIT is much more memory efficient, as we don't need to allocate one engine instance per shader, as we do for MC-JIT due to its incompleteness. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-23 17:46:38 +01:00
Jerome Glisse	cb149bf9e1	r600g: don't emit forbidden reg with old kernel on evergreen Fix https://bugs.freedesktop.org/show_bug.cgi?id=52313 Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-07-23 11:42:36 -04:00
Jerome Glisse	b7b5a77ec0	r600g: don't emit forbidden register on old kernel Fix https://bugs.freedesktop.org/show_bug.cgi?id=52313 Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-07-23 11:28:25 -04:00
Vincent Lejeune	bc4b4c605c	radeon/llvm: Fix a bug with IF LOGICALNZ with int operand Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-23 15:04:36 +00:00
Tom Stellard	044de40cb0	pipe_loader: Try to connect with the X server before probing pciids v2 When X is running it is neccesary for pipe_loader to authenticate with DRM, in order to be able to use the device. This makes it possible to run OpenCL programs while X is running. v2: - Fix C++ style comments - Drop Xlib-xcb dependency - Close the X connection when done - Split auth code into separate function Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-07-23 13:25:36 +00:00
Tom Stellard	17f6c9195f	configure.ac: Add --with-llvm-prefix option This option allows you to specify the llvm install prefix. It is useful for switching between different versions of LLVM.	2012-07-23 13:25:36 +00:00
Kenneth Graunke	c3bc41011f	mesa: Prevent repeated glDeleteShader() from blowing away our refcounts. Calling glDeleteShader() should mark shaders as pending for deletion, but shouldn't decrement the refcount every time. Otherwise, repeated glDeleteShader() is not safe. This is particularly bad since glDeleteProgram() frees shaders: if you first call glDeleteShader() on the shaders attached to the program (thus decrementing the refcount), then called glDeleteProgram(), it would try to free them again (decrementing the refcount another time), causing a refcount > 0 assertion to fail. Similar to commit `d950a778`. NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-22 14:34:44 -07:00
Matt Turner	cfdf60f236	imports.h: Correct ceilf typo. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-22 14:06:08 -07:00
Marek Olšák	f96405f254	st/mesa: remove st_flush_bitmap wrapper just a cleanup	2012-07-22 03:32:55 +02:00
Jordan Justen	749c9060ac	mesa formats: add MESA_FORMAT_ABGR2101010_UINT Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-21 16:49:42 -07:00
Jordan Justen	1c8812c244	mesa formats: unpack ARGB8888/XRGB8888 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-21 16:49:42 -07:00
Jordan Justen	8c265cf5ef	mesa pack: use _mesa_problem instead of assert If the pack type is not supported, use _mesa_problem rather than asserting. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-21 16:49:42 -07:00
Jordan Justen	9ad8f431b2	mesa: add glformats integer type/format detection routines _mesa_is_integer_format is moved to formats.c and renamed as _mesa_is_enum_format_integer. _mesa_is_format_unsigned, _mesa_is_type_integer, _mesa_is_type_unsigned, and _mesa_is_enum_format_or_type_integer are added. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-21 16:49:42 -07:00
Vinson Lee	e2e7b467d8	scons: Add instrumentation component libraries to linking on llvm-3.2. llvm-3.2svn r160587 moved createBoundsCheckingPass from lib/Transforms/Scalar to lib/Transforms/Instrumentation. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-21 10:38:25 -07:00
Matt Turner	d24cf88a1a	Remove unused _mesa_memset16 Unused since commit `fd104a845`. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-21 08:23:38 -07:00
Matt Turner	f58ba6ca91	Remove _mesa_inv_sqrtf in favor of 1/SQRTF Except for a couple of explicit uses, _mesa_inv_sqrtf was disabled since its addition in 2003 (see `f9b1e524`). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-21 08:23:38 -07:00
Matt Turner	948b1c541f	Remove _mesa_sqrt* in favor of plain sqrt Temporarily disabled since 2003 (see `386578c5b`). This saves us from calling sqrt() 128 times to generate the sqrttab in one_time_init(). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-21 08:23:38 -07:00
Matt Turner	ec79138138	Use INV_SQRT instead of 1/SQRTF Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-21 08:23:38 -07:00
José Fonseca	bd9bf7a424	autoconf: Only kink mcjit component when available. Should fix build failures with older LLVM version, but only tested on LLVM 3.1.	2012-07-21 11:43:35 +01:00
Chad Versace	735070c45b	i830: Fix stack corruption Found by compiler warning: i830_texstate.c:131:28: warning: argument to 'sizeof' in 'memset' call is the same expression as the destination; did you mean to dereference it? [-Wsizeof-pointer-memaccess] memset(state, 0, sizeof(state)); ~~~~~ ^~~~~ On 64-bit systems, memset here would write an extra 4 bytes. Note: This is a candidate for the stable branches. Reviewed-by: Brian Paul <brianp@vmware.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-20 16:01:57 -07:00
José Fonseca	1a8f6ac5a4	mesa: disable MSVC global optimization in pack.c To reduce excessive compilation time in release mode. NOTE: This is a candidate for the 8.0 branch. Tested-by: Brian Paul <brianp@vmware.com>	2012-07-20 16:23:22 -06:00
Brian Paul	9fd4e9e9e6	mesa: whitespace fixes in pbo.c	2012-07-20 16:22:59 -06:00
Brian Paul	ac14f569fe	mesa: update texstore.c comment	2012-07-20 15:13:19 -06:00
Roland Scheidegger	70a969f123	llvmpipe: use runtime loop instead of static loop for looping over quads This can potentially cut shader program size by a factor of 4 for 4-wide execution respectively 2 for 8-wide execution and while this ratios aren't quite reached for more complex shaders it can be close. Could not really measure a performance difference so far except for trivial shaders (glxgears). There seems to be a fair amount of unnecessary move's generated especially at the beginning it might be possible to optimize those away somehow. Things aren't quite as clean, some additional stuff needs to be done for keeping both paths working (though llvm might be able to optimize this away). glxgears seems to lose about 5-10% of performance, looking at the generated shaders this is actually less than I'd think it would be - both 4 and 8-wide shaders, despite containing a loop actually have about 10% more instructions in total, and will have roughly 50% more executed instructions (though mostly cheap ones). Need to figure out how to reduce overhead... v2: keep complex interpolation for 4-wide mode, adapt to interface changes. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-20 20:17:15 +01:00
Roy Spliet	542bd6941f	nv30: Support negative offsets in indirect constant access. Fixes piglit vp-address-01 amongst several others. Signed-off-by: Roy Spliet <r.spliet@student.tudelft.nl> Reviewed-by: Lucas Stach <dev@lynxeye.de> Tested-by: Lucas Stach <dev@lynxeye.de>	2012-07-20 20:31:40 +02:00
Bryan Cain	248e6f0331	nv50/ir: set position before i instead of i->next in NV50LoweringPreSSA::visit Fixes rendering glitches in Psychonauts such as Raz's eyes flickering white. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=51962.	2012-07-20 20:30:07 +02:00
Eric Anholt	b2a44cde64	i965/gen7: Increase the WM threads to hardware limits. This thread count is only supposed to be enabled when "WIZ Hashing Disable in GT_MODE register enabled." I've always been confused whether that means the bit in the register should be 1 or 0. For my IVB GT2's register 0x7008 value of 0x0, this appears to work fine. Improves l4d2 performance at 640x480 by 0.88 +/- 0.11% (n=88). Improves performance with rasterization at 1280x1024 by 1.45% +/- 0.36% (n=6). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-20 11:05:39 -07:00
Eric Anholt	8ab5842a6d	glsl: Assign locations for uniforms in UBOs using the std140 rules. Fixes piglit layout-std140. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:44:04 -07:00
Eric Anholt	9feb403b0e	glsl: Don't resize arrays in uniform blocks. This is a requirement for std140 uniform blocks, and optional for packed/shared blocks. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:59 -07:00
Eric Anholt	0cea8a56b6	glsl: Don't dead-code eliminiate uniforms declared in uniform blocks. This is a requirement for std140 uniform blocks, and optional for packed/shared blocks. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:52 -07:00
Eric Anholt	548bce4733	mesa: Implement the UBO-specific pnames of glGetActiveUniformsiv. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:50 -07:00
Eric Anholt	a74507dc94	glsl: Propagate uniform block information into gl_uniform_storage. Now we can actually return information on uniforms in uniform blocks in the new queries. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:47 -07:00
Eric Anholt	ddc88fbf51	mesa: Add implementation of glGetUniformBlockIndex(). Now that we finally have a list of uniform blocks in the linked shader program, we can tell what their indices are. Fixes piglit GL_ARB_uniform_buffer_object/getuniformblockindex. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:44 -07:00
Eric Anholt	093b20666d	glsl: Set the uniform_block index for the linked shader variables. At this point in the linking, we've totally lost track of the struct gl_uniform_buffer that this pointed to in the original unlinked shader, so we do a nasty n^2 walk to find it the new one based on the variable name. Note that these point into the shader's list of gl_uniform_buffers, not the linked program's. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:42 -07:00
Eric Anholt	9f1a4a6340	mesa: Add support for glGetActiveUniformsiv on non-UBO pnames. We'll need to propagate the UBO fields to the uniform storage records before we can handle the other pnames. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:40 -07:00
Eric Anholt	acfbdfcbc8	mesa: Add support for glGetUniformIndices(). This is a single entrypoint that maps from a series of names to the indices of those names within the active uniforms list. Each index is like glGetUniformLocation()'s return value, except that it doesn't encode an array offset. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:35 -07:00
Eric Anholt	abcdbdf9cc	mesa: Move the _mesa_uniform_merge_location_offset to glGetUniformLocation(). With the upcoming GL_ARB_uniform_buffer_object changes, the only other caller that will want the cooked value is state_tracker. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:33 -07:00
Eric Anholt	f609cf782a	glsl: Merge the lists of uniform blocks into the linked shader program. This attempts error-checking, but the layout isn't done yet. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:28 -07:00
Eric Anholt	b3c093c79c	glsl: Translate the AST for uniform blocks into some IR structures. We're going to need this structure to cross-validate the uniform blocks between shader stages, since unused ir_variables might get dropped. It's also the place we store the RowMajor qualifier, which is not part of the GLSL type (since that would cause a bunch of type equality checks to fail). Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:19 -07:00
Eric Anholt	f7561e8ecd	glsl: Turn UBO variable declarations into ir_variables and check qualifiers. Fixes piglit layout--non-uniform and layout--within-block. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-20 10:43:12 -07:00
Lucas Stach	cdad337fec	st/xorg: fix masked transformations Someone tried to be clever and "optimized" add_vertex_data2() to just use two points for the texture coordinates and then reuse individual components. Sadly this is not how matrix multiplication works. Fixes rendercheck -t tmcoords Signed-off-by: Lucas Stach <dev@lynxeye.de>	2012-07-20 18:47:54 +02:00
Paul Berry	60c3e69dbf	i965/blorp: Use IMS layout when texturing from depth/stencil surfaces. Previously, on Gen7, when texturing from a depth or stencil surface, the blorp engine would configure the 3D pipeline as though the input surface was non-multisampled, and perform the necessary coordinate transformations in the fragment shader to account for the IMS layout. This meant outputting a lot of extra fragment shader code, and it raised some uncertainty about how to deal with very large surfaces. This patch modifies blorp to configure the 3D pipeline properly for IMS layout when reading from depth and stencil surfaces. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:38 -07:00
Paul Berry	0dd5e98aa5	i965/blorp: Loosen assertions in compute_msaa_layout_for_pipeline. Previously, on Gen7, compute_msaa_layout_for_pipeline() would verify that IMS layout is not used. However, now that we configure SURFACE_STATE correctly for IMS surfaces, IMS layout is available. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:38 -07:00
Paul Berry	989218b980	i965/blorp: Configure SURFACE_STATE correctly for IMS surfaces. This patch modifies gen7_set_surface_num_multisamples() to set up the SURFACE_STATE appropriately for texturing from IMS format MSAA surfaces (which are only used on Gen7 for depth and stencil buffers). Since the function now sets more than just the number of multisamples, it's been renamed to gen7_set_surface_msaa(). This will make it possible to remove some kludginess from the blorp engine. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:38 -07:00
Paul Berry	f91b4d92b9	i965/blorp: Optimize manual_blend() for compressed multisampled surfaces. When downsampling a compressed multisampled surface, we can take a shortcut to downsample any pixels that were completely covered by a single primitive. In this case, the first color value we fetch is the correct final color for the downsampled pixel, so we can skip the rest of the blending operation. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:37 -07:00
Paul Berry	e5d983267a	i965/blorp: Fix integer downsampling on Gen7. When downsampling an integer-format buffer on Gen7, we need to use the "avg" instruction rather than the "add" instruction, to ensure that we don't overflow the range of 32-bit integers. Also, we need to use the proper register type (BRW_REGISTER_TYPE_D or BRW_REGISTER_TYPE_UD) for intermediate color data and for writing to the render target. Note: this patch causes blorp to use the proper register type for all operations (downsampling, upsampling, and ordinary blits). Strictly speaking, this is only necessary for downsampling, because the other operations exclusively use MOV instructions on the color data. But it's simpler to use the proper register type in all cases. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:37 -07:00
Paul Berry	b961d37e61	i965/blorp: Modify manual_blend() to avoid unnecessary loss of precision. When downsampling from an MSAA image to a single-sampled image, it is inevitable that some loss of numerical precision will occur, since we have to use 32-bit floating point registers to hold the intermediate results while blending. However, it seems reasonable to expect that when all samples corresponding to a given pixel have the exact same color value, there will be no loss of precision. Previously, we averaged samples as follows: blend = (((sample[0] + sample[1]) + sample[2]) + sample[3]) / 4 This had the potential to lose numerical precision when all samples have the same color value, since ((sample[0] + sample[1]) + sample[2]) may not be precisely representable as a 32-bit float, even if the individual samples are. This patch changes the formula to: blend = ((sample[0] + sample[1]) + (sample[2] + sample[3])) / 4 This avoids any loss of precision in the event that all samples are the same, by ensuring that each addition operation adds two equal values. As a side benefit, this puts the formula in the form we will need in order to implement correct blending of integer formats. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:37 -07:00
Paul Berry	6a27506181	i965: Add support for AVG instruction. From the Ivy Bridge PRM, Vol4 Part3 p152: "The avg instruction performs component-wise integer average of src0 and src1 and stores the results in dst. An integer average uses integer upward rounding. It is equivalent to increment one to the addition of src0 and src1 and then apply an arithmetic right shift to this intermediate value." Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-20 09:35:37 -07:00
Paul Berry	9544e44262	i965: Replace fs_visitor::kill_emitted with gl_fragment_program::UsesKill. The kill_emitted variable was duplicating the functionality of gl_fragment_program::UsesKill. There's no need for both. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-20 09:33:07 -07:00
Paul Berry	0f1f2ff8db	mesa: Set gl_fragment_program::UsesKill in do_set_program_inouts. Previously, the code for setting this flag for GLSL programs was duplicated in three places: brw_link_shader(), glsl_to_tgsi_visitor, and ir_to_mesa_visitor. In addition to the unnecessary duplication, there was a performance problem on i965: brw_link_shader() set the flag before doing its final round of optimizations, which meant that if the optimizations managed to eliminate all the discard operations, the flag would still be set, resulting (at least in theory) in slower performance. This patch consolidates all of the code that sets UsesKill for GLSL programs into do_set_program_inouts(), which already is doing a similar job for UsesDFdy, and which occurs after i965's final round of optimizations. Non-GLSL programs (ARB programs and the state tracker's glBitmap program) are unaffected. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-20 09:33:07 -07:00
Kristian Høgsberg	a8c092266e	gallium-egl: Move wayland query_buffer implementation Move it to native_wayland_drm_bufmgr_helper.c which only gets compiled when wayland is enabled and which already includes the right headers. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-19 16:11:06 -04:00
Olivier Galibert	fbe3fa74e5	softpipe: Fix segfault with fbo-cubemap. The cube sampler generates two-dimensional texture coordinates and hence passes NULL for the array for the third one. The actual 2D sampler, lower in the pipe, knew not to used that array since it didn't need it. But the samplers have become single-texel and the coordinate array dereference has been moved up one step, to a level where the code does not know only two coordinates are used. Hence the segfault. The simplest fix by far is to add a third dummy coordinate array in the call to the next pipe step, which will be dereferenced to an harmless 0 which then will be happily ignored by the sampler. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=52250 Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-19 13:19:14 -06:00
Kristian Høgsberg	d7522ed130	wayland: Support EGL_WIDTH and EGL_HEIGHT queries for wl_buffer We're going to make the public wl_buffer struct as small as possible. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-19 14:03:17 -04:00
Kristian Høgsberg	e23bfdb329	wayland: Use existing EGL_TEXTURE_FORMAT for querying wl_buffer texture format We also reuse EGL_TEXTURE_RGBA and EGL_TEXTURE_RGB, adding only the new planar YUV texture formats: EGL_TEXTURE_Y_U_V_WL, EGL_TEXTURE_Y_UV_WL and EGL_TEXTURE_Y_XUXV_WL. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-19 14:03:17 -04:00
Kristian Høgsberg	e1b45a3c06	gallium-egl: Implement eglQueryWaylandBufferWL Support this query for gallium EGL too. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-19 14:03:17 -04:00
Kenneth Graunke	d43f4181e1	glsl: Remove open coded version of ir_variable::interpolation_string(). Presumably the function didn't exist when we wrote this code. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-19 11:00:00 -07:00
Paul Berry	d08fdacd58	i965: Avoid unnecessary recompiles for shaders that don't use dFdy(). The i965 back-end needs to compile dFdy() differently for FBOs and window system framebuffers, because Y coordinates are flipped between the two (see commit `82d2596`: i965: Compute dFdy() correctly for FBOs). This patch avoids unnecessarily recompiling shaders that don't use dFdy(), by only setting render_to_fbo in the wm program key if the shader actually uses dFdy(). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-19 10:02:25 -07:00
Paul Berry	ce1d2f08f9	glsl: Set UsesDFdy appropriately for GLSL shaders. This patch updates the ir_set_program_inouts_visitor so that it also sets gl_fragment_program::UsesDFdy. This is a bit of a hack (since dFdy() isn't an input or an output), but there's no other obvious visitor to squeeze this functionality into, and it would be silly to create a brand new visitor just for this purpose. v2: use local 'fprog' var to avoid repeated casting. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-19 10:02:21 -07:00
Paul Berry	a0f7b86959	mesa: Set UsesDFdy appropriately for assembly programs. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-19 10:02:19 -07:00
Paul Berry	5e310e9f83	mesa: Add UsesDFdy to struct gl_fragment_program. The i965 back-end needs to compile dFdy() differently for FBOs and window system framebuffers, because Y coordinates are flipped between the two (see commit `82d2596`: i965: Compute dFdy() correctly for FBOs). This boolean will allow it to avoid unnecessarily recompiling shaders that don't use dFdy(). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-19 10:02:01 -07:00
Kenneth Graunke	658a63e5d9	drirc: Add disable_blend_func_extended workaround for Unigine OilRush. The previous commit implemented the workaround, cited a bug report about OilRush, but actually only enabled the workaround for the demos. Turn it on for OilRush too. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50291 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-19 01:40:24 -07:00
Kenneth Graunke	040894391a	i965: Add a driconf option to disable GL_ARB_blend_func_extended. Unigine Heaven (at least) has a bug where it incorrectly uses the GL_ARB_blend_func_extended extension. Dual source blending allows two color outputs per render target; individual shader outputs can be assigned to be either the first or second blending input by setting the 'index' via one of two methods: - An API call: glBindFragDataLocationIndexed() - The GLSL 'layout' qualifier provided by GL_ARB_explicit_attrib_location Both of these only work on user defined fragment shader outputs; it's an error to use either on built-in outputs like gl_FragData. Unigine uses gl_FragData and gl_FragColor exclusively, and doesn't even attempt to use either method to set index == 1. However, it does set the blending function to SRC1 enums, which requires a fragment shader output with index == 1 or else rendering is undefined. In other words, enabling ARB_blend_func_extended causes Unigine to render incorrectly, resulting in an apparent regression, even though our driver code (as far as I can tell) is perfectly fine. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50291 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-19 01:22:34 -07:00
Brian Paul	768be75c44	mesa: remove stale comment	2012-07-18 16:51:47 -06:00
Brian Paul	e4f8d33aea	mesa: use gl_program cast wrappers In a few cases, remove unneeded casts. And fix a few other const-correctness issues. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 16:51:47 -06:00
Brian Paul	1170b5aa9f	mesa: add some gl_program cast wrappers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 16:51:47 -06:00
Marek Olšák	c3c83af380	r600g: setup streamout before calling last r600_need_cs_space before drawing This fixes CS checker errors due to registers not being initialized, because the flush occured after dirty state was emitted but before drawing.	2012-07-18 22:42:58 +02:00
Eric Anholt	a40c1f9522	i965/fs: Make register spill/unspill only do the regs for that instruction. Previously, if we were spilling the result of a texture call, we would store all 4 regs, then for each use of one of those regs as the source of an instruction, we would unspill all 4 regs even though only one was needed. In both lightsmark and l4d2 with my current graphics config, the shaders that produce spilling do so on split GRFs, so this doesn't help them out. However, in a capture of the l4d2 shaders with a different snapshot and playing the game instead of using a demo, it reduced one shader from 2817 instructions to 2179, due to choosing a now-cheaper texture result to spill instead of piles of texcoords. v2: Fix comment noted by Ken, and fix the if condition associated with it for the current state of what constitutes a partial write of the destination. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1)	2012-07-18 12:30:06 -07:00
Eric Anholt	a454f8ec6d	i965/fs.h: Refactor tests for instructions modifying a register. There's one instance of a potential behavior change: propagate_constants may now propagate into a part of a vgrf after a different part of it was overwritten by a send that returns multiple registers. I don't think we ever generate IR that meets that condition, but it's something to note if we bisect behavior change to this. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 12:30:06 -07:00
Eric Anholt	fc01376c50	i965/fs: Replace usage is_tex() with regs_written() checks. In these places, we care about any sort of send that hits more than one reg, not just textures. We don't yet have anything else returning more than one reg, so there's no change. v2: Use mlen instead of is_tex() for the is-it-a-send check. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 12:30:06 -07:00
Eric Anholt	a6411520b4	i965/fs: Rename virtual_grf_next to virtual_grf_count. "count" is a more useful name, since most of the time we're using it for looping over the variables. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 12:30:06 -07:00
Eric Anholt	40cd60a315	i965/fs: Move a block out of a loop in live variables setup. This was accidentally copy-and-pasted inside. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-18 12:30:06 -07:00
Anuj Phogat	cd5cd85a43	i965/msaa: Disable alpha-to-{coverage, one} when drawbuffer zero is in integer format OpenGL specification 3.3 (page 196), section 4.1.3 says: If drawbuffer zero is not NONE and the buffer it references has an integer format, the SAMPLE_ALPHA_TO_COVERAGE and SAMPLE_ALPHA_TO_ONE operations are skipped." This should work properly even if there are other draw buffers that are not in integer format. This patch makes following piglit tests pass on mesa: int-draw-buffers-alpha-to-coverage int-draw-buffers-alpha-to-one Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-07-18 11:54:12 -07:00
Lucas Stach	fb18ec4f27	st/xorg: attach EDID to outputs Allows tools like GNOME's monitor configuration to show meaningful names. v2: fix resource leak Signed-off-by: Lucas Stach <dev@lynxeye.de>	2012-07-18 17:19:16 +02:00
Lucas Stach	9de16ac0a8	st/xorg: remove superfluous memset exaDriverAlloc() uses calloc, which already initialises pExa to zero. Signed-off-by: Lucas Stach <dev@lynxeye.de>	2012-07-18 17:19:07 +02:00
Lucas Stach	70f0eda127	st/xorg: reorder exa context creation and use screen param queries Gives the x-server a more accurate description of the exa hardware capabilities. v2: drop NPOT check Signed-off-by: Lucas Stach <dev@lynxeye.de>	2012-07-18 17:18:55 +02:00
Olivier Galibert	229a1a7e4d	softpipe: Take all lods into account when texture sampling. This patch churns a lot because it needs to change 4-wide filters into single pixel filters, since each fragment may use a different filter. The only case not entirely supported is the anisotropic filtering. Not sure what we want to do there, since a full quad is required by that filter. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-18 08:02:39 -06:00
Marek Olšák	99c65bac34	r600g: implement wait-free buffer transfer for DISCARD_RANGE Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2012-07-18 07:16:30 +02:00
Marek Olšák	8ac9801669	r600g: accelerate buffer copying This will be useful for efficient handling of the DISCARD transfer flags. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2012-07-18 06:32:57 +02:00
Marek Olšák	f237fd431b	r600g: update R600_MAX_DRAW_CS_DWORDS to take draw-opaque into account	2012-07-18 06:25:37 +02:00
Marek Olšák	30257c3291	r600g: move VGT_STRMOUT_DRAW_OPAQUE_OFFSET initialization into invariant state	2012-07-18 06:25:37 +02:00
Marek Olšák	d9ba1b0beb	r600g: only set the index type if drawing is indexed	2012-07-18 06:25:37 +02:00
Marek Olšák	1cfb55c509	r600g: remove debug code for streamout	2012-07-18 06:25:37 +02:00
Marek Olšák	ff9a49328e	r600g: inline r600_context_draw_opaque_count	2012-07-18 06:25:37 +02:00
Marek Olšák	1b699a4832	r600g: fix alphatest without a colorbuffer on evergreen	2012-07-18 06:25:36 +02:00
Marek Olšák	82a1d24175	r600g: fix alphatest without a colorbuffer on r6xx-r7xx	2012-07-18 04:35:38 +02:00
Marek Olšák	de4fd087cb	r600g: always derive alphatest state from the first colorbuffer	2012-07-18 04:17:11 +02:00
Marek Olšák	bc2f5fc01e	r600g: atomize alphatest state	2012-07-18 03:45:25 +02:00
Marek Olšák	5130196c0b	r600g: try to fix line stippling with lineloops The piglit test is failing, but visually it looks almost correct.	2012-07-18 02:17:10 +02:00
Marek Olšák	43e226b6ef	r600g: optimize uploading depth textures Make it only copy the portion of a depth texture being uploaded and not the whole 2D layer. There is also a little code cleanup.	2012-07-18 00:32:50 +02:00
Marek Olšák	b242adbe5c	r600g: remove needless wrapper r600_texture_depth_flush	2012-07-18 00:21:53 +02:00
Marek Olšák	611dd52942	r600g: init_flushed_depth_texture should be able to report errors	2012-07-18 00:21:53 +02:00
Paul Berry	e9b908b014	msaa: Generate proper error for operations prohibited on MSAA buffers. From the GL 3.0 spec, section 4.3.3, in the documentation for CopyPixels(): "An INVALID_OPERATION error will be generated if the object bound to READ_FRAMEBUFFER_BINDING is framebuffer complete and the value of SAMPLE_BUFFERS is greater than zero." The same applies to CopyTexImage...() and CopyTexSubImage...() functions, since they are defined in terms of CopyPixels(). Previously we were generating an INVALID_FRAMEBUFFER_OPERATION error in these cases. Fixes piglit tests "EXT_framebuffer_multisample/negative-{copypixels,copyteximage}". Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 14:40:01 -07:00
Brian Paul	c4d2a14d6e	gallivm: silence uninitialized variable warnings	2012-07-17 14:41:29 -06:00
Marek Olšák	9d699cd845	r600g: fix lockups with and enable dual source blending on evergreen GL_ARB_blend_func_extended is now enabled on all chipsets.	2012-07-17 21:22:15 +02:00
Marek Olšák	c26fadf195	r600g: remove unused code after conversion of sampler views	2012-07-17 21:22:15 +02:00
Marek Olšák	5d8d4252f2	r600g: convert sampler view emission into atoms Vertex and constant buffers are emitted in the same way. This is mainly a simplification of the code. The cleanup is in another patch.	2012-07-17 21:22:15 +02:00
Marek Olšák	7022f49b52	r600g: only make constant buffers dirty if there's something to update	2012-07-17 21:22:15 +02:00
Marek Olšák	80755ff563	r600g: properly track which textures are depth This fixes the issue with have_depth_texture never being set to false.	2012-07-17 21:22:15 +02:00
Marek Olšák	e5de73cafd	r600g: consolidate and optimize sampler states changes for evergreen Only set sampler states which changed.	2012-07-17 21:22:14 +02:00
Marek Olšák	883c43cdd4	r600g: don't invalidate texture caches when setting sampler states Changing sampler states doesn't change resource bindings.	2012-07-17 21:22:14 +02:00
Marek Olšák	ba48f47ebf	r600g: consolidate code for setting sampler views and fix bugs in the process Issues fixed: - set_vs_sampler_views for evergreen is now properly implemented. - Added the missing inval_texture_cache call for evergreen. - have_depth_texture was sometimes incorrectly set to false on evergreen even if there were depth textures in other shader stages. To fix this, set it to true once and never set it to false again. It's stupid, but it matches the r600 code. The proper fix is left to another patch. - Optimizaton: The sampler views which aren't changed aren't updated.	2012-07-17 21:22:14 +02:00
Marek Olšák	d1ca16b273	r600g: remove unused flag have_depth_fb This is a leftover from: commit `fe1fd67556` Author: Marek Olšák <maraeo@gmail.com> Date: Sun Jul 8 03:10:37 2012 +0200 r600g: don't flush depth textures set as colorbuffers	2012-07-17 21:22:14 +02:00
Marek Olšák	585baac652	r600g: do fine-grained vertex buffer updates If only some buffers are changed, the other ones don't have to re-emitted. This uses bitmasks of enabled and dirty buffers just like emit_constant_buffers does.	2012-07-17 21:22:14 +02:00
Marek Olšák	f4f2e8ebe1	r600g: don't call inval_shader_cache in r600_context_flush twice It's already called in r600_constant_buffers_dirty.	2012-07-17 21:22:14 +02:00
Marek Olšák	6694a68d89	gallium/util: add util_bit_last - finds the last bit set in a word	2012-07-17 21:22:14 +02:00
Marek Olšák	018e3f75d6	r600g: fix all failing depth-stencil tests for evergreen	2012-07-17 21:22:14 +02:00
Michel Dänzer	761131ce45	configure.ac: Further LLVM fixups. * Also add mcjit in the non-OpenCL case. * Replace hardcoded llvm-config with $LLVM_CONFIG everywhere. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellad <thomas.stellard@amd.com>	2012-07-17 19:12:01 +02:00
Michel Dänzer	39c4bc7fdf	glsl: Drop obsolete .gitignore entries. Helps spotting and removing the obsolete generated files, which otherwise break the build. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2012-07-17 18:30:32 +02:00
Tom Stellard	ed41a559dc	configure.ac: Add libLLVMMCJIT to the LLVM_LDFLAGS This is neccessary for linking the llvmpipe tests. It appears this dependency was introduced by the "wider native register" changes. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-07-17 12:08:24 -04:00
Eric Anholt	fadc9eaf97	intel: Add a comment explaining why we early return on matching BO names. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:18:08 -07:00
Eric Anholt	2b311fd802	intel: Drop other checks for old loader version. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:18:06 -07:00
Eric Anholt	1b4374d364	intel: Replace the non-getBuffersWithFormat compat path with an error message. It's been broken (using NULL getBuffersWithFormat() instead of getBuffers()) due to a copy and paste error for a year now. GetBuffersWithFormat has been around since 2009, so I don't feel any guilt in not supporting it. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:18:04 -07:00
Eric Anholt	9bbf7c139b	intel: Remove dead intel_framebuffer_has_hiz(). Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:18:02 -07:00
Eric Anholt	bce58e155d	intel: Convert to using private depth/stencil buffers (v2) This means that GLX buffer sharing of these no longer works. On the other hand, just look at this code reduction. v2: - [chad] Fix intelCreateBuffer for gen < 6. When the branch for !screen->hw_has_separate_stencil was taken, intel_create_private_renderbuffer was incorrectly not used. - [chad] Remove all code in intel_process_dri2_buffer for processing depth, stencil, and hiz buffers. That code is now dead. CC: Eric Anholt <eric@anholt.net> Signed-off-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:17:56 -07:00
Eric Anholt	433ff3e16e	intel: Add a function for creating a private window system buffer. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-17 08:17:38 -07:00
Roland Scheidegger	bf484024b9	gallivm: (trivial) remove unnecessary bogus include	2012-07-17 17:11:18 +02:00
Kristian Høgsberg	2023bf996e	gbm: Add gbm_bo_import for gallium gbm backend	2012-07-17 10:54:00 -04:00
Elvis Lee	1f2c87cc8f	st/egl: Fix build for wayland includes common/native_wayland_drm_bufmgr_helper.c fails to find wayland-server.h Signed-off-by: Elvis Lee <kwangwoong.lee@lge.com>	2012-07-17 10:54:00 -04:00
Elvis Lee	23f1e551cc	st/gbm: renaming pitch to stride on gallium commit '7250cd506baa0bd4649b30d87509cdd0cbc06a57' changes struct gbm_bo, renaming it's 'pitch' to 'stride'. This applies to Gallium. Signed-off-by: Elvis Lee <kwangwoong.lee@lge.com>	2012-07-17 10:54:00 -04:00
Matt Turner	f42e601ce0	glx: build tests after libglx.la Previously, if you ran make followed by make check it would work, but if you just ran make check the test program would fail to compile. Reviewed-by: Jon TURNEY <jon.turney@dronecode.org.uk>	2012-07-17 06:59:00 -07:00
José Fonseca	3469715a8a	gallivm,draw,llvmpipe: Support wider native registers. Squashed commit of the following: commit 7acb7b4f60dc505af3dd00dcff744f80315d5b0e Author: José Fonseca <jfonseca@vmware.com> Date: Mon Jul 9 17:46:31 2012 +0100 draw: Don't use dynamically sized arrays. Not supported by MSVC. commit 5810c28c83647612cb372d1e763fd9d7780df3cb Author: José Fonseca <jfonseca@vmware.com> Date: Mon Jul 9 17:44:16 2012 +0100 gallivm,llvmpipe: Don't use expressions with PIPE_ALIGN_VAR(). MSVC doesn't accept exceptions in _declspec(align(...)). Use a define instead. commit 8aafd1457ba572a02b289b3f3411e99a3c056072 Author: José Fonseca <jfonseca@vmware.com> Date: Mon Jul 9 17:41:56 2012 +0100 gallium/util: Make u_cpu_detect.h header C++ safe. commit 5795248350771f899cfbfc1a3a58f1835eb2671d Author: José Fonseca <jfonseca@vmware.com> Date: Mon Jul 2 12:08:01 2012 +0100 gallium/util: Add ULL suffix to large constants. As suggested by Andy Furniss: it looks like some old gcc versions require it. commit 4c66c22727eff92226544c7d43c4eb94de359e10 Author: José Fonseca <jfonseca@vmware.com> Date: Fri Jun 29 13:39:07 2012 +0100 gallium/util: Truly disable INF/NAN tests on MSVC. Thanks to Brian for spotting this. commit 8bce274c7fad578d7eb656d9a1413f5c0844c94e Author: José Fonseca <jfonseca@vmware.com> Date: Fri Jun 29 13:39:07 2012 +0100 gallium/util: Disable INF/NAN tests on MSVC. Somehow they are not recognized as constants. commit 6868649cff8d7fd2e2579c28d0b74ef6dd4f9716 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Jul 5 15:05:24 2012 +0200 gallivm: Cleanup the 2 x 8 float -> 16 ub special path in lp_build_conv. No behaviour change intended, like 7b98455fb40c2df84cfd3cdb1eb7650f67c8a751. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 5147a0949c4407e8bce9e41d9859314b4a9ccf77 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Jul 5 14:28:19 2012 +0200 gallivm: (trivial) fix issues with multiple-of-4 texture fetch Some formats can't handle non-multiple of 4 fetches I believe, but everything must support length 1 and multiples of 4. So avoid going to scalar fetch (which is very costly) just because length isn't 4. Also extend the hack to not use shift with variable count for yuv formats to arbitrary length (larger than 1) - doesn't matter how many elements we have we always want to avoid it unless we have variable shift count instruction (which we should get with avx2). Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 87ebcb1bd71fa4c739451ec8ca89a7f29b168c08 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Jul 4 02:09:55 2012 +0200 gallivm: (trivial) fix typo for wrap repeat mode in linear filtering aos code This would lead to bogus coordinates at the edges. (undetected by piglit because this path is only taken for block-based formats). Signed-off-by: José Fonseca <jfonseca@vmware.com> commit 3a42717101b1619874c8932a580c0b9e6896b557 Author: José Fonseca <jfonseca@vmware.com> Date: Tue Jul 3 19:42:49 2012 +0100 gallivm: Fix TGSI integer translation with AVX. commit d71ff104085c196b16426081098fb0bde128ce4f Author: José Fonseca <jfonseca@vmware.com> Date: Fri Jun 29 15:17:41 2012 +0100 llvmpipe: Fix LLVM JIT linear path. It was not working properly because it was looking at the JIT function before it was actually compiled. Reviewed-by: Roland Scheidegger <sroland@vmware.com> commit a94df0386213e1f5f9a6ed470c535f9688ec0a1b Author: José Fonseca <jfonseca@vmware.com> Date: Thu Jun 28 18:07:10 2012 +0100 gallivm: Refactor lp_build_broadcast(_scalar) to share code. Doesn't really change the generated assembly, but produces more compact IR, and of course, makes code more consistent. Reviewed-by: Brian Paul <brianp@vmware.com> commit 66712ba2731fc029fa246d4fc477d61ab785edb5 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Jun 27 17:30:13 2012 +0100 gallivm: Make LLVMContextRef a singleton. There are any places inside LLVM that depend on it. Too many to attempt to fix. Reviewed-by: Brian Paul <brianp@vmware.com> commit ff5fb7897495ac263f0b069370fab701b70dccef Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Jun 28 18:15:27 2012 +0200 gallivm: don't use 8-wide texture fetch in aos path This appears to be a slight loss usually. There are probably several reasons for that: - fetching itself is scalar - filtering is pure int code hence needs splitting anyway, same for the final texel offset calculations - texture wrap related code, which can be done 8-wide, is slightly more complex with floats (with clamp_to_edge) and float operations generally more costly hence probably not much faster overall - the code needed to split when encountering different mip levels for the quads, adding complexity So, just split always for aos path (but leave it 8-wide for soa, since we do 8-wide filtering there when possible). This should certainly be revisited if we'd have avx2 support. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit ce8032b43dcd8e8d816cbab6428f54b0798f945d Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Jun 27 18:41:19 2012 +0200 gallivm: (trivial) don't extract fparts variable if not needed Did not have any consequences but unnecessary. commit aaa9aaed8f80dc282492f62aa583a7ee23a4c6d5 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Jun 27 18:09:06 2012 +0200 gallivm: fix precision issue in aos linear int wrap code now not just passes at a quick glance but also with piglit... If we do the wrapping with floats, we also need to set the weights accordingly. We can potentially end up with different (integer) coordinates than what the integer calculations would have chosen, which means the integer weights calculated previously in this case are completely wrong. Well at least that's what I think happens, at least recalculating the weights helps. (Some day really should refactor all the wrapping, so we do whatever is fastest independent of 16bit int aos or 32bit float soa filtering.) Reviewed-by: José Fonseca <jfonseca@vmware.com> commit fd6f18588ced7ac8e081892f3bab2916623ad7a2 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Jun 27 11:15:53 2012 +0100 gallium/util: Fix parsing of options with underscore. For example GALLIVM_DEBUG=no_brilinear which was being parsed as two options, "no" and "brilinear". commit 09a8f809088178a03e49e409fa18f1ac89561837 Author: James Benton <jbenton@vmware.com> Date: Tue Jun 26 15:00:14 2012 +0100 gallivm: Added a generic lp_build_print_value which prints a LLVMValueRef. Updated lp_build_printf to share common code. Removed specific lp_build_print_vecX. Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> commit e59bdcc2c075931bfba2a84967a5ecd1dedd6eb0 Author: José Fonseca <jfonseca@vmware.com> Date: Wed May 16 15:00:23 2012 +0100 draw,llvmpipe: Avoid named struct types on LLVM 3.0 and later. Starting with LLVM 3.0, named structures are meant not for debugging, but for recursive data types, previously also known as opaque types. The recursive nature of these types leads to several memory management difficulties. Given that we don't actually need recursive types, avoid them altogether. This is an attempt to address fdo bugs 41791 and 44466. The issue is somewhat random so there's no easy way to check how effective this is. Cherry-picked from `9af1ba565d` commit df6070f618a203c7a876d984c847cde4cbc26bdb Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Jun 27 14:42:53 2012 +0200 gallivm: (trivial) fix typo in faster aos linear int wrap code no longer crashes, now REALLY tested. commit d8f98dce452c867214e6782e86dc08562643c862 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 26 18:20:58 2012 +0200 llvmpipe: (trivial) remove bogus optimization for float aos repeat wrap This optimization for nearest filtering on the linear path generated likely bogus results, and the int path didn't have any optimizations there since the only shader using force_nearest apparently uses clamp_to_edge not repeat wrap anyway. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit c4e271a0631087c795e756a5bb6b046043b5099d Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 26 23:01:52 2012 +0200 gallivm: faster repeat wrap for linear aos path too Even if we already have scaled integer coords, it's way faster to use the original float coord (plus some conversions) rather than use URem. The choice of what to do for texture wrapping is not really tied to int aos or float soa filtering though for some modes there can be some gains (because of easier weight calculations). Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 1174a75b1806e92aee4264ffe0ffe7e70abbbfa3 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 26 14:39:22 2012 +0200 gallivm: improve npot tex wrap repeat in linear soa path URem gets translated into series of scalar divisions so just about anything else is faster. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit f849ffaa499ed96fa0efd3594fce255c7f22891b Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 26 00:40:35 2012 +0100 gallivm: (trivial) fix near-invisible shift-space typo I blame the keyboard. commit 5298a0b19fe672aebeb70964c0797d5921b51cf0 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 25 16:24:28 2012 +0200 gallivm: add new intrinsic helper to deal with arbitrary vector length This helper will split vectors which are too large for the hw, or expand them if they are too small, so a caller of a function using intrinsics which uses such sizes need not split (or expand) the vectors manually and the function will still use the intrinsic instead of dropping back to generic llvm code. It can also accept scalars for use with pseudo-vector intrinsics (only useful for float arguments, all x86 scalar simd float intrinsics use 4vf32). Only used for lp_build_min/max() for now (also added the scalar float case for these while there). (Other basic binary functions could use it easily, whereas functions with a different interface would need different helpers.) Expanding vectors isn't widely used, because we always try to use build contexts with native hw vector sizes. But it might (or not) be nicer if this wouldn't need to be done, the generated code should in theory stay the same (it does get hit by lp_build_rho though already since we didn't have a intrinsic for the scalar lp_build_max case before). v2: incorporated Brian's feedback, and also made the scalar min/max case work instead of crash (all scalar simd float intrinsics take 4vf32 as argument, probably the reason why it wasn't used before). Moved to lp_bld_intr based on José's request, and passing intrinsic size instead of length. Ideally we'd derive the source type info from the passed in llvm value refs and process some llvmtype return type so we could handle intrinsics where the source and destination type isn't the same (like float/int conversions, packing instructions) but that's a bit too complicated for now. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 01aa760b99ec0b2dc8ce57a43650e83f8c1becdf Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 25 16:19:18 2012 +0200 gallivm: (trivial) increase max code size for shader disassembly 64kB was just short of what I needed (which caused a crash) hence increase to 96kB (should probably be smarter about that). commit 74aa739138d981311ce13076388382b5e89c6562 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 25 11:53:29 2012 +0100 gallivm: simplify aos float tex wrap repeat nearest just handle pot and npot the same. The previous pot handling ended up with exactly the same instructions plus 2 more (leave it in the soa path though since it is probably still cheaper there). While here also fix a issue which would cause a crash after an assert. commit 0e1e755645e9e49cfaa2025191e3245ccd723564 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 25 11:29:24 2012 +0100 gallivm: (trivial) skip floor rounding in ifloor when not signed This was only done for the non-sse41 case before, but even with sse41 this is obviously unnecessary (some callers already call itrunc in this case anyway but some might not). commit 7f01a62f27dcb1d52597b24825931e88bae76f33 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 25 11:23:12 2012 +0100 gallivm: (trivial) fix bogus comments commit 5c85be25fd82e28490274c468ce7f3e6e8c1d416 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Jun 20 11:51:57 2012 +0100 translate: Free elt8_func/elt16_func too. These were leaking. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> commit 0ad498f36fb6f7458c7cffa73b6598adceee0a6c Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 19 15:55:34 2012 +0200 gallivm: fix bug for tex wrap repeat with linear sampling in aos float path The comparison needs to be against length not length_minus_one, otherwise the max texel is never chosen (for the second coordinate). Fixes piglit texwrap-1D-npot-proj (and 2D/3D versions). Reviewed-by: José Fonseca <jfonseca@vmware.com> commit d1ad65937c5b76407dc2499b7b774ab59341209e Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Jun 19 16:13:43 2012 +0200 gallivm: simplify soa tex wrap repeat with npot textures and no mip filtering Similar to what is already done in aos sampling for the float path (but not the int path since we don't get normalized float coordinates there). URem is expensive and the calculation is done trivially with normalized floats instead (at least with sse41-capable cpus). (Some day should probably do the same for the mip filter path but it's much more complicated there hence the gain is smaller.) Reviewed-by: José Fonseca <jfonseca@vmware.com> commit e1e23f57ba9b910295c306d148f15643acc3fc83 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 18 20:38:56 2012 +0200 llvmpipe: (trivial) remove duplicated function declaration Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 07ca57eb09e04c48a157733255427ef5de620861 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 18 20:37:34 2012 +0200 llvmpipe: destroy setup variants on context destruction lp_delete_setup_variants() used to be called in garbage collection, but this no longer exists hence the setup shaders never got freed. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit ed0003c633859a45f9963a479f4c15ae0ef1dca3 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 18 16:25:29 2012 +0100 gallivm: handle different ilod parts for multiple quad sampling This fixes filtering when the integer part of the lod is not the same for all quads. I'm not fully convinced of that solution yet as it just splits the vector if the levels to be sampled from are different. But otherwise we'd need to do things like some minify steps, and getting mip level base address separately anyway hence it wouldn't really look like much of a win (and making the code even more complex). This should now give identical results to single quad sampling. commit 8580ac4cfc43a64df55e84ac71ce1a774d33c0d2 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Jun 14 18:14:47 2012 +0200 gallivm: de-duplicate sample code common to soa and aos sampling There doesn't seem to be any reason why this code dealing with cube face selection, lod and mip level calculation is separate in aos and soa sampling, and I am sick of having it to change in both places. commit fb541e5f957408ce305b272100196f1e12e5b1e8 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Jun 14 18:15:41 2012 +0200 gallivm: do mip filtering with per quad lod_fpart This gives better results for mip filtering, though the generated code might not be optimal. For now it also creates some artifacts if the lod_ipart isn't the same for all quads, since instead of using the same mip weight for all quads as previously (which just caused non-smooth gradients) this now will use the right weights but with the wrong mip level in this case (can easily be seen with things like texfilt, mipmap_tunnel). v2: use logic helper suggested by José, and fix issue with negative lod_fpart values commit f1cc84eef7d826a20fab6cd8ccef9a275ff78967 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Jun 13 18:35:25 2012 +0200 gallivm: (trivial) fix bogus assert in lp_build_unpack_broadcast_aos_scalars commit 7c17dbae8ae290df9ce0f50781a09e8ed640c044 Author: James Benton <jbenton@vmware.com> Date: Tue Jun 12 12:11:14 2012 +0100 util: Reimplement half <-> float conversions. Removed u_half.py used to generate the table for previous method. Previous implementation of float to half conversion was faulty for denormalised and NaNs and would require extra logic to fix, thus making the speedup of using tables irrelevant. commit 7762f59274070e1dd4b546f5cb431c2eb71ae5c3 Author: James Benton <jbenton@vmware.com> Date: Tue Jun 12 12:12:16 2012 +0100 tests: Updated tests to properly handle NaN for half floats. commit fa94c135aea5911fd93d5dfb6e6f157fb40dce5e Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 11 18:33:10 2012 +0200 gallivm: do mip level calculations per quad This is the final piece which shouldn't change the rendering output yet. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 23cbeaddfe03c09ca18c45d28955515317ffcf4c Author: Roland Scheidegger <sroland@vmware.com> Date: Sat Jun 9 00:54:21 2012 +0200 gallivm: do per-quad cube face selection Doesn't quite fix the piglit cubemap test (not sure why actually) but doing per-quad face selection is doing the right thing and definitely an improvement. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit abfb372b3702ac97ac8b5aa80ad1b94a2cc39d33 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Jun 11 18:22:59 2012 +0200 gallivm: do all lod calculations per quad Still no functional change but lod is now converted to scalar after lod calculations. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 519368632747ae03feb5bca9c655eccbc5b751b4 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 16:46:10 2012 +0100 gallivm: Added support for half-float to float conversion in lp_build_conv. Updated various utility functions to support this change. commit 135b4d683a4c95f7577ba27b9bffa4a6fbd2c2e7 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 16:02:46 2012 +0100 gallivm: Added function for half-float to float conversion. Updated lp_build_format_aos_array to support half-float source. commit 37d648827406a20c5007abeb177698723ed86673 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 14:55:18 2012 +0100 util: Updated u_format_tests to rigidly test half-float boundary values. commit 2ad18165d96e578aa9046df7c93cb1c3284d8c6b Author: James Benton <jbenton@vmware.com> Date: Tue May 22 14:54:16 2012 +0100 llvmpipe: Updated lp_test_format to properly handle Inf/NaN results. commit 78740acf25aeba8a7d146493dd5c966e22c27b73 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 14:53:30 2012 +0100 util: Added functions for checking NaN / Inf for double and half-floats. commit 35e9f640ae01241f9e0d67fe893bbbf564c05809 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu May 24 21:05:13 2012 +0200 gallivm: Fix calculating rho for 3d textures for the single-quad case Discovered by accident, this looks like a very old typo bug. commit fc1220c636326536fd0541913154e62afa7cd1d8 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu May 24 21:04:59 2012 +0200 gallivm: do calcs per-quad in lp_build_rho Still convert to scalar at the end of the function. commit 50a887ffc550bf310a6988fa2cea5c24d38c1a41 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon May 21 23:21:50 2012 +0200 gallivm: (trivial) return scalar in lp_build_extract_range for length 1 vectors Our type system on top of llvm's one doesn't generally support vectors of length 1, instead using scalars. So we should return a scalar from this function instead of having to bitcast the vector with length 1 later elsewhere. commit 80c71c621f9391f0f9230460198d861643324876 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 17:49:15 2012 +0100 draw: Fixed bad merge error commit c47401cfad0c9167de20ff560654f533579f452c Author: James Benton <jbenton@vmware.com> Date: Tue May 22 15:29:30 2012 +0100 draw: Updated store_clip to store whole vectors instead of individual elements. commit 2d9c1ad74b0b0b41861fffcecde39f09cc27f1cf Author: James Benton <jbenton@vmware.com> Date: Tue May 22 15:28:32 2012 +0100 gallivm: Added lp_build_fetch_rgba_aos_array. A version of lp_build_fetch_rgba_aos which is targeted at simple array formats. Reads the whole vector from memory in one, instead of reading each element individually. Tested with mesa tests and demos. commit ff7805dc2b6ef6d8b11ec4e54aab1633aef29ac8 Author: James Benton <jbenton@vmware.com> Date: Tue May 22 15:27:40 2012 +0100 gallivm: Added lp_build_pad_vector. This function pads a vector with undef to a desired length. commit 701f50acef24a2791dabf4730e5b5687d6eb875d Author: James Benton <jbenton@vmware.com> Date: Fri May 18 17:27:19 2012 +0100 util: Added util_format_is_array. This function checks whether a format description is in a simple array format. commit 5e0a7fa543dcd009de26f34a7926674190fa6246 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 19:13:47 2012 +0100 draw: Removed draw_llvm_translate_from and draw/draw_llvm_translate.c. This is "replaced" by adding an optimised path in lp_build_fetch_rgba_aos in an upcoming patch. commit 8c886d6a7dd3fb464ecf031de6f747cb33e5361d Author: James Benton <jbenton@vmware.com> Date: Wed May 16 15:02:31 2012 +0100 draw: Modified store_aos to write the vector as one, not individual elements. commit 37337f3d657e21dfd662c7b26d61cb0f8cfa6f17 Author: James Benton <jbenton@vmware.com> Date: Wed May 16 14:16:23 2012 +0100 draw: Changed aos_to_soa to use lp_build_transpose_aos. commit bd2b69ce5d5c94b067944d1dcd5df9f8e84548f1 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 19:14:27 2012 +0100 draw: Changed soa_to_aos to use lp_build_transpose_aos. commit 0b98a950d29a116e82ce31dfe7b82cdadb632f2b Author: James Benton <jbenton@vmware.com> Date: Fri May 18 18:57:45 2012 +0100 gallivm: Added lp_build_transpose_aos which converts between aos and soa. commit 69ea84531ad46fd145eb619ed1cedbe97dde7cb5 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 18:57:01 2012 +0100 gallivm: Added lp_build_interleave2_half aimed at AVX unpack instructions. commit 7a4cb1349dd35c18144ad5934525cfb9436792f9 Author: José Fonseca <jfonseca@vmware.com> Date: Tue May 22 11:54:14 2012 +0100 gallivm: Fix build on Windows. MC-JIT not yet supported there. Reviewed-by: Roland Scheidegger <sroland@vmware.com> commit afd105fc16bb75d874e418046b80d9cc578818a1 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:17:26 2012 +0100 llvmpipe: Added a error counter to lp_test_conv. Useful for keeping track of progress when fixing errors! Signed-off-by: José Fonseca <jfonseca@vmware.com> commit b644907d08c10a805657841330fc23db3963d59c Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:16:46 2012 +0100 llvmpipe: Changed known failures in lp_test_conv. To comply with the recent fixes to lp_bld_conv. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit d7061507bd94f6468581e218e61261b79c760d4f Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:14:38 2012 +0100 llvmpipe: Added fixed point types tests to lp_test_conv. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit 146b3ea39b4726dbe125ac666bd8902ea3d6ca8c Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:26:35 2012 +0100 llvmpipe: Changed lp_test_conv src/dst alignment to be correct. Now based on the define rather than a fixed number. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit f3b57441f834833a4b142a951eb98df0aa874536 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:06:44 2012 +0100 gallivm: Fixed erroneous optimisation in lp_build_min/max. Previously assumed normalised was 0 to 1, but it can be -1 to 1 if type is signed. Tested with lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit a0613382e5a215cd146bb277646a6b394d376ae4 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:04:49 2012 +0100 gallivm: Compensate for lp_const_offset in lp_build_conv. Fixing a /FIXME/ to remove errors in integer conversion in lp_build_conv. Tested using lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit a3d2bf15ea345bc8a0664f8f441276fd566566f3 Author: James Benton <jbenton@vmware.com> Date: Fri May 18 16:01:25 2012 +0100 gallivm: Fixed overflow in lp_build_clamped_float_to_unsigned_norm. Tested with lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit e7b1e76fe237613731fa6003b5e1601a2e506207 Author: José Fonseca <jfonseca@vmware.com> Date: Mon May 21 20:07:51 2012 +0100 gallivm: Fix build with LLVM 2.6 Trivial, and useful. commit d3c6bbe5c7f5ba1976710831281ab1b6a631082d Author: José Fonseca <jfonseca@vmware.com> Date: Tue May 15 17:15:59 2012 +0100 gallivm: Enable MCJIT/AVX with vanilla LLVM 3.1. Add the necessary C++ glue, so that we don't need any modifications to the soon to be released LLVM 3.1. Reviewed-by: Roland Scheidegger <sroland@vmware.com> commit 724a019a14d40fdbed21759a204a2bec8a315636 Author: José Fonseca <jfonseca@vmware.com> Date: Mon May 14 22:04:06 2012 +0100 gallivm: Use HAVE_LLVM 0x0301 consistently. commit af6991e2a3868e40ad599b46278551b794839748 Author: José Fonseca <jfonseca@vmware.com> Date: Mon May 14 21:49:06 2012 +0100 gallivm: Add MCRegisterInfo.h to silence benign warnings about missing implementation. Trivial. commit 6f8a1d75458daae2503a86c6b030ecc4bb494e23 Author: Vinson Lee <vlee@freedesktop.org> Date: Mon Apr 2 22:14:15 2012 -0700 gallivm: Pass in a MCInstrInfo to createMCInstPrinter on llvm-3.1. llvm-3.1svn r153860 makes MCInstrInfo available to the MCInstPrinter. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com> commit 62555b6ed8760545794f83064e27cddcb3ce5284 Author: Vinson Lee <vlee@freedesktop.org> Date: Tue Mar 27 21:51:17 2012 -0700 gallivm: Fix method overriding in raw_debug_ostream. Use matching type qualifers to avoid method hiding. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com> commit 6a9bd784f4ac68ad0a731dcd39e5a3c39989f2be Author: Vinson Lee <vlee@freedesktop.org> Date: Tue Mar 13 22:40:52 2012 -0700 gallivm: Fix createOProfileJITEventListener namespace with llvm-3.1. llvm-3.1svn r152620 refactored the OProfile profiling code. createOProfileJITEventListener was moved from the llvm namespace to the llvm::JITEventListener namespace. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com> commit b674955d39adae272a779be85aa1bd665de24e3e Author: Vinson Lee <vlee@freedesktop.org> Date: Mon Mar 5 22:00:40 2012 -0800 gallivm: Pass in a MCRegisterInfo to MCInstPrinter on llvm-3.1. llvm-3.1svn r152043 changes createMCInstPrinter to take an additional MCRegisterInfo argument. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com> commit 11ab69971a8a31c62f6de74905dbf8c02884599f Author: Vinson Lee <vlee@freedesktop.org> Date: Wed Feb 29 21:20:53 2012 -0800 Revert "gallivm: Change getExtent and readByte to non-const with llvm-3.1." This reverts commit `d5a6c17254`. llvm-3.1svn r151687 makes MemoryObject accessor members const again. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com> commit 339960c82d2a9f5c928ee9035ed31dadb7f45537 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon May 14 16:19:56 2012 +0200 gallivm: (trivial) fix assertion failure for mipmapped 1d textures In lp_build_rho, we may end up with a 1-element vector (for mipmapped 1d textures), but in this case we require the type to be a non-vector type, so need a cast. commit 9d73edb727bd6d196030dc3026b7bf0c574b3e19 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu May 10 18:12:07 2012 +0200 gallivm: prepare for per-quad lod calculations for large vectors to be able to handle multiple quads at once in texture sampling and still do lod calculations per quad, it is necessary to get the per-quad derivatives into the lp_build_rho function. Until now these derivative values were just scalars, which isn't going to work. So we now use vectors, and since the interface needs to change we also do some different (slightly more efficient) packing of the values. For 8-wide vectors the packed derivative values for 3 coords would look like this, this scales to a arbitrary (multiple of 4) vector size: ds1dx ds1dy dt1dx dt1dy ds2dx ds2dy dt2dx dt2dy dr1dx dr1dy _____ _____ dr2dx dr2dy _____ _____ The second vector will be unused for 1d and 2d textures. To facilitate future changes the derivative values are put into a struct, since quite some functions just pass these values through. The generated code seems to be very slightly better for 2d textures (with 4-wide vectors) than before with sse2 (if you have a cpu with physical 128bit simd units - otherwise it's probably not a win). v2: suggestions from José, rename variables, add comments, use swizzle helper commit 0aa21de0d31466dac77b05c97005722e902517b8 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu May 10 18:10:31 2012 +0200 gallivm: add undefined swizzle handling to lp_build_swizzle_aos This is useful for vectors with "holes", it lets llvm choose the most efficient shuffle instructions if some elements aren't needed without having to worry what elements to manually pick otherwise. commit 00faf3f370e7ce92f5ef51002b0ea42ef856e181 Author: José Fonseca <jfonseca@vmware.com> Date: Fri May 4 17:25:16 2012 +0100 gallivm: Get the LLVM IR optimization passes before JIT compilation. MC-JIT engine compiles the module immediately on creation, so the optimization passes were being run too late. So now we create a target data layout from a string, that matches the ABI parameters reported by the compiler. The backend optimization passes were always been run, so the performance improvement is modest (3% on multiarb mesa demo). Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> commit 40a43f4e2ce3074b5ce9027179d657ebba68800a Author: Roland Scheidegger <sroland@vmware.com> Date: Wed May 2 16:03:54 2012 +0200 gallivm: (trivial) fix wrong define used in lp_build_pack2 should fix stack-smashing crashes. commit e6371d0f4dffad4eb3b7a9d906c23f1c88a2ab9e Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Apr 30 21:25:29 2012 +0200 gallivm: add perf warnings when not using intrinsics with 256bit vectors Helper functions using integer sse2 intrinsics could split the vectors with AVX instead of using generic fallback (which should be faster). We don't actually expect to hit these paths (hence don't fix them up to actually do the vector splitting) so just emit warnings (for those functions where it's obvious doing split/intrinsic is faster than using generic path). Only emit warnings for 256bit vectors since we _really_ don't expect to hit arbitrary large vectors which would affect a lot more functions. The warnings do not actually depend on avx since the same logic applies to plain sse2 too (but of course again there's _really_ no reason we should hit these functions with 256bit vectors without avx). commit 8a9ea701ea7295181e846c6383bf66a5f5e47637 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue May 1 20:37:07 2012 +0200 gallivm: split vectors manually for avx in lp_build_pack2 (v2) There's 2 reasons for this: First, there's a llvm bug (fixed in 3.1) which generates tons of byte inserts/extracts otherwise, and second, more importantly, we want to use pack intrinsics instead of shuffles. We do this in lp_build_pack2 and not the calling code (aos sample path) because potentially other callers might find that useful too, even if for larger sequences of code using non-native vector sizes it might be better to manually split vectors. This should boost texture performance in the aos path considerably. v2: fix issues with intrinsics types with old llvm commit 27ac5b48fa1f2ea3efeb5248e2ce32264aba466e Author: Roland Scheidegger <sroland@vmware.com> Date: Tue May 1 20:26:22 2012 +0200 llvmpipe: refactor lp_build_pack2 (v2) prettify, and it's unnecessary to assert when there's no intrinsic due to unsupported bit width - the shuffle path will work regardless. In contrast lp_build_packs2, should only rely on lp_build_pack2 doing the clamping for element sizes for which there is a sse2 intrinsic. v2: fix bug spotted by Jose regarding the intrinsic type for packusdw on old llvm versions. commit ddf279031f0111de4b18eaf783bdc0a1e47813c8 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue May 1 20:13:59 2012 +0200 gallivm: add src width check in lp_build_packs2() not doing so would skip clamping even if no sse2 pack instruction is available, which is incorrect (in theory only, such widths would also always hit a (unnecessary) assertion in lp_build_pack2(). commit e7f0ad7fe079975eae7712a6e0c54be4fae0114b Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Apr 27 15:57:00 2012 +0200 gallivm: (trivial) fix crash-causing typo for npot textures with avx commit 28a9d7f6f655b6ec508c8a3aa6ffefc1e79793a0 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Apr 25 19:38:45 2012 +0200 gallivm: (trivial) remove code mistakenly added twice. commit d5926537316f8ff67ad0a52e7242f7c5478d919b Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Apr 24 21:16:15 2012 +0200 gallivm: add a new avx aos sample path (v2) Try to avoid mixing float and int address calculations. This does texture wrap modes with floats, and then the offset calculations still with ints (because of lack of precision with floats, though we could do some effort to make it work with not too large (16MB) textures). This also handles wrap repeat mode with npot-sized textures differently than either the old soa or aos int path (likely way faster but untested). Otherwise the actual address wrap code is largely similar to the soa path (not quite the same as this one also has some int code), it should get used by avx soa sampling later as well but doesn't handle more complex address modes yet (this will also have the benefit that we can use aos sampling path for all texture address modes). Generated code for that looks reasonable, but still does not split vectors explicitly for fetch/filter which means still get hit by llvm (fixed upstream) which generates hundreds of pinsrb/pextrb instead of two shuffles. It is not obvious though if it's much of a win over just doing address calcs 4-wide but with ints, even if it is definitely much less instructions on avx. piglit's texwrap seems to look exactly the same but doesn't test neither the non-normalized nor the npot cases. v2: fix comments, prettify based on Brian's and Jose's feedback. commit bffecd22dea66fb416ecff8cffd10dd4bdb73fce Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Apr 19 01:58:29 2012 +0200 gallivm: refactor aos lp_build_sample_image_nearest/linear split them up to separate address calculations and fetching/filtering. Need this for being able to do 8-wide float address calcs and 4-wide fetch/filter later (for avx). Plus the functions were very big scary monsters anyway (in particular lp_build_sample_image_linear). commit a80b325c57529adddcfa367f96f03557725c4773 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Apr 16 17:17:18 2012 +0200 gallivm: fix lp_build_resize when truncating width but expanding vector size Missed this case which I thought was impossible - the assertion for it was right after the division by zero... (AoS) texture sampling may ask us to do this, for things like 8 4x32int vectors to 1 32x8int vector conversion (eventually, we probably don't want this to happen). commit f9c8337caa3eb185830d18bce8b95676a065b1d7 Author: Roland Scheidegger <sroland@vmware.com> Date: Sat Apr 14 18:00:59 2012 +0200 gallivm: fix cube maps with larger vectors This makes the branchless cube face selection code work with larger vectors. Because the complexity is quite high (cannot really be improved it seems, per-face selection would reduce complexity a lot but this leads to errors unless the derivatives are calculated all from the same face which almost doubles the work to be done) it is still slower than the branching version, hence only enable this with large vectors. It doesn't actually do per-quad face selection yet (only makes sense with matching lod selection, in fact it will select the same face for all pixels based on the average of the first four pixels for now) but only different shuffles are required to make it work (the branching version actually should work with larger vectors too now thanks to the improved horizontal add but of course it cannot be extended to really select the face per-quad unless doing branching per quad). commit 7780c58869fc9a00af4f23209902db7e058e8a66 Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 30 21:11:12 2012 +0100 llvmpipe: (trivial) fix compiler warning and also clarify comment regarding availability of popcnt instruction. commit a266dccf477df6d29a611154e988e8895892277e Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 30 14:21:07 2012 +0100 gallivm: remove unneeded members in lp_build_sample_context Minor cleanup, the texture width, height, depth aren't accessed in their scalar form anywhere. Makes it more obvious those values should probably be fetched already vectorized (but this requires more invasive changes)... commit b678c57fb474e14f05e25658c829fc04d2792fff Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Mar 29 15:53:55 2012 +0100 gallivm: add a helper for concatenating vectors Similar to the extract_range helper intended to get around slow code generated by llvm for 128bit insertelements. Concatenating two 128bit vectors this way will result in a single vinsertf128 operation rather than two 64bit stores plus one 128bit load, though it might be mildly useful for other purposes as well. commit 415ff228bcd0cf5e44a4c15350a661f0f5520029 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 28 19:41:15 2012 +0100 gallivm: add a custom 2x8f->1x16ub avx conversion path Similar to the existing 4x4f->1x16ub sse2 path, shaves off a couple instructions (min/max mostly) because it relies on pack intrinsics clamping. commit 78c08fc89f8fbcc6dba09779981b1e873e2a0299 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 28 18:44:07 2012 +0100 gallivm: add avx arithmetic intrinsics Add all avx intrinsics for arithmetic functions (with the exception of the horizontal add function which needs another look). Seems to pass basic tests. Reviewed-by: José Fonseca <jfonseca@vmware.com> commit a586caa2800aa5ce54c173f7c0d4fc48153dbc4e Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 28 15:31:35 2012 +0100 gallivm: add avx logic intrinsics Add the blend intrinsics for 8-wide float and 4-wide double vectors. Since we lack 256bit int instructions these are used for int vectors as well, though obviously not for byte or word element values. The comparison intrinsics aren't extended for avx since these are only used for pre-2.7 llvm versions. commit 70275e4c13c89315fc2560a4c488c0e6935d5caf Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 28 00:40:53 2012 +0100 gallivm: new helper function for extract shuffles. Based on José's idea as we can need that in a couple places. Note that such shuffles should not be used lightly, since data layout of <4 x i8> is different to <16 x i8> for instance, hence might cause data rearrangement. commit 4d586dbae1b0c55915dda1759d2faea631c0a1c2 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 27 18:27:25 2012 +0100 gallivm: (trivial) don't overallocate shuffle variable using wrong define meant huge array... commit 06b0ec1f6d665d98c135f9573ddf4ba04b2121ad Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 27 17:54:20 2012 +0100 gallivm: don't do per-element extract/insert for vector element resize Instead of doing per-element extract/insert if the src vectors and dst vector differ in total size (which generates atrocious code) first change the src vectors size by using shuffles to destination vector size. We can still do better than that on AVX for packing to color buffer (by exploiting pack intrinsics characteristics hence eleminating the need for some clamps) but this already generates much better code. v2: incorporate feedback from José, Keith and use shuffle instead of bitcasts/extracts. Due to llvm deficiencies the latter cause all data to get moved to GPRs and back in pieces (even though the data in the regs actually stays the same...). commit c9970d70e05f95d3f52fe7d2cd794176a52693aa Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 23 19:33:19 2012 +0000 gallivm: fix bug in simple position interpolation Accidental use of position attribute instead of just pixel coordinates. Caused failures in piglit glsl-fs-ceil and glsl-fs-floor. commit d0b6fcdb008d04d7f73d3d725615321544da5a7e Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 23 15:31:14 2012 +0000 gallivm: fix emission of ceil opcode lp_build_ceil seems more appropriate than lp_build_trunc. This seems to be never hit though someone performs some ceil to floor magic. commit d97fafed7e62ffa6bf76560a92ea246a1a26d256 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Mar 22 11:46:52 2012 +0000 gallivm: new vectorized path for cubemap calculations should be faster when adapted to multiple quads as only selection masks need to be different. The code is more or less a per-pixel version adapted to only do it per quad. A per pixel version would be much simpler (could drop 2 selects, 6 broadcasts and the messy horizontal add of 3 vectors at the expense of only 2 more absolute value instructions - would also just work for arbitary large vectors). This version doesn't yet work with larger vectors because the horizontal add isn't adjusted to be able to work with 2x4 vectors (and also because face selection wouldn't be done per quad just per block though that would be only a correctness issue just as with lod selection). The downside is this code is quite a bit slower. On a Core2 it can be sped up by disabling the hw blend instructions for selection and using logicop fallbacks instead, but it is still slower than the old code, hence leave that in for now. Probably will chose one or the other version based on vector length in the end. commit b375fbb18a3fd46859b7fdd42f3e9908ea4ff9a3 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 21 14:42:29 2012 +0000 gallivm: fix optimized occlusion query intrinsic name commit a9ba0a3b611e48efbb0e79eb09caa85033dbe9a2 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Mar 21 16:19:43 2012 +0000 draw,gallivm,llvmpipe: Call gallivm_verify_function everywhere. commit f94c2238d2bc7383e088b8845b7410439a602071 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 20 18:54:10 2012 +0000 gallivm: optimize calculations for cube maps a bit this does some more vectorized calculations and uses horizontal adds if possible. A definite win with sse3 otherwise it doesn't seem to make much of a difference. In any case this is arithmetically identical, cannot handle larger vectors. Should be useful as a reference point against larger vector version later... commit 21a2c1cf3c8e1ac648ff49e59fdc0e3be77e2ebb Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 20 15:16:27 2012 +0000 llvmpipe: slight optimization of occlusion queries using movmskps when available. While this is slightly better for cpus without popcnt we should really sum the vectors ourselves (it is also possible to cast to i4 before doing the popcnt but that doesn't help that much neither since llvm is using some optimized popcnt version for i32) commit 5ab5a35f216619bcdf55eed52b0db275c4a06c1b Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 20 13:32:11 2012 +0000 llvmpipe: fix occlusion queries with larger vectors need to adjust casts etc. commit ff95e6fdf5f16d4ef999ffcf05ea6e8c7160b0d5 Author: José Fonseca <jfonseca@vmware.com> Date: Mon Mar 19 20:15:25 2012 +0000 gallivm: Restore optimization passes. commit 57b05b4b36451e351659e98946dae27be0959832 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 19:34:22 2012 +0000 llvmpipe: use existing min2 macro commit bc9a20e19b4f600a439f45679451f2e87cd4b299 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 19:07:27 2012 +0000 llvmpipe: add some safeguards against really large vectors As per José's suggestion, prevent things from blowing up if some cpu would have 1024bit or larger vectors. commit 0e2b525e5ca1c5bbaa63158bde52ad1c1564a3a9 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 18:31:08 2012 +0000 llvmpipe: fix mask generation for uberwide vectors this was the only piece preventing 16-wide vectors from working (apart from the LP_MAX_VECTOR_WIDTH define that is), which is the maximum as we don't get more pixels in the fragment shader at once. Hence adjust that so things could be tested properly with that size even though there seems to be no practical value. commit 3c8334162211c97f3a11c7f64e9e5a2a91ad9656 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 18:19:41 2012 +0000 llvmpipe: fix the simple interpolation method with larger vectors so both methods actually _really_ work now. Makes textures look nice with larger vectors... commit 1cb0464ef8871be1778d43b0c56adf9c06843e2d Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 17:26:35 2012 +0000 llvmpipe: fix mask generation and position interpolation with 8-wide vectors trivial bugs, with these things start to look somewhat reasonable. Textures though have some swizzling issues it seems. commit 168277a63ef5b72542cf063c337f2d701053ff4b Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 16:04:03 2012 +0000 llvmpipe: don't overallocate variables we never have more than 16 (stamp size) / 4 (minimum possible vector size). (With larger vectors those variables are still overallocated a bit.) commit 409b54b30f81ed0aa9ed0b01affe15c72de9abd2 Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 15:56:48 2012 +0000 llvmpipe: add some 32f8 formats to lp_test_conv Also add the ability to handle different sized vectors. commit 55dcd3af8366ebdac0af3cdb22c2588f24aa18ce Author: Roland Scheidegger <sroland@vmware.com> Date: Mon Mar 19 15:47:27 2012 +0000 gallivm: handle different sized vectors in conversion / pack only fully generic path for now (extract/insert per element). commit 9c040f78c54575fcd94a8808216cf415fe8868f6 Author: Roland Scheidegger <sroland@vmware.com> Date: Sun Mar 18 00:58:28 2012 +0100 llvmpipe: fix harmless use of unitialized values commit 551e9d5468b92fc7d5aa2265db9a52bb1e368a36 Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 16 23:31:21 2012 +0100 gallivm: drop special path in extract_broadcast with different sized vectors Not needed, llvm can handle shuffles with different sized result vector just fine. Should hopefully generate the same code in the end, but simpler IR. commit 44da531119ffa07a421eaa041f63607cec88f6f8 Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 16 23:28:49 2012 +0100 llvmpipe: adapt interpolation for handling multiple quads at once this is still WIP there are actually two methods possible not quite sure what makes the most sense, so there's code for both for now: 1) the iterative method as used before (compute attrib values at upper left corner of stamp and upper left corner of each quad initially). It is improved to handle more than one quad at once, and also do some more vectorized calculations initially for slightly better code - newer cpus have full throughput with 4 wide float vectors, hence don't try to code up a path which might be faster if there's just one channel active per attribute. 2) just do straight interpolation for each pixel. Method 2) is more work per quad, but less initially - if all quads are executed significantly more overall though. But this might change with larger vector lengths. This method would also be needed if we'd do some kind of active quad merging when operating on multiple quads at once. This path contains some hack to force llvm to generate better code, it is still far from ideal though, still generates far too many unnecessary register spills/reloads. Both methods should work with different sized vectors. Not very well tested yet, still seems to work with four-wide vectors, need changes elsewhere to be able to test with wider vectors. commit be5d3e82e2fe14ad0a46529ab79f65bf2276cd28 Author: José Fonseca <jfonseca@vmware.com> Date: Fri Mar 16 20:59:37 2012 +0000 draw: Cleanup. commit f85bc12c7fbacb3de2a94e88c6cd2d5ee0ec0e8d Author: José Fonseca <jfonseca@vmware.com> Date: Fri Mar 16 20:43:30 2012 +0000 gallivm: More module compilation refactoring. commit d76f093198f2a06a93b2204857e6fea5fd0b3ece Author: José Fonseca <jfonseca@vmware.com> Date: Thu Mar 15 21:29:11 2012 +0000 llvmpipe: Use gallivm_compile/free_function() in linear code. Should had been done before. commit 122e1adb613ce083ad739b153ced1cde61dfc8c0 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 13 14:47:10 2012 +0100 llvmpipe: generate partial pixel mask for multiple quads still works with one quad, cannot be tested yet with more At least for now always fixed order with multiple quads. commit 4c4f15081d75ed585a01392cd2dcce0ad10e0ea8 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Mar 8 22:09:24 2012 +0100 llvmpipe: refactor state setup a bit Refactor to make it easier to emit (and potentially later fetch in fs) coefficients for multiple attributes at once. Need to think more about how to make this actually happen however, the problem is different attributes can have different interpolation modes, requiring different handling in both setup and fs (though linear and perspective handling is close). commit 9363e49722ff47094d688a4be6f015a03fba9c79 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Mar 8 19:23:23 2012 +0100 llvmpipe: vectorize tri offset calc cuts number of instructions in quad-offset-factor from 107 to 75. This code actually duplicated the (scalar) code calculating the determinant except it used different vertex order (leading to different sign but it doesn't matter) hence llvm could not have figured out it's the same (of course with determinant vectorized in the other place that wouldn't have worked any longer neither). Note this particular piece doesn't actually vectorize well, not many arithmetic instructions left but tons of shuffle instructions... Probably would need to work on n tris at a time for better vectorization. commit 63169dcb9dd445c94605625bf86d85306e2b4297 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Mar 8 03:11:37 2012 +0100 llvmpipe: vectorize some scalar code in setup reduces number of arithmetic instructions, and avoids loading vector x,y values twice (once as scalars once as vectors). Results in a reduction of instructions from 76 to 64 in fs setup for glxgears (16%) on a cpu with sse41. Since this code uses vec2 disguised as vec4, on old cpus which had physical 64bit sse units (pre-Core2) it probably is less of a win in practice (and if you have no vectors you can only hope llvm eliminates the arithmetic for unneeded elements). commit 732ecb877f951ab89bf503ac5e35ab8d838b58a1 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Mar 7 00:32:24 2012 +0100 draw: fix clipping bug introduced by 4822fea3f0440b5205e957cd303838c3b128419c broke clipping pretty badly (verified with lineclip test) commit ef5d90b86d624c152d200c7c4056f47c3c6d2688 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 6 23:38:59 2012 +0100 draw: don't store vertex header per attribute storing the vertex header once per attribute is totally unnecessary. Some quick look at the generated assembly says llvm in fact cannot optimize away the additional stores (maybe due to potentially aliasing pointers somewhere). Plus, this makes the code cleaner and also allows using a vector "or" instead of scalar ones. commit 6b3a5a57b0b9850854cfbd7b586e4e50102dda71 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Mar 6 19:11:01 2012 +0100 draw: do the per-vertex "boolean" clipmask "or" with vectors no point extracting the values and doing it per component. Doesn't help that much since we still extract the values elsewhere anyway. commit 36519caf1af40e4480251cc79a2d527350b7c61f Author: Roland Scheidegger <sroland@vmware.com> Date: Fri Mar 2 22:27:01 2012 +0100 gallivm: fix lp_build_extract_broadcast with different sized vectors Fix the obviously wrong argument, so it doesn't blow up. commit 76d0ac3ad85066d6058486638013afd02b069c58 Author: José Fonseca <jfonseca@vmware.com> Date: Fri Mar 2 12:16:23 2012 +0000 draw: Compile per module and not per function (WIP). Enough to get gears w/ LLVM draw + softpipe to work on AVX doing: GALLIUM_DRIVER=softpipe SOFTPIPE_USE_LLVM=yes glxgears But still hackish -- will need to rethink and refactor this. commit 78e32b247d2a7a771be9a1a07eb000d1e54ea8bd Author: José Fonseca <jfonseca@vmware.com> Date: Wed Feb 29 12:01:05 2012 +0000 llvmpipe: Remove lp_state_setup_fallback. Never used. commit 6895d5e40d19b4972c361e8b83fdb7eecda3c225 Author: José Fonseca <jfonseca@vmware.com> Date: Mon Feb 27 19:14:27 2012 +0000 llvmpipe: Don't emit EMMS on x86 We already take precautions to ensure that LLVM never emits MMX code. commit 4822fea3f0440b5205e957cd303838c3b128419c Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Feb 29 15:58:19 2012 +0100 draw: modifications for larger vector sizes We want to be able to use larger vectors especially for running the vertex shader. With this patch we build soa vectors which might have a different length than 4. Note that aos structures really remain the same, only when aos structures are converted to soa potentially different sized vectors are used. Samplers probably don't work yet, didn't look at them. Testing done: glxgears works with both 128bit and 256bit vectors. commit f4950fc1ea784680ab767d3dd0dce589f4e70603 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Feb 29 15:51:57 2012 +0100 gallivm: override native vector width with LP_NATIVE_VECTOR_WIDTH env var for debug commit 6ad6dbf0c92f3bf68ae54e5f2aca035d19b76e53 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Feb 29 15:51:24 2012 +0100 draw: allocate storage with alignment according to native vector width commit 7bf0e3e7c9bd2469ae7279cabf4c5229ae9880c1 Author: José Fonseca <jfonseca@vmware.com> Date: Fri Feb 24 19:06:08 2012 +0000 gallivm: Fix comment grammar. Was missing several words. Spotted by Roland. commit b20f1b28eb890b2fa2de44a0399b9b6a0d453c52 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 19:22:09 2012 +0000 gallivm: Use MC-JIT on LLVM 3.1 + (i.e, SVN) MC-JIT Note: MC-JIT is still WIP. For this to work correctly it requires LLVM changes which are not yet upstream. commit b1af4dfcadfc241fd4023f4c3f823a1286d452c0 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Feb 23 20:03:15 2012 +0100 llvmpipe: use new lp_type_width() helper in lp_test_blend commit 04e0a37e888237d4db2298f31973af459ef9c95f Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Feb 23 19:50:34 2012 +0100 llvmpipe: clean up lp_test_blend a little Using variables just sized and aligned right makes it a bit more obvious what's going on. The test still only tests vector length 4. For AoS anything else probably isn't going to work. For SoA other lengths should work (at least with floats). commit e61c393d3ec392ddee0a3da170e985fda885a823 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 17:48:30 2012 +0000 gallivm: Ensure vector width consistency. Instead of assuming that everything is the max native size. commit 330081ac7bc41c5754a92825e51456d231bf84dd Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 17:44:14 2012 +0000 draw: More simd vector width consistency fixes. commit d90ca002753596269e37297e2e6c139b19f29f03 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 17:43:00 2012 +0000 gallivm: Remove unused lp_build_int32_vec4_type() helper. commit cae23417824d75869c202aaf897808d73a2c1db0 Author: Roland Scheidegger <sroland@vmware.com> Date: Thu Feb 23 17:32:16 2012 +0100 gallivm: use global variable for native vector width instead of define We do not know the simd extensions (and hence the simd width we should use) available at compile time. At least for now keep a define for maximum vector width, since a global variable obviously can't be used to adjust alignment of automatic stack variables. Leave the runtime-determined value at 128 for now in all cases. commit 51270ace6349acc2c294fc6f34c025c707be538a Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 15:41:02 2012 +0000 gallivm: Add a hunk inadvertedly lost when rebasing. commit bf256df9cfdd0236637a455cbaece949b1253e98 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 14:24:23 2012 +0000 llvmpipe: Use consistent vector width in depth/stencil test. commit 5543b0901677146662c44be2cfba655fd55da94b Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 14:19:59 2012 +0000 draw: Use a consistent the vector register width. Instead of 4x32 sometimes, LP_NATIVE_VECTOR_WIDTH other times. commit eada8bbd22a3a61f549f32fe2a7e408222e5c824 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 12:08:04 2012 +0000 gallivm: Remove garbagge collection. MC-JIT will require one compilation per module (as opposed to one compilation per function), therefore no state will be shared, eliminating the need to do garbagge collection. commit 556697ea0ed72e0641851e4fbbbb862c470fd7eb Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 10:33:41 2012 +0000 gallivm: Move all native target initialization to lp_set_target_options(). commit c518e8f3f2649d5dc265403511fab4bcbe2cc5c8 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 09:52:32 2012 +0000 llvmpipe: Create one gallivm instance for each test. commit 90f10af8920ec6be6f2b1e7365cfc477a0cb111d Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 09:48:08 2012 +0000 gallivm: Avoid LLVMAddGlobalMapping() in lp_bld_assert(). Brittle, complex, and unecesary. Just use function pointer constant. commit 98fde550b33401e3fe006af59db4db628bcbf476 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 09:21:26 2012 +0000 gallivm: Add a lp_build_const_func_pointer() helper. To be reused in all places where we want to call C code. commit 6cfedadb62c2ce5af8d75969bc95a607f3ece118 Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 09:44:41 2012 +0000 gallivm: Cleanup/simplify lp_build_const_string_variable. - Move to lp_bld_const where it belongs - Rename to lp_build_const_string - take the length from the argument (and don't count the zero terminator twice) - bitcast the constant to generic i8 * commit db1d4018c0f1fa682a9da93c032977659adfb68c Author: José Fonseca <jfonseca@vmware.com> Date: Thu Feb 23 11:52:17 2012 +0000 gallivm: Set NoFramePointerElimNonLeaf to true where supported. commit 088614164aa915baaa5044fede728aa898483183 Author: Roland Scheidegger <sroland@vmware.com> Date: Wed Feb 22 19:38:47 2012 +0100 llvmpipe: pass in/out pointers rather scalar floats in lp_bld_arit we don't want llvm to potentially optimize away the vectors (though it doesn't seem to currently), plus we want to be able to handle in/out vectors of arbitrary length. commit 3f5c4e04af8a7592fdffa54938a277c34ae76b51 Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Feb 21 23:22:55 2012 +0100 gallivm: fix lp_build_sqrt() for vector length 1 since we optimize away vectors with length 1 need to emit intrinsic without vector type. commit 79d94e5f93ed8ba6757b97e2026722ea31d32c06 Author: José Fonseca <jfonseca@vmware.com> Date: Wed Feb 22 17:00:46 2012 +0000 llvmpipe: Remove lp_test_round. commit 81f41b5aeb3f4126e06453cfc78990086b85b78d Author: Roland Scheidegger <sroland@vmware.com> Date: Tue Feb 21 23:56:24 2012 +0100 llvmpipe: subsume lp_test_round into lp_test_arit Much simpler, and since the arguments aren't passed as 128bit values can run on any arch. This also uses the float instead of the double versions of the c functions (which probably was the intention anyway). In contrast to lp_test_round the output is much less verbose however. Tested vector width of 32 to 512 bits - all pass except 32 (length 1) which crashes in lp_build_sqrt() due to wrong type. Signed-off-by: José Fonseca <jfonseca@vmware.com> commit 945b338b421defbd274481d8c4f7e0910fd0e7eb Author: José Fonseca <jfonseca@vmware.com> Date: Wed Feb 22 09:55:03 2012 +0000 gallivm: Centralize the function compilation logic. This simplifies a lot of code. Also doing this in a central place will make it easier to carry out the changes necessary to use MC-JIT in the future. gallivm: Fix typo in explicit derivative shuffle. Trivial. draw: make DEBUG_STORE work again adapt to lp_build_printf() interface changes Reviewed-by: José Fonseca <jfonseca@vmware.com> draw: get rid of vecnf_from_scalar() just use lp_build_broadcast directly (cannot assign a name but don't really need it, vecnf_from_scalar() was producing much uglier IR due to using repeated insertelement instead of insertelement+shuffle). Reviewed-by: José Fonseca <jfonseca@vmware.com> llvmpipe: fix typo in complex interpolation code Fixes position interpolation when using complex mode (piglit fp-fragment-position and similar) Reviewed-by: José Fonseca <jfonseca@vmware.com> draw: fix clipvertex/position storing again This appears to be the result of a bad merge. Fixes piglit tests relying on clipping, like a lot of the interpolation tests. Reviewed-by: José Fonseca <jfonseca@vmware.com> gallivm: Fix explicit derivative manipulation. Same counter variable was being used in two nested loops. Use more meanigful variable names for the counter to fix and avoid this. gallivm: Prevent buffer overflow in repeat wrap mode for NPOT. Based on Roland's patch, discussion, and review . Reviewed-by: Roland Scheidegger <sroland@vmware.com> gallivm: Fix dims for TGSI_TEXTURE_1D in emit_tex. Reviewed-by: Roland Scheidegger <sroland@vmware.com> gallivm: Fix explicit volume texture derivatives. Reviewed-by: Roland Scheidegger <sroland@vmware.com> gallivm: fix 1d shadow texture sampling Always r coordinate is used, hence need 3 coords not two (the second one is unused). Reviewed-by: José Fonseca <jfonseca@vmware.com> gallivm: Enable AVX support without MCJIT, where available. For now, this just enables AVX on Windows for testing. If the code is stable then we might consider prefering the old JIT wherever possible. No change elsewhere. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-17 13:42:39 +01:00
José Fonseca	ba9c1773d7	gallivm: Allow to force nearest filtering on a per-axis basis. Experimental code, not really used yet.	2012-07-17 13:42:39 +01:00
Kristian Høgsberg	b262f56738	wayland: Include wl_drm format enum in wayland-drm.h This gets referenced before we get to generate the header files, so just include the enum that we need and don't include the generated header.	2012-07-17 08:30:39 -04:00
James Benton	e253175c9c	llvmpipe: Fix bug with blend factor in complementary optimisations. Fixes fdo 52168. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-17 13:16:38 +01:00
Christian König	89e755d762	radeonsi: fix vertex element state The vertex element state isn't in registers any more, so remove that old code. That fixes a memory corruption with the blend state and gets eglgears partially working. Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2012-07-17 10:44:12 +02:00
Christian König	4247fd9928	radeon/llvm: fix compiling when llvm is active, but opencl isn't Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-17 10:43:53 +02:00
Brian Paul	aa0becdbb6	mesa: include inttypes.h to get uint8_t type To fix MSVC build.	2012-07-16 16:12:02 -06:00
Brian Paul	fe2a7b7e7f	st/egl: fix uninitialized pointer bug If no format is matched in the loop the value of xconf was undefined. NOTE: This is a candidate for the 8.0 branch.	2012-07-16 16:03:31 -06:00
Brian Paul	2f92a9f721	r300g: silence uninitialized var warning	2012-07-16 16:03:31 -06:00
Elvis Lee	cf775c9cbf	egl_dri2: NULL check for EGLNativeWindowType Some application calls eglCreateWindowSurface with EGLNativeWindowType parameter having zero value. It causes SEGV and disturbs error handling like EGL_NO_SURFACE. Signed-off-by: Elvis Lee <kwangwoong.lee@lge.com> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-07-16 16:03:31 -06:00
Jon TURNEY	d80fd04639	Fix building mesa with assembly enabled since `a112ca5d` `a112ca5d` rather crassly smashed all the compiler flags together into AM_CFLAGS. Separate them out the way they were before, putting pre-processor flags into AM_CPPFLAGS, so assembly source gets preprocessed with the correct pre-processor flags as well. Also, remove unneeded CFLAGS from AM_CFLAGS, and CXXFLAGS from AM_CXXFLAGS Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Tested-by: Brian Paul <brianp@vmware.com>	2012-07-16 22:54:36 +01:00
Chad Versace	8dc074cd92	intel: Fix build broken by ETC1 patch I suck at resolving merge conflicts and broke the build in `a5a34b1`. This patch adds the missing field intel_mipmap_tree::wraps_etc1. Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-16 14:29:24 -07:00
Chad Versace	a5a34b153d	intel: Enable GL_OES_compressed_ETC1_RGB8_texture Enable it for all hardware. No current hardware supports ETC1, so this patch implements it by translating the ETC1 data to RGBX data during the call to glCompressedTexImage2D(). For details, see the doxygen for intel_mipmap_tree::wraps_etc1. Passes the Piglit test spec/OES_compressed_ETC1_RGB8_texture/miptree and the ETC1 test in the GLES2 conformance suite. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-16 14:11:12 -07:00
Chad Versace	8ec721264c	mesa: Add function for decoding ETC1 textures Add function _mesa_etc1_unpack_rgba8888. It is intended to be used by glCompressedTexSubImage2D to decode ETC1 textures into RGBA. CC: Chia-I <olv@lunarg.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-16 14:07:57 -07:00
Chad Versace	d7458e401e	gallium/util, mesa: Refactor etc1 unpack function Move the body of util_etc1_rgb8_unpack_rgba_unorm8 into a new function that can be shared between gallium and dri drivers, texcompress_etc_tmp.h:etc1_unpack_rgba8888. CC: Chia-I <olv@lunarg.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-16 14:07:57 -07:00
Kristian Høgsberg	7250cd506b	gbm: Rename gbm_bo_get_pitch to gbm_bo_get_stride We use pitch for 'pixels per row' and stride for 'bytes per row' pretty consistently in mesa and most other places, so rename the gbm API.	2012-07-16 16:29:16 -04:00
Kristian Høgsberg	44f066b9ff	gbm: Add new gbm_bo_import entry point This generalizes and replaces gbm_bo_create_for_egl_image. gbm_bo_import will create a gbm_bo from either an EGLImage or a struct wl_buffer.	2012-07-16 16:29:15 -04:00
Roland Scheidegger	43ccded1e1	llvmpipe: destroy setup variants on context destruction lp_delete_setup_variants() used to be called in garbage collection, but this no longer exists hence the setup shaders never got freed. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-16 19:00:54 +01:00
James Benton	8684ffc141	llvmpipe: Unified common code between AoS and SoA blending. Added a new file lp_bld_blend.c for the common code. Merged and added some simple optimisations. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-07-16 19:00:54 +01:00
Kristian Høgsberg	636646a481	intel: Don't call _mesa_get_format_bytes for MESA_FORMAT_NONE When we don't intend to texture from or render to a __DRIimage we use __DRI_IMAGE_FORMAT_NONE. In that case, we just create the __DRIimage to reference the underlying buffer, and will create usable __DRIimages from it using createSubImage later. If we try to use _mesa_get_format_bytes() on MESA_FORMAT_NONE in a debug build, we hit an assertion, so let's not do that.	2012-07-16 11:00:16 -04:00
Jon TURNEY	81de0431d6	Fix building glsl when using automake-1.12 after `68e04cc6` Commit `68e04cc6` was tested using automake-1.11. Unfortunately, automake-1.12 made a "slightly backward-incompatible change" in the use of yacc with C++, and for a .yy file, the generated header file is now named .hh, not .h To work with both, write our own rule for running yacc, which generates a header file named .h, rather than using automake's rule. Also, remove things from BUILD_SOURCES which don't need to be there Also, update EXCLUDE rules in doxygen/glsl.doxy, for change of generated files from .cpp -> .cc, and glsl_lexer.h has never existed. Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk>	2012-07-15 15:27:26 +01:00
Marek Olšák	bc6bff7947	r600g: compute needed CS space for vertex buffers correctly	2012-07-15 15:26:14 +02:00
Marek Olšák	15ca9d159e	r600g: don't check the R600_GLSL130 env var GLSL 1.3 has been enabled by default for quite a while.	2012-07-15 02:16:46 +02:00
Jerome Glisse	e634651024	r600g: fix DB decompression on evergreen Separated out of the hyperz patch by Marek with minor modifications. Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-07-15 02:06:44 +02:00
Tom Stellard	c2f444c54d	r600g: Emit vertex buffers using the same method as constant buffers Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-07-15 02:00:27 +02:00
Tom Stellard	9b76ee70b2	r600g: Unify 3D and compute vertex buffer emission Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-07-15 02:00:21 +02:00
Marek Olšák	0b4c5dbb8c	r600g: fix grammar constant_buffer -> constant_buffers	2012-07-15 01:41:11 +02:00
Andreas Boll	e3ff4d4c10	radeon/llvm: Fix CR/LF in AMDILSIDevice.h	2012-07-13 16:35:22 +00:00
Tom Stellard	cc3907856e	radeon/llvm: Clean up AMDILIntrinsicInfo.cpp	2012-07-13 16:29:46 +00:00
Tom Stellard	f323c6260d	radeon/llvm: Coding style fixes	2012-07-13 16:29:46 +00:00
Jon TURNEY	39d82a1b20	Fix linking gallium drivers and with dricore after `defadf2b1` Commit `defadf2b1` erroneously tries to make gallium drivers link with libdricore as a static library, not a shared library Also, change uses of DRI_LIB_DEPS in gallium driver Makefiles to GALLIUM_DRI_LIB_DEPS, so the libraries added are used in the linking the gallium driver Also, fix the path to the libdricore.so symlink, it's made in LIB_DIR, not in the libdricore directory Also repair quoting of dricore settings of DRI_LIB_DEPS and GALLIUM_DRI_LIB_DEPS variables so VERSION is interpolated in configure but TOP and LIB_DIR are interpolated later (where they are known, but VERSION isn't) Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-13 17:20:39 +01:00
Christoph Bumiller	9ed65301e0	nouveau: implement missing timer query functionality	2012-07-13 17:28:00 +02:00
Kristian Høgsberg	426a23af14	wayland: Stop trying to use make rules from aclocal, just copy and paste Defeated by autotool, copy and paste to the rescue. https://bugs.freedesktop.org/show_bug.cgi?id=51997 https://bugs.freedesktop.org/show_bug.cgi?id=51531 Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-13 11:20:17 -04:00
José Fonseca	b3ba0a7afa	mesa/st: Generates TGSI that always recognizes INSTANCEID/VERTEXID as integers. Tested by running piglit draw-instanced, and by forcing llvmpipe advertise no native integer support, which now produces: VERT DCL IN[0] DCL SV[0], INSTANCEID DCL OUT[0], POSITION DCL OUT[1], COLOR DCL CONST[0..19] DCL TEMP[0], LOCAL DCL TEMP[1], LOCAL DCL TEMP[2], LOCAL DCL ADDR[0] 0: U2F TEMP[0].x, SV[0] 1: ARL ADDR[0].x, TEMP[0].xxxx 2: MOV TEMP[1].xy, CONST[ADDR[0].x+8].xyxx 3: ADD TEMP[2].x, IN[0].xxxx, TEMP[1].xxxx 4: ADD TEMP[1].x, IN[0].yyyy, TEMP[1].yyyy 5: MUL TEMP[2], CONST[16], TEMP[2].xxxx 6: MAD TEMP[2], CONST[17], TEMP[1].xxxx, TEMP[2] 7: MAD TEMP[2], CONST[18], IN[0].zzzz, TEMP[2] 8: MAD TEMP[2], CONST[19], IN[0].wwww, TEMP[2] 9: ARL ADDR[0].x, TEMP[0].xxxx 10: MOV TEMP[1], CONST[ADDR[0].x] 11: MOV OUT[0], TEMP[2] 12: MOV OUT[1], TEMP[1] 13: END	2012-07-13 13:01:52 +01:00
José Fonseca	6dddd18480	draw,gallivm: Fix draw_get_shader_param. - Use LLVM limits when LLVM is being used, instead of TGSI limits - Provide draw_get_shader_param_no_llvm for when llvm is never used (softpipe) - Eliminate several of the hacks around draw shader caps in several drivers Unfortunately the hack for PIPE_MAX_VERTEX_SAMPLERS is still necessary. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-13 13:01:51 +01:00
Jon TURNEY	99728076ec	Don't explicitly link libOsmesa with libmesa's dependency libglsl The libmesa convenience library is linked with the libglsl convenience library. libOsmesa is linked with libmesa, and also directly with libglsl. When using libtool, this gives rise to duplicate symbol errors. Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:44:44 +01:00
Jon TURNEY	b2a37e242e	automake: convert libglapi * "configure substitutions are not allowed in _SOURCES variables" in automake, so remove the AC_SUBST'ed GLAPI_ASM_SOURCES and instead use some AM_CONDITIONALS to choose which asm sources are used * Change GLAPI_LIB to point to the .la file in other Makefile.am files, and make a link to the .a file for the convenience of other Makefiles which have not yet been converted to automake v2: - Use AM_CPPFLAGS for cleaner build output - EXTRA_SOURCES is not needed - Remove libglapi.a compatibility link on clean Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:44:07 +01:00
Jon TURNEY	1e48dfeee6	Rename X86-64_API -> X86_64_API automake doesn't allow hyphens in variable names Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:44:05 +01:00
Jon TURNEY	defadf2b15	Link dri drivers with mesa or dricore libtool library Now mesa/drivers/dri is converted to automake, we want to update DRI_LIB_DEPS so that we link with the libmesa or libdricore libtool library, as appropriate. However, this is complicated by the fact that gallium/targets is not (yet) converted, so we can't share the DRI_LIB_DEPS autoconf variable with that anymore. Add an additional autoconf variable GALLIUM_DRI_LIB_DEPS, which is now used in gallium/targets/Makefile.dri, to link with the libdircore or libmesa native library. v2: libdricore$VERSION.a needs to be libdricore$(VERSION).a Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:44:03 +01:00
Jon TURNEY	cf362d00b9	Remove unused MESA_MODULES autoconf variable Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:44:01 +01:00
Jon TURNEY	a112ca5d5f	automake: convert libmesa and libmesagallium * "configure substitutions are not allowed in _SOURCES variables" in automake, so instead of MESA_ASM_FILES, use some AM_CONDITIONALS to choose which architecture's asm sources are used in libmesa_la_SOURCES. (Can't remove MESA_ASM_FILES autoconf variable as it's still used in sources.mak) * Update to link with the .la file in other Makefile.am files, and make a link to the .a file for the convenience of other Makefiles which have not yet been converted to automake v2: Remove stray -static from LDFLAGS v3: Remove .a compatibility link on clean Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:43:58 +01:00
Jon TURNEY	8676890018	Rename sparc/clip.S -> sparc/sparc_clip.S Automake can't handle having both clip.S and clip.c, even though they have different paths "src/mesa/Makefile.am: object `clip.lo' created by `$(SRCDIR)/sparc/clip.S' and `$(SRCDIR)/main/clip.c'" Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:43:56 +01:00
Jon TURNEY	68e04cc601	automake: convert libglsl v2: Use AM_V_GEN to silence generated code rules. Add BUILT_SOURCES to CLEANFILES v3: - Fix an accidental // in a path - Use automake make rules for lex/yacc rather than writing our own - Update .gitignore appropriately - Build a libglcpp convenience library rather than awkwardly including the files in libglsl and delegating the generation - Remove libglsl.a compatibility link on clean v4: - Automake's rules for lex/yacc make .cc if source is .ll or .yy, and apparently we must use those extensions "because of scons", so update everywhere glsl_parser.cpp -> glsl_parser.cc and glsl_lexer.cpp -> glsl_lexer.cc. This fixes 'make tarballs' and building with dricore enabled. Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:43:41 +01:00
Laurent Carlier	284325d97b	automake: convert libOSmesa This also currently fix the installation of libOSmesa. v2: Remove old Makefile, libOSmesa is now versioned, fix typos v3: Keep config substitution alphabetized v4: Update .gitignore v5: Libraries will be in the builddir, not the srcdir. Reviewed-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Matt Turner <mattst88@gmail.com>	2012-07-13 12:43:39 +01:00
Marek Olšák	1a06e8454e	mesa,st/mesa: implement GL_RGB565 from ARB_ES2_compatibility This was not implemented, because the spec was changed just recently. Everything has been in place already. Gallium has PIPE_FORMAT_B5G6R5_UNORM, while Mesa has MESA_FORMAT_RGB565. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-13 01:36:07 +02:00
Kenneth Graunke	fe911c1d43	i965: Move loop over texture units into brw_populate_sampler_prog_key. The whole reason I avoided this was because it might operate on a brw_vertex_program or a brw_fragment_program. However, that isn't a problem: all we need is the gl_program base type. This avoids awkwardly passing the loop counter 'i' as a parameter, simplifies both callers, and also plumbs prog in place for future use. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-12 14:17:44 -07:00
Kenneth Graunke	86e401b771	i965: Always emit alpha when nr_color_buffers == 0. If alpha-testing is enabled, we need to send alpha down the pipeline even if nr_color_buffers == 0. However, tracking whether alpha-testing is enabled in the WM program key is expensive: it causes us to compile multiple specializations of the same shader, using program cache space. This patch removes the check for alpha-testing, and simply emits alpha whenever nr_color_buffers == 0. We believe this will also be necessary for alpha-to-coverage, and it should add minimal overhead to an uncommon case. Saving the recompiles should more than make up the difference. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-12 13:35:46 -07:00
Kenneth Graunke	16060531ba	i965: Use the blitter in intel_bufferobj_subdata for busy BOs on Gen6+. Previously we only did this pre-Gen6, and used pwrite on Gen6+. In one workload, this cuts significant amount of overhead. v2: Simplify the function based on Eric's suggestions. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-12 13:35:46 -07:00
José Fonseca	978807ef01	gallivm: Use %.9g to print floats. So that we can see them in their full denormalized glory. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-12 21:14:35 +01:00
José Fonseca	5b8d80a783	scons: Remove -ffast-math. We rely on proper IEEE 754 behavior in too many places for this. See also commit `2fdbbeca43` with equivalent change for autoconf. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-12 21:14:29 +01:00
José Fonseca	bd3aab8d79	scons: Also require recent XCB. And don't trip when it's not found -- simply skip building src/glx.	2012-07-12 21:13:10 +01:00
Eric Anholt	6882381a2e	mesa: Require current libxcb. Without that, people with buggy apps that looked at just the server string for GLX_ARB_create_context would call this function that just threw an error when you tried to make a context. Google shows plenty of complaints about this. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 12:29:12 -07:00
Tom Stellard	f92873be2c	radeon/llvm: Don't use lp_build_swizzle_aos() for swizzles This function assumes that lp_build_context::type is a vector type, which is not true for r600 or radeonsi. This fixes an assertion failure using glamor 2D accel.	2012-07-12 13:53:22 -04:00
Tom Stellard	185fc9a5ef	radeonsi: Dump TGSI code prior to doing TGSI->LLVM conversion. This way if the conversion fails, we know what the TGSI shader looks like.	2012-07-12 13:53:22 -04:00
Kenneth Graunke	b546aebae9	i965: Delete previous workaround for textureGrad with shadow samplers. It had many problems: - The shadow comparison was done post-filtering. - It required state-dependent recompiles whenever the comparison function changed. - It didn't even work: many cases hit assertion failures. - I never implemented it for the VS. The new lowering pass which converts textureGrad to textureLod by computing the LOD value works much better. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-12 10:20:26 -07:00
Kenneth Graunke	b0c8d3be73	i965: Add a lowering pass to convert TXD to TXL by computing the LOD. Intel hardware doesn't natively support textureGrad with shadow comparisons. So we need to generate code to handle it somehow. Based on the equations of page 205 of the OpenGL 3.0 specification, it's possible to compute the LOD value that would be selected given the gradient values. Then, we can simply convert the TXD to a TXL. Currently, this passes 34/46 of oglconform's shadow-grad subtests; four cubemap tests are regressed. We should investigate this in the future. v2: Apply abs() to the scalar case (thanks to Eric). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-12 10:20:23 -07:00
Kenneth Graunke	d9da350a83	glsl/ir_builder: Add a new swizzle_for_size() function. This swizzles away unwanted components, while preserving the order of the ones that remain. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-12 10:20:20 -07:00
Kenneth Graunke	0bb3d4ba54	glsl/ir_builder: Add a generic constructor for unary expressions. I needed to compute logs and square roots in a patch I was working on, and wanted to use the convenient interface. We already have a similar constructor for binops; adding one for unops seems reasonable. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-12 10:20:18 -07:00
Kenneth Graunke	b656df990f	glsl: Initialize coordinate to NULL in ir_texture constructor. I ran into this while trying to create a TXS query, which doesn't have a coordinate. Since it didn't get initialized to NULL, a bunch of visitors tried to access it and crashed. Most of the time, this won't be a problem, but it's just a good idea. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-12 10:19:38 -07:00
José Fonseca	d9a8cd76e5	st/xorg: Fix build failure due to symbol clash.	2012-07-12 16:02:49 +01:00
Marek Olšák	0f3659bb56	docs: update relnotes-8.1 and GL3 status	2012-07-12 13:05:59 +02:00
Marek Olšák	63d8c8baa9	st/mesa: expose new transform feedback extensions	2012-07-12 13:05:59 +02:00
Marek Olšák	d24ece97e5	mesa: add ARB_transform_feedback_instanced extension enable flag Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:59 +02:00
Marek Olšák	db7404defd	mesa: implement new DrawTransformFeedback functions Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:59 +02:00
Marek Olšák	7e0cb473b0	mesa: implement display list support for new DrawTransformFeedback functions Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:59 +02:00
Marek Olšák	ce16ca4635	mesa: implement display list support for indexed query functions Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:59 +02:00
Marek Olšák	553e13dbc2	mesa: implement indexed query functions from ARB_transform_feedback3 Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:58 +02:00
Marek Olšák	375e73d859	mesa: implement glGet queries and error handling for ARB_transform_feedback3 Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:58 +02:00
Marek Olšák	21cb5ed20d	glsl: implement ARB_transform_feedback3 in the linker Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:58 +02:00
Marek Olšák	9576d555e0	glapi: add ARB_transform_feedback_instanced Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:58 +02:00
Marek Olšák	6d13d91f4e	glapi: add ARB_transform_feedback3 Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-12 13:05:58 +02:00
Marek Olšák	e773a48a3b	r600g: fix uploading non-zero mipmap levels of depth textures This fixes piglit/depth-level-clamp. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:31 +02:00
Marek Olšák	fe1fd67556	r600g: don't flush depth textures set as colorbuffers The only case a depth buffer can be set as a color buffer is when flushing. That wasn't always the case, but now this code isn't required anymore. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:31 +02:00
Marek Olšák	6842d5fced	r600g: don't set dirty_db_mask for a flushed depth texture A flush depth texture is never set as a depth buffer and never flushed. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:31 +02:00
Marek Olšák	5a17d8318e	r600g: flush depth textures bound to vertex shaders This was missing/broken. There are also minor code cleanups. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:31 +02:00
Marek Olšák	dee58f94af	r600g: do fine-grained depth texture flushing - maintain a mask of which mipmap levels are dirty (instead of one big flag) - only flush what was requested at a given point and not the whole resource (most often only one level and one layer has to be flushed) Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	df79eb5956	r600g: remove is_flush from DSA state we can just update the state when decompressing, there's no need to add additional info into the DSA state Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	43e3f19c76	r600g: set DISABLE in CB_COLOR_CONTROL if colormask is 0 this will be useful for in-place DB decompression, otherwise should be harmless Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	4fe74412cf	r600g: move CB_SHADER_MASK setup into cb_misc_state Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	a1a1ff5ec0	r600g: move MULTIWRITE setup into cb_misc_state for r6xx-r7xx Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	0ea76916e6	r600g: move CB_TARGET_MASK setup into new cb_misc_state to remove some overhead from draw_vbo. This is a derived state. BTW, I've got no idea how compute interacts with 3D here, but it should use cb_misc_state, so that 3D and compute don't conflict. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	5ba15d8d38	st/mesa: implement accelerated stencil blitting using shader stencil export Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	a7f3697eb8	st/mesa: set colormask to zero when blitting depth Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	5a74e17ab0	gallium/u_blit: remove useless memset calls the structure is calloc'd. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	24e0a26335	gallium/u_blit: drop not-very-useful wrapper around util_blit_pixels_writemask just rename it to util_blit_pixels Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	3f13b5da15	gallium/u_blit: don't do two copies for non-2D textures Because u_blit couldn't sample a 1D, 3D, CUBE and ARRAY texture, we created a 2D texture holding a copy of one slice of the source texture (even for 1D). Let's just do it right. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	2dca61bcb3	gallium/util: move pipe_tex_to_tgsi_tex helper function into u_inlines Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	bdaf0a085b	gallium/u_blitter: accelerate stencil-only copying This doesn't seem to be used by anything yet, but better safe than sorry. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	12fd81f9e7	gallium/u_blitter: accelerate depth-stencil copying using shader stencil export This fixes stencil buffer write transfers on r600g. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	76db2c121c	gallium: add util_format_stencil_only helper function used for stencil sampler views. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	a730838a42	gallium/u_blitter: minify depth0 when initializing last_layer Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	91cf9fe988	gallium/u_gen_mipmap: accelerate depth texture mipmap generation Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Marek Olšák	13b0af721a	mesa: remove assertions that do not allow compressed 2D_ARRAY textures NOTE: This is a candidate for the 8.0 branch. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-07-12 02:08:30 +02:00
Paul Berry	33202b4876	i965/msaa: Enable CMS layout on Gen7 for the formats that support it. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:50 -07:00
Paul Berry	4ebbc76621	i965/msaa: Add CMS support to blorp. This patch updates the blorp engine to properly handle the case where the surface being textured from uses Gen7's CMS MSAA layout. The following changes were necessary: - Before reading color values from the surface, we need to read from the MCS buffer using the ld_mcs sampler message. This is done by the mcs_fetch() function, and the result is stored in the mcs_data register. This only needs to be done once per pixel, since the MCS value is shared between all samples belonging to a pixel. - When reading color values from the surface, we need to use the ld2dms sampler message instead of the ld2dss message, and we need to provide the value read from the MCS buffer as an argument. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	754953693d	i965/msaa: Add CMS-related sampler messages to brw_defines.h. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	7b3263af69	i965/msaa: Set SURFACE_STATE properly when CMS MSAA is in use. When a buffer using Gen7's CMS MSAA layout is bound to a texture or a render target, the SURFACE_STATE structure needs to point to the MCS buffer and to indicate its pitch. This patch updates the functions that emit SURFACE_STATE to handle CMS layout properly. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	0ba813506d	i965/msaa: Add CMS MSAA settings to brw_structs.h. Previously the DWORD used to control the CMS MSAA layout was just a pad value, because we didn't use it. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	ccae1b1cd7	i965/msaa: Allocate MCS buffer when CMS MSAA is in use. To implement Gen7's CMS MSAA layout, we need an extra buffer, the MCS (Multisample Control Surface) buffer. This patch introduces code for allocating and deallocating the buffer, and storing a pointer to it in the intel_mipmap_tree struct. No functional change, since the CMS layout is not enabled yet. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	1bd4d456cd	i965/msaa: Add an enum to describe MSAA layout. From the Ivy Bridge PRM, Vol 1 Part 1, p112: There are three types of multisampled surface layouts designated as follows: - IMS Interleaved Multisampled Surface - CMS Compressed Mulitsampled Surface - UMS Uncompressed Multisampled Surface Previously, the i965 driver only used IMS and UMS formats, and distinguished beetween them using the boolean intel_mipmap_tree::msaa_is_interleaved. To facilitate adding support for the CMS format, this patch replaces that boolean (and other booleans derived from it) with an enum INTEL_MSAA_LAYOUT_{IMS,CMS,UMS}. It also updates the terminology used in comments throughout the driver to match the IMS/CMS/UMS terminology used in the PRM. CMS layout is not yet used. The enum has a fourth possible value, INTEL_MSAA_LAYOUT_NONE, which is used for non-multisampled surfaces. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Paul Berry	67b0f7c7dd	i965/msaa: Move {rt,tex}_interleaved into blorp program key. On Gen6, MSAA buffers always use an interleaved layout and non-MSAA buffers always use a non-interleaved layout, so it is not strictly necessary to keep track of the layout of the texture and render target surfaces in the blorp program key. However, it is cleaner to do so, since (a) it makes the blorp compiler less dependent on implicit knowledge about how the GPU pipeline is configured, and (b) it paves the way for implementing compressed multisampled surfaces in Gen7. This patch won't cause any redundant compiles, because the layout of the texture and render target surfaces depends on other parameters that are already in the blorp program key. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-11 15:14:49 -07:00
Kristian Høgsberg	2adfce4a18	mapi: Move GL_NV_draw_buffers extension to es_EXT.xml We don't generate public entrypoints for GLES extensions, so move the GL_NV_draw_buffers definition from ARB_draw_buffers.xml to es_EXT.xml. When the extension is defined in ARB_draw_buffers.xml, we end up with a public entry point for it, but no prototype, which gives an error when compiled with --disable-asm and --disable-shared-glapi. Instead, just move the GLES extension to es_EXT.xml so this doesn't happen. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-11 15:28:36 -04:00
Kristian Høgsberg	e6a33570b7	egl: Add EGL_WAYLAND_PLANE_WL attribute This lets us specify the plane to create the image for for multiplanar wl_buffers. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-11 15:28:36 -04:00
Kristian Høgsberg	1aaec8c609	wayland-drm: Add protocol to create planar buffers	2012-07-11 15:28:35 -04:00
Kristian Høgsberg	379eb47ea6	wayland-drm: Pass struct wl_drm_buffer to the driver We're going to extend this to support multi-plane buffers, so pass this to the driver so it can access the details.	2012-07-11 15:28:35 -04:00
Kristian Høgsberg	95bc0527e9	intel: Implement __DRIimage::createSubImage and bump supported version to 5 We use the new miptree offset to pick out the sub-image when we bind the EGLImage to a texture. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-11 15:28:35 -04:00
Kristian Høgsberg	02ebad900d	intel: Add offset field to miptree This lets us specify an offset into the bo where the miptree starts, which will let us set up a texture for a single plane in a planar buffer. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-11 15:28:35 -04:00
Kristian Høgsberg	44a2b57f93	intel: Add support for new __DRIimage formats	2012-07-11 15:28:34 -04:00
Kristian Høgsberg	c029834808	__DRIimage: version 5, add new formats and createSubImage The additions in version 5 enables creating EGLImages for different planes of a YUV buffer. createImageFromName is still used to create the containing __DRIimage, and createSubImage can then be used no that __DRIimage to create __DRIimages that correspond to the y, u, and v planes (__DRI_IMAGE_FORMAT_R8) or the uv planes (__DRI_IMAGE_FORMAT_RG88) for formats such as NV12 where the u and v components are interleaved. Packed formats such as YUYV etc doesn't require any special treatment, we just sample those as a regular ARGB texture. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-11 15:28:34 -04:00
Tom Stellard	c0f7fe7b79	r600g/compute: Disable growing the memory pool The code for growing the memory pool (which is used for storing all of the global buffers) wasn't working. There seem to be two separate issues with the memory pool code. The first was the way it was growing the pool. When the memory pool needed more space, it would: 1. Copy the data from the memory pool's backing texture to system memory. 2. Delete the memory pool's texture 3. Create a bigger backing texture for the memory pool. 4. Copy the data from system memory into the bigger texture. The copy operations didn't seem to be working, and I suspect that since they were using fragment shaders to do the copy, that there might have been a problem with the mixing of compute and 3D state. The other issue is that the size of 1D textures is limited, and I was having trouble getting 2D textures to work. I think these problems will be easier to solve once more code is shared between 3D and compute, which is why I decided to disable it for now rather than continue searching for a fix.	2012-07-11 17:53:54 +00:00
Tom Stellard	49ae102ee3	radeon/llvm: Use multiclasses for floating point loads The original strategy for handling floating point loads, which was to lower (f32 load) to (f32 bitcast (i32 load)) wasn't really working. The main problem was that the DAG legalizer couldn't handle replacing a node with two results (load) with a node with only one result (bitcast).	2012-07-11 17:47:20 +00:00
Tom Stellard	bbdf3af857	radeon/llvm: Don't set the IMM bit in SMRD instruction definitions. The IMM bit is already being set in SICodeEmitter.	2012-07-11 17:47:20 +00:00
Tom Stellard	d36499aa62	r600g/compute: Add more debugging output	2012-07-11 17:46:59 +00:00
Eric Anholt	f9b3e257d1	i965: Revert the VBOs-in-system-memory hack. It didn't change performance on Lightsmark or Nexuiz, which both used DYNAMIC_DRAW buffers, but it was killing performance (40% CPU wasted pwriting buffers) on a closed-source app we're looking at. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-11 09:20:21 -07:00
Eric Anholt	b5c037f6b1	Add emacs setup for the docs/devinfo.html comment wrapping recommendation. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-11 09:20:21 -07:00
Ian Romanick	a8724d85f8	glx/dri2: Add support for GLX_ARB_create_context_robustness Add the infrastructure required for this extension. There is no xserver support and no driver support yet. Drivers can enable this be advertising DRI2 version 4 and accepting the __DRI_CTX_FLAG_ROBUST_BUFFER_ACCESS flag and the __DRI_CTX_ATTRIB_RESET_STRATEGY attribute in create context. Some additional Mesa infrastructure is needed before drivers can do this. The GL_ARB_robustness spec, which all Mesa drivers already advertise, requires: "If the behavior is LOSE_CONTEXT_ON_RESET_ARB, a graphics reset will result in the loss of all context state, requiring the recreation of all associated objects." It is necessary to land this infrastructure now so that the related infrastructure can land in the xserver. The xserver has very long release schedules, and the remaining Mesa parts should land long, long before the next xserver merge window opens. v2: Expose robustness as a DRI2 extension rather than bumping __DRI_DRI2_VERSION. v3: Add a comment explaining why dri2->base.version >= 3 is also required for GLX_ARB_create_context_robustness. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-11 08:54:50 -07:00
Ian Romanick	de9ed51525	dri2: Hard-code the DRI2 version This allows revising the dri_interface.h separately from adding driver support. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-11 08:54:50 -07:00
Ian Romanick	2879f758b5	glapi: Apply Xorg indent rules to all files generated for the xserver Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-11 08:54:50 -07:00
Kenneth Graunke	a0698b000b	docs: Update GL3.txt. We neglected to list the deprecation model/forward compatible context support. inverse() has been done for a while. None of us know what "highp change" means; GLSL 1.30 already added the ability to recognize precision keywords, and it doesn't look like 1.40 has any new requirements there (precision keywords still have no meaning). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-10 16:53:49 -07:00
Chad Versace	551078bb62	mesa: Remove unneeded extern qualifiers Remove 'extern' from the functions declared in texcompress_etc.h. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-10 16:51:19 -07:00
Vadim Girlin	3770847960	r600g: improve flushed depth texture handling v2 Use r600_resource_texture::flished_depth_texture for GPU access, and allocate it in the VRAM. For transfers we'll allocate texture in the GTT and store it in the r600_transfer::staging. Improves performance when flushed depth texture is frequently used by the GPU, e.g. in Lightsmark (~30%) Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-07-11 02:39:59 +04:00
Kenneth Graunke	860d5bdf98	i965: Add hardware context support. With fixes and updates from Ben Widawsky and comments from Paul Berry. v2: Use drm_intel_gem_context_destroy to destroy hardware context; remove useless initialization of hw_ctx, both suggested by Eric. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Acked-by: Paul Berry <stereotype441@gmail.com>	2012-07-10 15:09:58 -07:00
Ian Romanick	4fae5e32d5	mesa/test: Update name of GL_TIME_ELAPSED `4952caa` caused the _EXT to fall off the name of this enum. This is fine. Update the unit test to expect the new value. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=51956	2012-07-10 14:46:25 -07:00
Andreas Boll	40742fa686	docs/relnotes-8.0.4: fix html markup	2012-07-10 12:59:34 -07:00
Marek Olšák	67a8ee891b	gallium/docs: document interface changes for timestamp query the query type is already documented	2012-07-10 19:04:13 +02:00
Marek Olšák	a3fccafda9	identity: implement get_timestamp	2012-07-10 19:04:13 +02:00
Marek Olšák	e66d90ec6b	noop: implement get_timestamp	2012-07-10 19:04:13 +02:00
Marek Olšák	642539e3f9	trace: implement get_timestamp	2012-07-10 19:04:12 +02:00
Marek Olšák	a471d268ec	galahad: implement get_timestamp	2012-07-10 19:04:12 +02:00
Marek Olšák	768589e836	docs: update relnotes-8.1 and GL3 status	2012-07-10 19:04:12 +02:00
Marek Olšák	5ddcda060c	softpipe: implement get_timestamp and expose ARB_timer_query PIPE_QUERY_TIMESTAMP is already implemented and working.	2012-07-10 19:04:12 +02:00
Marek Olšák	21f78d2189	st/mesa: implement ARB_timer_query	2012-07-10 19:04:12 +02:00
Marek Olšák	bcc735aaca	gallium: add QUERY_TIMESTAMP cap and get_timestamp screen function	2012-07-10 19:04:12 +02:00
Marek Olšák	d5a7866902	mesa: implement glGet(GL_TIMESTAMP) v2 This is adds a new driver function to retrieve the timestamp. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-10 19:04:12 +02:00
Marek Olšák	5094533040	mesa: add ARB_timer_query to the extension list Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-10 19:04:12 +02:00
Marek Olšák	204777c5dc	mesa: add QueryCounter display list support Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-10 19:04:12 +02:00
Marek Olšák	f601dcdf70	mesa: implement TIMESTAMP query and glQueryCounter Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-10 19:04:12 +02:00
Marek Olšák	4952caad2d	glapi: add ARB_timer_query Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-10 19:04:12 +02:00
Ian Romanick	25fec2e9ca	docs: Add 8.0.4 release notes Also add news story. Extra, extra! Read all about it! Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-10 09:05:39 -07:00
Eric Anholt	2d03f48a65	glsl: Add parsing for GLSL uniform blocks. This doesn't do anything with the uniform block declarations yet, so usage of those uniforms finds them to be undeclared. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-09 11:13:33 -07:00
Eric Anholt	912a429bc5	glsl: Don't hide the type of struct_declaration_list. I've been trying to derive from this for UBO support, and the slightly obfuscated types were putting me over the edge. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-09 11:12:18 -07:00
Kenneth Graunke	532e99cbf2	glcpp: Add built-in #define for GL_ARB_uniform_buffer_object. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-09 11:11:59 -07:00
Vincent Lejeune	7fabb2b593	glsl: Parser handles "#extension GL_ARB_uniform_buffer_object" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-09 11:11:38 -07:00
Eric Anholt	f4fb6bf088	glsl: Reduce a bit of extra code in the merging of layout qualifiers. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-09 11:05:33 -07:00
Eric Anholt	60a784d56e	glsl: Take advantage of the layout qualifier flags union to clean up parsing. The got_one variable was set iff one of the bits in flags.i was set. v2: Fix incorrect dropping of the ARB_conservative_depth warning. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-07-09 11:04:45 -07:00
Tom Stellard	9b00edc79a	r600g: Don't create a texture for the memory_pool during screen init This fixes a segfault in r600_screen_create() introduced by `eb065f5d9d` Reported by tilman on irc.	2012-07-09 12:14:07 -04:00
Tom Stellard	76b44034b9	radeon/llvm: Rename namespace from AMDIL to AMDGPU	2012-07-09 13:43:11 +00:00
Tom Stellard	39323e8f79	r600g: Update number of gprs when adding a vertex instruction	2012-07-09 13:42:24 +00:00
Tom Stellard	da9c8a73ec	r600g/compute: Use evergreen_cb() for binding RATs	2012-07-09 13:41:18 +00:00
Tom Stellard	960906d16b	r600g: Add support for RATs in evergreen_cb()	2012-07-09 13:41:18 +00:00
Tom Stellard	eb065f5d9d	r600g: Use a texture as the underlying resource for compute_memory_pool This the first step towards being able to use evergreen_cb to bind RATs.	2012-07-09 13:41:18 +00:00
Tom Stellard	9d36441374	r600g: Add is_rat flag to r600_resource_texture	2012-07-09 13:41:18 +00:00
Tom Stellard	3d3194e93c	r600g: Add r600_context_pipe_state_emit() This function is used when dispatching compute shader in order to avoid mixing compute and 3D registers in the context's dirty list. This allows the compute code to resuse 3D functions like evergreen_cb, which return a struct r600_pipe_state and still have control over when and how the register writes are emitted.	2012-07-09 13:41:17 +00:00
Tom Stellard	e00e1586dd	r600g: Add pkt_flag parameter to r600_context_block_emit_dirty() This allows the shader type bit to be set in the pm4 header when emitting registers for compute shaders. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-09 13:41:17 +00:00
Tom Stellard	25145de03e	r600g/compute: Move LOOP_CONST initialization to start_compute_cs atom	2012-07-09 13:41:17 +00:00
Tom Stellard	5016fe2d47	r600g: Add start_compute_cs atom to struct r600_context The start_compute_cs atom initializes some config and context registers to the values needed for running compute shaders. When a compute shader is dispatched, this atom is emitted after the start_cs_cmd atom, which initializes registers that are common to both 3D and compute. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-09 13:41:17 +00:00
Tom Stellard	38be0966c7	r600g: Add pkt_flag member to struct r600_command_buffer Some packets require the shader type bit (bit 1) to be set when used for compute shaders. The pkt_flag will be initialized to RADEON_CP_PACKET3_COMPUTE_MODE for any struct r600_command_buffer used for dispatching compute shaders and it will be or'd against the result of the PKT3 macro when adding a new packet to a struct r600_command buffer. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-07-09 13:41:17 +00:00
Tom Stellard	7d0c17fe74	r600g: Only emit start_cs_cmd atom once for compute command streams	2012-07-09 13:41:17 +00:00
Marek Olšák	0a21b561c7	r600g: fix stencil texturing with Z32_FLOAT_S8X24_UINT	2012-07-09 13:58:00 +02:00
Marek Olšák	a460df9299	r600g: add assertions after translate_colorswap/colorformat/dbformat/texformat	2012-07-09 13:57:59 +02:00
Marek Olšák	c1e8c845ea	r600g: inline r600_hw_copy_region	2012-07-09 13:57:59 +02:00
Marek Olšák	9974e9ac5d	r600g: enable dual src blending on r7xx No lockups here.	2012-07-09 13:57:59 +02:00
Marek Olšák	6657a7af61	r600g: use depth format from pipe_surface, not pipe_resource	2012-07-09 13:57:59 +02:00
Marek Olšák	b278aba423	r600g: use u_box_origin_2d helper function	2012-07-09 13:57:59 +02:00
Marek Olšák	1f50f463eb	gallium/u_blitter: consolidate some state changes	2012-07-09 13:57:59 +02:00
Marek Olšák	22d032707e	r600g: remove stray semicolon	2012-07-07 15:09:57 +02:00
Marek Olšák	461e9f99c7	docs: document ARB_blend_func_extended and EXT_texture_rg in relnotes-8.1 also sort the extensions	2012-07-07 15:09:57 +02:00
Eric Anholt	1e28f55ab7	i965/fs: Invalidate live intervals after copy propagation. For copy propgation, we've dropped the use of a GRF in favor of a (probably later) use of a different GRF. This definitely requires invalidating intervals. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-06 14:20:33 -07:00
Eric Anholt	2343fe9a5d	i965/fs: Invalidate live intervals in passes that remove an instruction. Since live intervals are based on ip, removing an instruction trashes the intervals unless we were to go do some surgery. These happen to usually remove a use of a grf, so it's time to recalculate, anyway. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> NOTE: This is a candidate for the 8.0 release branch.	2012-07-06 14:20:33 -07:00
Eric Anholt	25ca9cc823	i965/vs: Move the other two src_reg/dst_reg constructors to brw_vec4.cpp. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-06 14:20:33 -07:00
Eric Anholt	b2f5d4c3ec	i965/vs: Move class functions to brw_vec4.cpp. This has less impact than for the FS (4k savings), because it was partially done already, but makes things more consistent. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-06 14:20:32 -07:00
Eric Anholt	fe27916ddf	i965/fs: Move class functions from the header to .cpp files. Cuts compile time for brw_fs.h changes from 2.7s to .7s and reduces i965_dri.so size by 70k. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-06 14:20:32 -07:00
José Fonseca	8b1f1900d1	galahad: Check that texture format is supported.	2012-07-06 20:38:41 +01:00
José Fonseca	ff8ddf399a	galahad: More detailed resource checks.	2012-07-06 20:22:29 +01:00
José Fonseca	f8e13e6d69	galahad: Fix zealous warnings.	2012-07-06 20:12:56 +01:00
José Fonseca	7bd926af89	galahad: Enumerate all methods that are missing.	2012-07-06 19:13:44 +01:00
José Fonseca	3d2550be9c	galahad: Implement render_condition.	2012-07-06 18:45:14 +01:00
José Fonseca	5b45775e41	galahad: Don't implement context methods that are not implemented by the underlying pipe driver.	2012-07-06 18:38:51 +01:00
José Fonseca	3cb994afca	galahad: Use debug_printf. stderr is not visible on windows.	2012-07-06 18:38:39 +01:00
José Fonseca	1abb070633	galahad: Silence creation messages. Let galahad warnings be true warnings.	2012-07-06 18:37:48 +01:00
José Fonseca	d78dee1671	galahad: Use reference counting when destroying the wraped objects. As the wrapped pipe driver may hold internal references.	2012-07-06 18:35:44 +01:00
José Fonseca	fe602da63f	galahad: Point to the galahad objects from the galahad sampler view. And not the wraped driver's objects.	2012-07-06 18:35:32 +01:00
José Fonseca	04d29afb8b	galahad: Don't defer index buffer when it's NULL.	2012-07-06 17:02:39 +01:00
José Fonseca	232073b0d9	target-helpers: Enable debug helpers only on debug builds. Some of these helpers use debug_get_option, which works also on releases.	2012-07-06 15:05:16 +01:00
Marek Olšák	c445b0f76d	st/mesa: only expose ARB_shader_bit_encoding with GLSL 1.3 I don't think it's possible or even useful to use the extension with GLSL 1.2. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-06 00:45:38 +02:00
Kristian Høgsberg	5f5746a692	egl_dri2: Reorganize the EGLImage constructors to share more code We factor out all the EGL book-keeping into dri2_create_image() and simplify the wayland case by using dupImage. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-05 14:22:07 -04:00
Kristian Høgsberg	1bb15c0a08	intel: Share common __DRIimage allocation code We have the same switch and allocation code in two places. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-05 14:22:07 -04:00
Kristian Høgsberg	454fc07dde	intel: Just look up image->internal_format using _mesa_get_format_base_format Signed-off-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-05 14:22:07 -04:00
Kristian Høgsberg	e408c17767	intel: Remove unused __DRIimage::data_type field Signed-off-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-05 14:22:06 -04:00
Brian Paul	bbe92dc608	svga: whitespace fixes	2012-07-05 08:07:26 -06:00
Brian Paul	76a6801240	Revert "mesa: #define fprintf to be __mingw_fprintf() on Mingw32" This reverts commit `cbffaf20e9`. Use the PRIx64 macro in the fprintf() call instead, as suggested by Dylan Noblesmith. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:07:26 -06:00
Brian Paul	df2d81ea59	mesa: use the PRIx64 macro for printing 64-bit hexadecimal values We'll revert the #define fprintf __mingw_fprintf change next. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:07:25 -06:00
Brian Paul	1ab37a2284	svga: implement TGSI_OPCODE_ROUND ROUND and TRUNC are implemented with one function to reduce code duplication. Note: ROUND isn't actually used yet, but probably will be soon. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:07:03 -06:00
Brian Paul	d594f72e16	svga: fix CMP translation for vertex shaders Converting CMP to SLT+LRP didn't work when src2 or src3 was Inf/NaN. That's the case for GLSL sqrt(0). sqrt(0) actually happens in many piglit auto-generated tests that use the distance() function. v2: remove debug/devel code, per Jose Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:03:19 -06:00
Brian Paul	30f8575fde	svga: properly implement TRUNC instruction Was previously implemented with FLOOR. Fixes quite a few piglit tests of float->int conversion, integer division, etc. v2: clean up left over debug/devel code, per Jose Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:03:19 -06:00
Brian Paul	0bd3a75de9	svga: fix register collision issue in emit_conditional() If the 'dst' register is the same as the 'pass' register we'll generate invalid code. Use a temporary register in that case. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-07-05 08:03:19 -06:00
Brian Paul	9b3d87b092	svga: emit some debug messages when shader compilation fails	2012-07-05 07:59:20 -06:00
Eric Anholt	33526a2ffe	intel: Fix a comment typo.	2012-07-04 13:59:14 -07:00
Gwenole Beauchesne	69f031cc19	mesa: add GL_EXT_texture_rg extension for OpenGL ES 2.x.	2012-07-04 15:26:22 -04:00
Kristian Høgsberg	3ed8d42853	GLES2: upgrade gl2ext.h to version 18099 Redo this commit, and remove the inclusion of gl2ext.h from src/mapi/glapi/glapi_priv.h. The include was added in `8f3be33985` to fix a missing prototype for glDrawBuffersNV and others, but it's not possible to include both glext.h and gl2ext.h from the same file. I don't see the missing prototype here (with or without shared glapi) so I'm just removing the offending #include. Also, since we're redoing this, update to the most recent gl2ext.2. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2012-07-04 15:26:22 -04:00
Olivier Galibert	e620f3e763	mesa/st: gl_ClipDistance must be interpolated in 3d space. That old bug was hidden but the clipper always interpolating in 3d space no matter what it should have been doing. Now that the interpolation has been fixed, the bug shows up. Fixes fdo 51364. Signed-off-by: Olivier Galibert <galibert@pobox.com> Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-07-04 10:47:14 +01:00
Stuart Abercrombie	95ce454c8c	gallium/util: Save and restore vertex buffer state in util_gen_mipmap. Calling glGenerateMipmap could overwrite vertex buffer state, leading to incorrect rendering or crashes depending on the Gallium driver. This was happening on WebGL Conformance test texture-size. Before `784dd51198` this was covered up by redundant vertex buffer validation. Reviewed-by: Stéphane Marchesin <marcheu@chromium.org> Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-07-04 03:48:29 +02:00
Marek Olšák	567fcd2eb9	Revert "GLES2: upgrade gl2ext.h to version 16994." This reverts commit `8818b88748`. I get a lot of errors like this one: In file included from ../../../src/mapi/glapi/glapi_priv.h:49:0, from glapi_dispatch.c:40: ../../../include/GLES2/gl2ext.h:1074:28: error: redefinition of typedef ‘PFNGLRENDERBUFFERSTORAGEMULTISAMPLEEXTPROC’ ../../../include/GL/glext.h:10237:25: note: previous declaration of ‘PFNGLRENDERBUFFERSTORAGEMULTISAMPLEEXTPROC’ was here This with a clean build (with git clean -fdX). I don't get the errors on my other machine. I didn't investigate why, a wild guess is that this depends on the version of gcc.	2012-07-04 01:40:05 +02:00
Marek Olšák	2668aaa557	Revert "mesa: add GL_EXT_texture_rg extension for OpenGL ES 2.x." This reverts commit `d1665388ce`.	2012-07-04 01:39:52 +02:00
Gwenole Beauchesne	d1665388ce	mesa: add GL_EXT_texture_rg extension for OpenGL ES 2.x.	2012-07-03 16:23:38 -04:00
Gwenole Beauchesne	8818b88748	GLES2: upgrade gl2ext.h to version 16994.	2012-07-03 16:23:38 -04:00
Eric Anholt	dd4282e38f	i965/fs: Allow copy propagation on uniforms. This is a big win for savage2, hon and yofrankie. 62 new programs for savage2/hon get 16-wide mode, along with one for humus demos and two for tropics. Even a few shaders from tropics see reductions of 15% or more. total instructions in shared programs: 216536 -> 207353 (-4.24%) instructions in affected programs: 123941 -> 114758 (-7.41%) In benchmarking Tropics, only a .040% +/- 034% performance improvement was observed (n=90). Rather disappointing, but I was primarily motivated to do this patch by a regression in the number of 16-wide shaders compiled after a GRF texturing on IVB patch I'm working on. Hopefully this helps avoid that regression. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-03 12:57:10 -07:00
Eric Anholt	0c4630bae0	i965/fs: Allow copy propagation with source modifiers. This shaves a few instructions off of a ton of programs. For 12 shaders from tropics and sanctuary, it's enough reduction in register pressure to get 16-wide mode. 7 shaders from heroes of newerth and savage2 are hurt by about 1.1%, where copy propagation of negates ends up preventing coalescing, but we could regain that by doing dataflow analysis in our copy propagation. No significant performance difference in tropics (n=11) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-03 12:57:04 -07:00
Eric Anholt	458f7f0141	i965/fs: Move copy propagation test out to a separate function. It's going to get more complicated in a moment. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-03 12:55:47 -07:00
Ian Romanick	5fb178ee43	glx/tests: Fix off-by-one error in allocating extension string buffer NOTE: This is a candidate for the 8.0 release branch. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50621 Bugzilla: https://bugs.gentoo.org/show_bug.cgi?id=418161 Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: Markus Oehme <oehme.markus@gmx.de>	2012-07-03 12:28:45 -07:00
Brian Paul	1853f467c6	glsl: fix unop/binop errors in comments	2012-07-03 09:42:59 -06:00
Paul Berry	f34764ea53	msaa: Make meta-ops save and restore state of GL_MULTISAMPLE. The meta-ops _mesa_meta_Clear() and _mesa_meta_glsl_Clear() need to ignore the state of GL_SAMPLE_ALPHA_TO_COVERAGE, GL_SAMPLE_ALPHA_TO_ONE, GL_SAMPLE_COVERAGE, GL_SAMPLE_COVERAGE_VALUE, and GL_SAMPLE_COVERAGE_INVERT when clearing multisampled buffers. The easiest way to accomplish this is to disable GL_MULTISAMPLE during the clear meta-ops. Note: this patch also causes GL_MULTISAMPLE to be disabled during _mesa_meta_GenerateMipmap() and _mesa_meta_GetTexImage() (since those two meta-ops use MESA_META_ALL). Arguably this isn't strictly necessary, since those meta-ops use their own non-MSAA fbo's, but it shouldn't do any harm. Fixes Piglit tests "EXT_framebuffer_multisample/clear {2,4} {color,stencil}" on i965. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-07-02 14:09:27 -07:00
Paul Berry	8313f44409	i965/msaa: Fix centroid interpolation of unlit pixels. From the Ivy Bridge PRM, Vol 2 Part 1 p280-281 (3DSTATE_WM: Barycentric Interpolation Mode): "Errata: When Centroid Barycentric mode is required, HW may produce incorrect interpolation results when a 2X2 pixels have unlit pixels." To work around this problem, after doing centroid interpolation, we replace the centroid-interpolated values for unlit pixels with non-centroid-interpolated values (which are interpolated at pixel centers). This produces correct rendering at the expense of a slight increase in shader execution time. I've conditioned the workaround with a runtime flag (brw->needs_unlit_centroid_workaround) in the hopes that we won't need it in future chip generations. Fixes piglit tests "EXT_framebuffer_multisample/interpolation {2,4} {centroid-deriv,centroid-deriv-disabled}". All MSAA interpolation tests pass now. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-02 13:27:36 -07:00
Paul Berry	3f929efa28	i965/fs: Add FS_OPCODE_MOV_DISPATCH_TO_FLAGS to fragment shader backend. In order to compute centroid varyings correctly, the fragment shader needs to be able to load the current pixel/sample mask into a flag register. This patch adds an opcode to the fragment shader back-end to do this; the opcode gets translated into the instruction mov(1) f0<1>UW g1.14<0,1,0>UW { align1 WE_all } Since this instruction clobbers f0, instruction scheduling has to treat it the same as instructions that have a conditional modifier. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-02 13:27:36 -07:00
Jordan Justen	8aa78c104a	i965: fix transform feedback with primitive restart When querying GL_PRIMITIVES_GENERATED, if primitive restart is also used, then take the software primitive restart path so GL_PRIMITIVES_GENERATED is returned correctly. GL_TRANSFORM_FEEDBACK_PRIMITIVES_WRITTEN is also updated since it will also affected by the same issue. As noted in brw_primitive_restart.c, with further work we should be able to move this situation back to a hardware handled path. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 11:42:48 -07:00
Kenneth Graunke	14311ef3f2	i965: Re-enable rendering to SNORM formats. Commit `d73f6375f5` fixed the cause of the Piglit failure with ARB_color_buffer_float fragment clamp modes. Now that it's fixed, there's no reason to leave snorm format rendering disabled. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 11:23:37 -07:00
Kenneth Graunke	b1802a2115	glsl: Remove unused ir_loop_jump::loop pointer. Commit `0c005bd7` intended to make ir_loop_jump::mode public, but also accidentally added a new pointer to the enclosing loop. Furthermore, it tried to initialize the new field by adding "this->loop = loop;" to the constructor, but since there is no loop parameter, this only initialized the field to itself---so it will likely be a garbage pointer. A lot of code, such as lower_jumps, allocates new loop jumps without setting this field appropriately, so any uses would probably just crash. Thankfully, there were none, so we can just delete the field. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=51574 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-07-02 11:08:59 -07:00
Kenneth Graunke	d73f6375f5	meta: Don't alter fragment color clamp in DrawPixels(). DrawPixels uses the MESA_META_CLAMP_FRAGMENT_COLOR flag to save/restore the fragment color clamp mode. This is unnecessary since it never alters it. It's also harmful: when the clamp mode is GL_FIXED_ONLY, setting this flag causes _mesa_meta_begin to force it to GL_FALSE, breaking clamping on SNORM formats. DrawPixels should use the user-specified clamp mode and not change it. Fixes Piglit's spec/ARB_color_buffer_float/GL_RGBA8_SNORM-drawpixels test on i965/Sandybridge (with SNORM render targets re-enabled). Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 11:08:48 -07:00
Marek Olšák	9f0f2f9512	mesa: use FLUSH_CURRENT and not FLUSH_VERTICES in _mesa_validate_* ASSERT_OUTSIDE_BEGIN_END_AND_FLUSH_WITH_RETVAL calls FLUSH_VERTICES, which is not what we want. This fixes a breakage in classic drivers, introduced in: `62b9716739` vbo: first ASSERT_OUTSIDE_BEGIN_END then FLUSH, not the other way around It should fix: https://bugs.freedesktop.org/show_bug.cgi?id=51629 https://bugs.freedesktop.org/show_bug.cgi?id=51642 Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-02 17:48:36 +02:00
Dylan Noblesmith	876889b355	mesa: point to Makefile.old in the srcdir Gets out-of-tree builds slightly closer to working. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 15:14:46 +00:00
Dylan Noblesmith	91ecba9d05	mesa: fix parser source gen for out-of-tree builds Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 15:14:39 +00:00
Dylan Noblesmith	261b1389eb	mesa: fix api source gen for out-of-tree builds Add $(srcdir) where needed. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 15:14:27 +00:00
Dylan Noblesmith	43bca86c1b	glapi/gen: fix out of tree build Add "-f $(srcdir)/gl_API.xml" to the arguments of all the scripts that by default look for gl_API.xml in the working directory when run with no arguments, and prepend $(srcdir) to those scripts that are already using an explicit -f argument. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-02 15:13:58 +00:00
José Fonseca	f5c41e16d7	gallium/tgsi: Don't declare temps individually when they are all similar. tgsi_ureg was recently enhanced to support local temporaries, and as result temps are declared individually. This change avoids many TEMP register declarations on common shaders. (And fixes performance regression due to mismatches against performance sensitive shaders.) Reviewed-by: Brian Paul <brianp@vmware.com>	2012-07-02 12:14:53 +01:00
José Fonseca	e75fe7ba08	gallivm: Cleanup the 4 x float -> 16 ub special path in lp_build_conv. No behaviour change intended. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-07-02 12:13:52 +01:00
José Fonseca	63e0e4b8f5	gallium/util: Add ULL suffix to large constants. As suggested by Andy Furniss: it looks like some old gcc versions require it.	2012-07-02 12:12:42 +01:00
Tom Stellard	1d21bd057a	clover: Handle NULL devs argument in clBuildProgram If devs is NULL, then the kernel should be compiled for all devices associated with the program.	2012-07-01 15:45:24 +02:00
Francisco Jerez	c6bb41c28b	clover: Define non-templated copy constructor for clover::ref_ptr. The templated copy constructor doesn't prevent the compiler from emitting a default copy constructor, which leads to inconsistent memory handling and was reported to cause segfaults when doing event manipulation. Reported-by: Tom Stellard <thomas.stellard@amd.com>	2012-07-01 15:37:30 +02:00
Brian Paul	db2b6ca504	llvmpipe: fix comment typo	2012-06-29 17:19:12 -06:00
Brian Paul	9dfe92019a	st/mesa: use DEBUG_INCOMPLETE_FBO debug flag	2012-06-29 17:19:12 -06:00
Brian Paul	b186a9df32	mesa: remove some unused gl_dlist_state fields	2012-06-29 17:19:12 -06:00
Tom Stellard	ca8fa02308	clover: Add a function internalizer pass before LTO v2 The function internalizer pass marks non-kernel functions as internal, which enables optimizations like function inlining and global dead-code elimination. v2: - Pass vector arguments by const reference	2012-06-29 18:46:18 +00:00
Tom Stellard	a31b2f7107	radeon/llvm: Enable vec4 loads on R600	2012-06-29 18:46:18 +00:00
Tom Stellard	e17c586d08	radeon/llvm: Enable floating point stores on R600	2012-06-29 18:46:18 +00:00
Tom Stellard	b66ef1f48c	radeon/llvm: Handle floating point loads on R600	2012-06-29 18:46:18 +00:00
Tom Stellard	c01199dfc0	radeon/llvm: Expand UDIV and UREM nodes	2012-06-29 18:46:18 +00:00
Tom Stellard	2c485cda20	radeon/llvm: Emit raw ISA for vertex fetch instructions	2012-06-29 18:46:18 +00:00
José Fonseca	16e0ebccb6	gallium/util: Truly disable INF/NAN tests on MSVC. Thanks to Brian for spotting this.	2012-06-29 14:49:23 +01:00
José Fonseca	c9bada497c	gallium/util: Disable INF/NAN tests on MSVC. Somehow they are not recognized as constants.	2012-06-29 13:39:07 +01:00
José Fonseca	fa8dcb848f	translate: Free elt8_func/elt16_func too. These were leaking. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-06-29 12:21:08 +01:00
James Benton	6dd8e6f9cb	util: Reimplement half <-> float conversions. Removed u_half.py used to generate the table for previous method. Previous implementation of float to half conversion was faulty for denormalised and NaNs and would require extra logic to fix, thus making the speedup of using tables irrelevant. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-29 12:21:02 +01:00
James Benton	c8d3481cdb	tests: Updated tests to properly handle NaN for half floats. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-29 12:20:59 +01:00
James Benton	60dca53833	util: Updated u_format_tests to rigidly test half-float boundary values. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-29 12:20:57 +01:00
James Benton	d069d8ef38	util: Added functions for checking NaN / Inf for double and half-floats. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-29 12:20:54 +01:00
James Benton	34075d4133	util: Added util_format_is_array. This function checks whether a format description is in a simple array format. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-29 12:20:37 +01:00
Marek Olšák	fcebb157f0	vbo: optimize validation for glMultiDrawElements Some parameters need to be checked only once. check_valid_to_render needs to be called only once. The validate function is based on the one for DrawElements. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-28 22:46:51 +02:00
Marek Olšák	62b9716739	vbo: first ASSERT_OUTSIDE_BEGIN_END then FLUSH, not the other way around Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-28 22:46:51 +02:00
Marek Olšák	d9eb1a1225	vbo: don't call twice _mesa_valid_to_render in DrawArraysInstancedBaseInstance It's called in _mesa_validate_DrawArraysInstanced already. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-28 22:46:51 +02:00
Marek Olšák	15ac66e331	mesa: rename MaxTransformFeedbackSeparateAttribs to MaxTransformFeedbackBuffers This is a cleanup for ARB_transform_feedback3, where GL_MAX_TRANSFORM_FEEDBACK_BUFFERS is introduced for interleaved attribs and has the same meaning as GL_MAX_.._SEPARATE_ATTRIBS for separate attribs. Also, the maximum number of TFB buffers is reduced from 32 to 4, which makes this patch useful even without the extension. I don't know of any hardware which can do more than 4. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-28 22:46:51 +02:00
José Fonseca	638779e445	gallivm: Refactor lp_build_broadcast(_scalar) to share code. Doesn't really change the generated assembly, but produces more compact IR, and of course, makes code more consistent. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-28 20:20:34 +01:00
Johannes Obermayr	bf679ce1dc	gallivm: Fix potential buffer overflowing in strncat. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-06-28 11:47:23 +01:00
Marcin Slusarz	1906d2b46b	nv50: dynamically allocate space for shader local storage Fixes 21 piglit tests: spec/glsl-1.10/execution/variable-indexing/ fs-temp-array-mat4-index-col-row-wr vs-temp-array-mat4-index-col-row-wr vs-temp-array-mat4-index-row-wr spec/glsl-1.20/execution/variable-indexing/ fs-temp-array-mat3-index-col-row-rd fs-temp-array-mat3-index-row-rd fs-temp-array-mat4-col-row-wr fs-temp-array-mat4-index-col-row-rd fs-temp-array-mat4-index-col-row-wr fs-temp-array-mat4-index-row-rd fs-temp-array-mat4-index-row-wr vs-temp-array-mat3-index-col-row-rd vs-temp-array-mat3-index-col-row-wr vs-temp-array-mat3-index-row-rd vs-temp-array-mat3-index-row-wr vs-temp-array-mat4-col-row-wr vs-temp-array-mat4-index-col-row-rd vs-temp-array-mat4-index-col-row-wr vs-temp-array-mat4-index-col-wr vs-temp-array-mat4-index-row-rd vs-temp-array-mat4-index-row-wr vs-temp-array-mat4-index-wr ... and prevents a lot of GPU lockups	2012-06-28 00:01:02 +02:00
Marcin Slusarz	0fceaee4fd	nv50: streamline screen_create error handling Remove macro which changes control flow (it's evil). Make all fail paths print (correct) error message.	2012-06-28 00:01:02 +02:00
Marcin Slusarz	96259b5128	nv50/ir: make colorful ir dump output optional	2012-06-28 00:01:02 +02:00
Brian Paul	9881bf6e69	mesa: more const qualifiers to match the latest glext.h For some reason regular gcc on Linux didn't catch these but the mingw compiler did (generated errors, not warnings). v2: include the changes in src/mapi/ too	2012-06-27 15:37:10 -06:00
Brian Paul	827bdee7d1	glapi: add const qualifier to glShaderSourceARB() parameter Fixes the es2 build with gcc. Note: in glext.h the prototypes for glShaderSource() and glShaderSourceARB() disagree: only the former has the extra const qualifier. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-06-27 15:37:10 -06:00
Jordan Justen	3588098ed8	i965: enable ARB_instanced_arrays extension Set the step_rate value when drawing to implement ARB_instanced_arrays for gen >= 4. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-27 10:35:14 -07:00
Brian Paul	8fb1e4a462	glsl: be more careful about counting varying vars in the linker Previously, we were counting gl_FrontFacing, gl_FragCoord and gl_PointCoord against the limit of varying variables. This prevented some valid shaders from linking. The other potential solution to this is to have the driver advertise more varying vars or set the GLSLSkipStrictMaxVaryingLimitCheck flag. But the above-mentioned variables aren't conventional varying attributes so it doesn't seem right to count them. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-27 11:31:16 -06:00
Andreas Boll	d9d84068e7	docs/helpwanted: add some useful todo lists Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-27 11:19:21 -06:00
Brian Paul	098aa5f9ab	softpipe: fix numFragsEmitted debug code	2012-06-27 07:50:57 -06:00
Brian Paul	81e2a238bc	gallium: minor whitespace, comment changes	2012-06-27 07:50:57 -06:00
Brian Paul	51b0a0b33c	mesa: update glext.h to version 81	2012-06-27 07:50:57 -06:00
Brian Paul	52dd8961eb	mesa: update glxext.h to version 33	2012-06-27 07:50:57 -06:00
Brian Paul	8459f4a63a	mesa: make _mesa_reference_array_object() an inline function As we do for texture objects, buffer objects, etc.	2012-06-27 07:50:57 -06:00
Brian Paul	dcf1dafa9e	mesa: look up enum name for glEnable/Disable errors	2012-06-27 07:50:56 -06:00
Brian Paul	86ccd9aaac	mesa: move TEXGEN defines closer to gl_texgen struct	2012-06-27 07:50:56 -06:00
Brian Paul	4cb3579e52	mesa: rename ColorMaterialBitmask to _ColorMaterialBitmask Since it's a derived field.	2012-06-27 07:50:56 -06:00
Brian Paul	b114ff3783	mesa: re-order, update comments on lighting-related structs	2012-06-27 07:50:56 -06:00
José Fonseca	d1c5ea9207	gallium/util: Fix parsing of options with underscore. For example GALLIVM_DEBUG=no_brilinear which was being parsed as two options, "no" and "brilinear".	2012-06-27 11:16:18 +01:00
James Benton	789436f1e0	gallivm: Added a generic lp_build_print_value which prints a LLVMValueRef. Updated lp_build_printf to share common code. Removed specific lp_build_print_vecX. Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-27 11:16:18 +01:00
Stéphane Marchesin	45fc069600	i915g: Implement sRGB textures Since we don't have them in hw we emulate them in the shader. Although not recommended by the spec it is legit. As a side effect we also get GL 2.1. I think this is as far as we can take the i915.	2012-06-26 23:18:15 -07:00
Brian Paul	3bc39414ab	svga: return 120 for PIPE_CAP_GLSL_FEATURE_LEVEL Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-06-26 17:03:33 -06:00
Brian Paul	ac8613c298	llvmpipe: return 120 for PIPE_CAP_GLSL_FEATURE_LEVEL Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-06-26 17:03:33 -06:00
Carl Worth	d8e61f8f86	glsl: glcpp: Extend testing of #line directives The most recent commit adds support for comments and macro expansion on #line directives. Add testing to verify the new features. Signed-off-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-26 15:23:55 -07:00
Carl Worth	aac78ce823	glsl: glcpp: Move handling of #line directives from lexer to parser. The GLSL specification requires that #line directives be interpreted after macro expansion. Our existing implementation of #line macros in the lexer prevents conformance on this point. Moving the handling of #line from the lexer to the parser gives us the macro expansion we need. An additional benefit is that the preprocessor also now supports comments on the same line as #line directives. Finally, the preprocessor now emits the (fully-macro-expanded) #line directives into the output. This allows the full GLSL compiler to also see and interpret these directives so it can also generate correct line numbers in error messages. Signed-off-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-26 15:23:49 -07:00
Carl Worth	39f8c46eaa	glsl: glcpp: Rename and document _glcpp_parser_expand_if This function is currently used only in the expansion of #if lines, but we will soon be using it more generally (for the expansion of (_glcpp_parser_expand_and_lex_from) and some more documentation. Signed-off-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-26 15:21:16 -07:00
Carl Worth	1db463ce2e	glsl: Consistently use length-based ralloc string functions for info_log. Commit `b823b99ec0` switched from using functions such as ralloc_asprintf and ralloc_strcat to ralloc_asprintf_rewrite_tail. This change maintains the string's length as a aparamter that is updated by the ralloc functions (rather than recomputing it with strlen over and over). However, the change failed to updated two locations (glcpp_error and glcpp_warning), with the result that the string's length wasn't updated by these calls. Then, subsequent calls to other ralloc_asprintf_rewrite_tail would overwrite the text appended by glcpp_error. This commit fixes the two missing updates, and restores line numbers to the output of glcpp error messages, (as noticed by a glcpp unit test case that has been failing since the above-mentioned commit). Signed-off-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-26 15:20:53 -07:00
Carl Worth	c96b8302a3	glsl: glcpp: Allow "#if undefined-macro' to evaluate to false. A strict reading of the GLSL specification would have this be an error, but we've received reports from users who expect the preprocessor to interepret undefined macros as 0. This is the standard behavior of the rpeprocessor for C, and according to these user reports is also the behavior of other OpenGL implementations. So here's one of those cases where we can make our users happier by ignoring the specification. And it's hard to imagine users who really, really want to see an error for this case. The two affected tests cases are updated to reflect the new behavior. Signed-off-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-26 15:20:03 -07:00
Jerome Glisse	b75f1d973c	r600g: enable DUAL_EXPORT mode when possible on r6xx/r7xx DUAL_EXPORT can be enabled on r6xx/r7xx when all CBs use 16-bit export and there is no depth/stencil export. Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-06-27 02:06:55 +04:00
Vadim Girlin	470d00c0e2	r600g: enable DUAL_EXPORT mode when possible It seems DUAL_EXPORT on evergreen may be enabled when all CBs use 16-bit export mode (EXPORT_4C_16BPC), also there should be at least one CB, and the PS shouldn't export depth/stencil. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-06-27 02:06:55 +04:00
Vadim Girlin	0c47d9dcab	r600g: avoid unnecessary shader exports v2 In some cases TGSI shader has more color outputs than the number of CBs, so it seems we need to limit the number of color exports. This requires different shader variants depending on the nr_cbufs, but on the other hand we are doing less exports, which are very costly. v2: fix various piglit regressions Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-06-27 02:06:55 +04:00
Vadim Girlin	4acf71f01e	r600g: cache shader variants instead of rebuilding v3 Shader variants are stored in the list, the key for lookup is based on the states that require different hw shaders - currently it's rctx->two_side (all gpus) and rctx->nr_cbufs (evergreen/cayman, when writes_all property is set). v2: - use simple list instead of keymap as suggested by Marek on irc - call r600_adjust_gprs from r600_bind_vs_shader for r6xx/r7xx (r600_shader_select isn't used for vertex shaders currently) v3: - fix call to r600_adjust_gprs - do it after updating current shader Improves performance for some apps, e.g. FlightGear - see https://bugs.freedesktop.org/show_bug.cgi?id=50360 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2012-06-27 02:06:55 +04:00
Brian Paul	55a89889ba	svga: handle missing PIPE_CAP_x queries And fix incorrect error message for a bad shader type/number. Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-06-26 15:03:44 -06:00
Brian Paul	056e9b4511	llvmpipe: handle more PIPE_CAP_x queries As with the previous commit for softpipe. v2: remove 'default' case to get compile-time warning Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-06-26 15:03:44 -06:00
Brian Paul	7d23dcdacc	softpipe: handle more PIPE_CAP_x queries These all return zero. Add a debug_printf() to catch the default case so we don't accidently mishandle something important in the future. v2: remove 'default' case to get compile-time warning Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-06-26 15:03:43 -06:00
Brian Paul	80efb524ee	svga: return 1 for PIPE_CAP_MIXED_COLORBUFFER_FORMATS This is actually required for GL_ARB_framebuffer_object, but the state tracker doesn't currently check it. Direct3D 9 allows mixed format color buffers with some restrictions. Setting this allows Unigine Heaven 2.5 and 3.0 to run. Tested both on GL and D3D hosts. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com>	2012-06-26 15:03:43 -06:00
Brian Paul	36b3ee2ffc	glsl: fix comment typo	2012-06-26 10:01:03 -06:00
Olivier Galibert	27e94ba4ea	u2f_emit: Fix type parameter in LLVM call. The type is the destination type (i.e. float vector) and not the source type. Fixes piglit fs-{in,de}crement-uint. Signed-off-by: Olivier Galibert <galibert@pobox.com> Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-06-26 16:55:40 +01:00
Paul Berry	6c355cca91	i965/msaa: Set KILL_ENABLE when GL_ALPHA_TO_COVERAGE enabled. i965 hardware needs to be informed of situations in which it's possible for pixels (or samples) to be discarded for reasons other than depth/stencil testing (e.g. due to an explicit "discard" in the fragment shader). One of these situations is when GL_ALPHA_TO_COVERAGE is enabled, since that can cause samples to be discarded by the color calculator when the pixel's alpha value is less than 1.0. Without this patch, GL_ALPHA_TO_COVERAGE does not take effect on depth buffers. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-06-26 07:45:54 -07:00
Paul Berry	bc53e14d98	i965/msaa: Implement GL_SAMPLE_ALPHA_TO_{COVERAGE,ONE}. This patch enables the multisampling parameters GL_SAMPLE_ALPHA_TO_COVERAGE and GL_SAMPLE_ALPHA_TO_ONE, which allow the fragment shader's alpha output to be converted into a sample coverage mask and ignored for blending. i965 supports these parameters through the BLEND_STATE structure. The GL spec allows, but does not require, the implementation to dither the conversion from alpha to a sample coverage mask, so that alpha values that aren't a multiple of 1/num_samples result in the correct proportion of samples being lit. A bit exists in the BLEND_STATE structure to enable this functionality, but according to the hardware docs it must be disabled on Sandy Bridge (see the Sandy Bridge PRM, Vol2, Part1, p379: AlphaToCoverage Dither Enable). So it is enabled for Gen7 only. Fixes piglit tests "EXT_framebuffer_multisample/sample-alpha-to-{coverage,one} {2,4}". Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-06-26 07:45:54 -07:00
Paul Berry	9ea60ce58f	i965/msaa: Implement glSampleCoverage. This patch enables glSampleCoverage() functionality, which allows the client program to specify that only a portion of the samples be lit up when performing multisampled rendering. i965 supports glSampleCoverage() through the 3DSTATE_SAMPLE_MASK command packet, which allows the driver to specify a bitfield indicating which samples to light up. Fixes piglit tests "EXT_framebuffer_multisample/sample-coverage {2,4} {inverted,non-inverted}". Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-06-26 07:45:54 -07:00
José Fonseca	4bde1ba7fb	st/wgl: Add a few more comments.	2012-06-26 10:15:36 +01:00
Marek Olšák	cc2cd8b356	r600g: don't disable streamout if it hasn't been started	2012-06-26 03:37:24 +02:00
Marek Olšák	496399d8e9	u_blitter: disable streamout before rendering This fixes piglit EXT_transform_feedback tests: - intervening-read output - intervening-read prims_written	2012-06-26 03:37:23 +02:00
Chad Versace	cf0bbb30f6	i965/fs: Fix conversions float->bool, int->bool Fixes gles2conform GL.equal.equal_bvec2_frag. This fixes brw_fs_visitor's translation of ir_unop_f2b. It used CMP to convert the float to one of 0 or ~0. However, the convention in the compiler is that true is represented by 1, not ~0. This patch adds an AND to convert ~0 to 1. By inspection, a similar problem existed with ir_unop_i2b, with a similar fix. [v2 kayden]: eliminate extra temporary register. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=49621 Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-06-25 15:56:40 -07:00
Brian Paul	345ee593e9	st/wgl: 80-column wrapping	2012-06-25 16:10:01 -06:00
Andreas Boll	19534579cf	docs/lists: add piglit mailing list Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	24eebf4f88	docs/helpwanted: update some info Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	f29f5e8695	docs/sourcetree: update some info Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	b347bb5dbc	docs/devinfo: update release info Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	398d8be3ab	docs/systems: add some useful driver links Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	297309ce23	docs: update some broken/old links Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	dae9b0f1d8	docs: whitespace cleanup Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	ddb0557868	docs: escape html special char Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	a5447aab96	docs: add missing target attribute target is needed for the frame based layout Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Andreas Boll	d52419e0c3	docs/shading: use proper markup use dl instead of ul Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-25 16:10:01 -06:00
Brian Paul	75e62024c3	docs: document the GALLIUM_LOG_FILE env var	2012-06-25 16:10:01 -06:00
Brian Paul	9ccf5bffe3	mesa: new MESA_LOG_FILE env var to log errors, warnings, etc., to a file Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-25 16:10:01 -06:00
Marek Olšák	0f530d2dff	docs: update GL3.3 status	2012-06-25 23:53:49 +02:00
Marek Olšák	4891c5dc64	r600g: inline r600_blit_push_depth and use resource_copy_region We are going to have a separate resource for depth texturing and transfers and this is just a transfer thing.	2012-06-25 23:53:49 +02:00
Marek Olšák	da98bb6fc1	r600g: split flushed depth texture creation and flushing	2012-06-25 23:53:49 +02:00
Paul Berry	d1056541e2	i965/msaa: Add backend support for centroid interpolation. This patch causes the fragment shader to be configured correctly (and the correct code to be generated) for centroid interpolation. This required two changes: brw_compute_barycentric_interp_modes() needs to determine when centroid barycentric coordinates need to be included in the pixel shader thread payload, and fs_visitor::emit_general_interpolation() needs to interpolate using the correct set of barycentric coordinates. Fixes piglit tests "EXT_framebuffer_multisample/interpolation {2,4} centroid-edges" on i965. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-25 11:03:26 -07:00
Paul Berry	cf0e7aa9f8	i965/fs: Refactor interpolation code to prepare for adding centroid support. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-25 11:03:26 -07:00
Paul Berry	6d7ebb21f8	i965/msaa: Adapt clip setup for centroid noperspective interpolation. To save time, we only instruct the clip stage of the pipeline to compute noperspective barycentric coordinates if those coordinates are needed by the fragment shader. Previously, we would determine whether the coordinates were needed by seeing whether the fragment shader used the BRW_WM_NONPERSPECTIVE_PIXEL_BARYCENTRIC interpolation mode. However, with MSAA, it's possible that the fragment shader might use BRW_WM_NONPERSPECTIVE_CENTROID_BARYCENTRIC instead. In the future, when we support ARB_sample_shading, it might use BRW_WM_NONPERSPECTIVE_SAMPLE_BARYCENTRIC. This patch modifies the upload_clip_state() functions to check for all three possible noperspective interpolation modes. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-25 11:03:26 -07:00
Paul Berry	bebb043811	glsl: Add IsCentroid bitfield to gl_fragment_program. This bitfield tells the back-ends which of a fragment shader's inputs require centroid interpolation. It is only set for GLSL fragment shaders, since assembly fragment shaders don't support centroid interpolation. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-25 11:03:26 -07:00
Brian Paul	2a4af651e6	st/mesa: added some simple fbo debugging/helper code	2012-06-25 11:28:03 -06:00
Brian Paul	45df3eb1db	llvmpipe: fix the LP_NO_RAST debug option It was only no-oping the clear() function, not actual triangle rasterization. Move the no_rast field from lp_context down into lp_rasterizer so it's accessible where it's needed. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-06-25 08:14:33 -06:00
Vinson Lee	37d699a296	scons: Add glsl/glcpp to the include path. Fixes this build failure on Solaris. Compiling build/sunos-debug/glsl/glcpp/glcpp-lex.c ... "src/glsl/glcpp/glcpp-lex.l", line 30: cannot find include file: "glcpp-parse.h" Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-23 13:40:09 -07:00
Laurent Carlier	78ac9af580	automake: add missing inclusion of GL headers Building fail when GL headers are not installed in the system, so add inclusion of these headers. Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-22 17:24:37 -06:00
Brian Paul	cbffaf20e9	mesa: #define fprintf to be __mingw_fprintf() on Mingw32 So that formats such as "%llx" are understood. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-22 17:24:37 -06:00
Brian Paul	fe68af6e0d	svga: init pointer to NULL to silence MSVC warning	2012-06-22 17:24:37 -06:00
Tom Stellard	ea76f03310	clover: Add --with-clang-libdir option and verify CLANG_RESOURCE_DIR $CLANG_RESOURCE_DIR is the directory that contains all resources needed by clang to compile programs. When clover uses clang to compile kernels it needs to specify a resource dir, so that clang can find its internal headers (e.g. stddef.h). clang defines $CLANG_RESOURCE_DIR as $CLANG_LIBDIR/clang/$CLANG_VERSION This patch adds the --with-clang-libdir option in order to accommodate clang intalls to non-standard locations, and it also adds a check to the configure script to verify that $CLANG_RESOURCE_DIR/include contains the necessary header files.	2012-06-22 16:59:24 -04:00
Paul Berry	82d25963a8	i965: Compute dFdy() correctly for FBOs. On i965, dFdx() and dFdy() are computed by taking advantage of the fact that each consecutive set of 4 pixels dispatched to the fragment shader always constitutes a contiguous 2x2 block of pixels in a fixed arrangement known as a "sub-span". So we calculate dFdx() by taking the difference between the values computed for the left and right halves of the sub-span, and we calculate dFdy() by taking the difference between the values computed for the top and bottom halves of the sub-span. However, there's a subtlety when FBOs are in use: since FBOs use a coordinate system where the origin is at the upper left, and window system framebuffers use a coordinate system where the origin is at the lower left, the computation of dFdy() needs to be negated for FBOs. This patch modifies the fragment shader back-ends to negate the value of dFdy() when an FBO is in use. It also modifies the code that populates the program key (brw_wm_populate_key() and brw_fs_precompile()) so that they always record in the program key whether we are rendering to an FBO or to a window system framebuffer; this ensures that the fragment shader will get recompiled when switching between FBO and non-FBO use. This will result in unnecessary recompiles of fragment shaders that don't use dFdy(). To fix that, we will need to adapt the GLSL and NV_fragment_program front-ends to record whether or not a given shader uses dFdy(). I plan to implement this in a future patch series; I've left FIXME comments in the code as a reminder. Fixes Piglit test "fbo-deriv". NOTE: This is a candidate for stable release branches. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-22 07:59:34 -07:00
Brian Paul	d988ea5e81	mesa: minor transform feedback comments	2012-06-22 08:48:45 -06:00
Brian Paul	09af5783b3	mesa: fix comments on UBO buffer binding functions The old comments were for transform feedback.	2012-06-22 08:44:00 -06:00
Olivier Galibert	b8068afafa	draw: Handle the case when there isn't a fragment shader. Signed-off-by: Olivier Galibert <galibert@pobox.com> Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-06-22 09:58:39 +01:00
Zack Rusin	af98c6b05b	mesa: update the emacs indent files dirvars package has been replaced by built-in functionality of dir-locals. preserve the settings in the new infrastructure	2012-06-21 17:29:11 -04:00
Tom Stellard	ff2b417245	r600g: Unify SURFACE_SYNC packet emission for 3D and compute Drop the compute specific evergreen_set_buffer_sync() function and instead use the r600_surface_sync_command atom for emitting SURFACE_SYNC packets.	2012-06-21 20:42:07 +00:00
Tom Stellard	ff08f1ec6f	r600g: Enable reusing of compute state	2012-06-21 20:42:07 +00:00
Tom Stellard	5cd6ce939d	r600g: Fix reading vtx instruction offset from bytestream	2012-06-21 20:42:07 +00:00
Tom Stellard	563a764110	radeon/llvm: Turn on the BitExtract peephole optimization Thie BitExtract optimization folds a mask and shift operation together into a single instruction (BFE_UINT).	2012-06-21 20:42:06 +00:00
Tom Stellard	c53c8d0555	radeon/llvm: Lower ROTL to BIT_ALIGN	2012-06-21 20:42:06 +00:00
Tom Stellard	cd287301ec	radeon/llvm: Use the VLIW Scheduler for R600->NI It's not optimal, but it's better than the register pressure scheduler that was previously being used. The VLIW scheduler currently ignores all the complicated instruction groups restrictions and just tries to fill the instruction groups with as many instructions as possible. Though, it does know enough not to put two trans only instructions in the same group. We are able to ignore the instruction group restrictions in the LLVM backend, because the finalizer in r600_asm.c will fix any illegal instruction groups the backend generates. Enabling the VLIW scheduler improved the run time for a sha1 compute shader by about 50%. I'm not sure what the impact will be for graphics shaders. I tested Lightsmark with the VLIW scheduler enabled and the framerate was about the same, but it might help apps that use really big shaders.	2012-06-21 20:42:06 +00:00
Brian Paul	b73cf49c91	mesa: set GL_ARB_uniform_buffer_object extension year to 2009	2012-06-21 13:08:34 -06:00
Eric Anholt	cb9f35d16f	mesa: Add a comment explaining my thoughts on glBindBufferBase(). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:58:18 -07:00
Eric Anholt	d103fead19	mesa: Add support for glGetIntegeri_v from GL_ARB_uniform_buffer_object. Fixes piglit ARB_uniform_buffer_object/getintegeri_v. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:58:10 -07:00
Eric Anholt	fb76ddc133	mesa: Add support for glBindBufferBase/Range on GL_UNIFORM_BUFFER. Fixes piglits: GL_ARB_uniform_buffer_object/bindbuffer-general-point. GL_ARB_uniform_buffer_object/negative-bindbuffer-buffer GL_ARB_uniform_buffer_object/negative-bindbuffer-index GL_ARB_uniform_buffer_object/negative-bindbuffer-target GL_ARB_uniform_buffer_object/negative-bindbufferrange-range Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:58:07 -07:00
Eric Anholt	b82c472156	mesa: Move glBindBufferBase and glBindBufferRange() to bufferobj. The rest of the TFB implementation remains in transformfeedback.c, and this will be shared with UBOs. v2: Move the size/offset checks shared with UBOs to common code as well. (Kenneth's review) Reviewed-by: Brian Paul <brianp@vmware.com> (v1) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:58:00 -07:00
Eric Anholt	9627660448	mesa: Move buffer object dispatch setup to bufferobj.c. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:57:58 -07:00
Eric Anholt	5527c2d220	mesa: Add indexed binding points for uniform buffer objects. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:57:56 -07:00
Eric Anholt	c5c696e7fb	mesa: Add support for the GL_UNIFORM_BUFFER general binding point. Fixes piglit ARB_uniform_buffer_object/buffer-targets. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:57:54 -07:00
Eric Anholt	5426b1ade9	mesa: Add state and getters for the GL_ARB_uniform_buffer_object maximums. Fixes piglit GL_ARB_uniform_buffer_object/minmax. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:57:51 -07:00
Vincent Lejeune	3e17d38457	glapi: Add uniform buffer object API v2: Fix a typo spotted by Eric Anholt. v3: Fix missing "GL" on types, fix style, fix Studly_Caps extension name, drop commented code duplicated with GL3x.xml [anholt] Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-21 10:57:45 -07:00
Eric Anholt	37c3cbe053	dricore: Turn it into a normal library. Our intention is still that it's not abi stable, so make the package version number get included in the library name. Now you can parallel install dricore-using drivers from multiple mesa versions. We can put it into lib now that we're following library versioning rules (assuming that ABIs don't change within a single Mesa point release). LD_LIBRARY_PATH still doesn't work with a non-/, non-/usr prefix because libtool uses rpath instead of runpath for nonstandard prefixes.	2012-06-21 10:10:46 -07:00
Eric Anholt	4113ac6a0f	automake: Convert Mesa built sources generation to automake.	2012-06-21 10:10:46 -07:00
Eric Anholt	2d51ac84fd	mesa: Move GL header installation to automake. This cuts some cruft related to osmesa where we were being careful to not install headers twice.	2012-06-21 10:10:46 -07:00
Eric Anholt	1bbd22ada0	automake: Move mesa subdirs processing to automake.	2012-06-21 10:10:46 -07:00
Eric Anholt	39785488e6	automake: Move .pc installation to automake.	2012-06-21 10:10:46 -07:00
Eric Anholt	417c1a6421	automake: Move the master Mesa makefile to Makefile.old. This will let me incrementally move stuff to automake without converting libmesa.a all at once.	2012-06-21 10:10:46 -07:00
Eric Anholt	bd18a236de	automake: Convert osmesa.pc to be generated by configure.	2012-06-21 10:10:43 -07:00
Eric Anholt	fa4cf4dc0c	mesa: Convert gl.pc to be generated by configure. This saves a step of mashing variables around in our Makefile.	2012-06-21 10:10:08 -07:00
Eric Anholt	2d4b77c7c6	automake: Convert src/mesa/drivers/x11/Makefile to automake. The weird versioning of the libGL where the package version was sort of expressed as a big integer is dropped. libtool didn't like the 0 prefix, and it didn't really make sense anyway -- if you interpret it as an integer version number, old Mesa 071200 was bigger than current Mesa 08100. Instead, just bump the minor version and drop the patchlevel.	2012-06-21 10:09:17 -07:00
Eric Anholt	2fb0f770a4	automake: Convert src/gallium/Makefile to automake.	2012-06-21 10:08:26 -07:00
Eric Anholt	27383cbb0b	automake: Convert src/mapi/glapi/gen to silent build.	2012-06-21 10:08:26 -07:00
Eric Anholt	3a70f7526a	automake: Convert src/mapi/glapi/gen/Makefile to automake.	2012-06-21 10:08:24 -07:00
Eric Anholt	d59149d3f4	automake: Convert src/mesa/drivers/Makefile to automake.	2012-06-21 10:07:38 -07:00
Eric Anholt	9ff2709ca5	automake: Directly generate configs/current instead of symlinking from it.	2012-06-21 10:07:38 -07:00
Eric Anholt	95836b46e7	automake: Convert gen_matypes building to automake.	2012-06-21 10:07:36 -07:00
Eric Anholt	acf27121a5	make: Drop HOST_CC and HOST_CFLAGS. Except for the deleted linux-cell target, these were just the target cc/cflags. The only usage was for gen_matypes, which wants the target's structure packing, not the host, anyway.	2012-06-21 09:58:12 -07:00
Eric Anholt	e426949cf1	make: Fold ASM_CFLAGS into DEFINES. Every place that uses ASM_FLAGS already uses DEFINES. Not including it in DEFINES is just a way to screw up potential users, as I've done several times while working on the build system.	2012-06-21 09:58:12 -07:00
Eric Anholt	07b28af5b5	automake: Convert src/egl/Makefile to automake.	2012-06-21 09:58:12 -07:00
Eric Anholt	a4ff3342d2	automake: Don't warn on gmake portability issues. Even pre-automake, we rely on gmake features for pattern substitutions, and replacing those with reams more make code is not interesting. This will let us turn the old Makefiles using pattern substitutions into automake without spewing warnings. Reviewed-by: Dan Nicholson <dbn.lists@gmail.com>	2012-06-21 09:57:52 -07:00
Marcin Slusarz	19fd04f5ea	nv50: fix buffer reuse issues 1) We need to insert a barrier between consecutive transform feedback calls. 2) VBO cache needs to be flushed when TFB output is used as VBO draw input. Fixes Piglit test EXT_transform_feedback/immediate-reuse. Thanks to Christoph Bumiller for pointing out bugs in previous versions of this patch.	2012-06-20 21:24:53 +02:00
Marcin Slusarz	7e63b613a5	st/mesa: fix transform feedback of unsubscripted gl_ClipDistance array gl_ClipDistance needs special treatment in form of lowering pass which transforms gl_ClipDistance representation from float[] to vec4[]. There are 2 implementations - at glsl linker level (enabled by LowerClipDistance option) and at glsl_to_tgsi level (enabled unconditionally for gallium drivers). Second implementation is incomplete - it does not take into account transform feedback (see commit `642e5b413e` "mesa: Fix transform feedback of unsubscripted gl_ClipDistance array" for details). There are 2 possible fixes: - adding transform feedback support into glsl_to_tgsi version - ripping gl_ClipDistance support from glsl_to_tgsi and enabling gl_ClipDistance lowering on glsl linker side This patch implements 2nd option. All it does is: - reverts most of the commit `59be691638` "st/mesa: add support for gl_ClipDistance" - changes LowerClipDistance to true Fixes Piglit tests "EXT_transform_feedback/builtin-varyings gl_ClipDistance[{2,3,4,5,6,7,8}]-no-subscript" at least on nv50 and evergreen cards.	2012-06-20 21:16:20 +02:00
Paul Berry	f2f05e50b1	glx/tests: Fix signed/unsigned comparison warnings.	2012-06-20 11:42:42 -07:00
Paul Berry	cde6544ad7	i965/msaa: Only do multisample rasterization if GL_MULTISAMPLE enabled. From the GL 3.0 spec (p.116): "Multisample rasterization is enabled or disabled by calling Enable or Disable with the symbolic constant MULTISAMPLE." Elsewhere in the spec, where multisample rasterization is described (sections 3.4.3, 3.5.4, and 3.6.6), the following text is consistently used: "If MULTISAMPLE is enabled, and the value of SAMPLE_BUFFERS is one, then..." So, in other words, disabling GL_MULTISAMPLE should prevent multisample rasterization from occurring, even if the draw framebuffer is multisampled. This patch implements that behaviour by setting the WM and SF stage's "multisample rasterization mode" to MSRAST_ON_PATTERN only when the draw framebuffer is multisampled and GL_MULTISAMPLE is enabled. Fixes piglit test spec/EXT_framebuffer_multisample/enable-flag. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-20 11:28:09 -07:00
Paul Berry	3b0279a693	i965/msaa: Disable unsupported formats. Due to hardware limitations, MSAA is unsupported on Gen6 for formats containing >64 bits of data per pixel. From the Sandy Bridge PRM, vol4 part1, p72 ("Surface Format"): If Number of Multisamples is set to a value other than MULTISAMPLECOUNT_1, this field cannot be set to the following formats: - any format with greater than 64 bits per element - any compressed texture format (BC) - any YCRCB format Gen7 has a similar, but less stringent limitation: formats with >64 bits of data per pixel only support 4x MSAA. This patch causes the unsupported formats to report GL_FRAMEBUFFER_UNSUPPORTED. Fixes piglit "multisample-formats" tests on Gen6. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-20 11:28:09 -07:00
Andreas Boll	3becf98424	mesa: remove obsolete confdiff.sh this script is obsolete since `0cc216676c`	2012-06-20 01:51:38 -07:00
Christian König	0f269c5e7b	st/vdpau: use template size as default for source_rect. Fixes alignment problems with flash player. Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-06-20 10:13:38 +02:00
Christian König	d37c3c6ebe	st/vdpau: clear Cb&Cr with 0.5f That makes the output black in case of decoding errors. Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-06-20 10:13:29 +02:00
Kenneth Graunke	2f8351a5ac	i965: Don't set brw_wm_prog_key::iz_lookup on Gen6+. Sandy Bridge and later don't use this field, so there's no point in setting it. It can only cause harmful state-based recompiles. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-19 17:36:48 -07:00
Olivier Galibert	c790c2c759	llvmpipe: Add vertex id support. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 14:40:44 -06:00
Olivier Galibert	46931ecf48	llvmpipe: Simplify and fix system variables fetch. The system array values concept doesn't really because it expects the system values to be fixed per call, which is wrong for gl_VertexID and iffy for gl_SampleID. So this patch does two things: - kill the array, have emit_fetch_system_value directly pick the values it needs (only gl_InstanceID for now, as the previous code) - correctly handle the expected type in emit_fetch_system_value Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 14:40:44 -06:00
Olivier Galibert	4625a9b1ad	draw: fix flat shading and screen-space linear interpolation in clipper This includes: - picking up correctly which attributes are flatshaded and which are noperspective - copying the flatshaded attributes when needed, including the non-built-in ones - correctly interpolating the noperspective attributes in screen-space instead than in a 3d-correct fashion. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 14:40:44 -06:00
Olivier Galibert	cfc5b30941	softpipe: Offset is not to be applied to the layer parameter of array texture fetches. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 14:40:44 -06:00
Brian Paul	fc855ed5d9	st/mesa: clamp glDrawPixels size to max texture size	2012-06-19 14:40:44 -06:00
Brian Paul	7f4786ad29	st/mesa: move st_validate_state() call earlier in st_DrawPixels()	2012-06-19 14:40:44 -06:00
Jerome Glisse	b4f0ab0b22	r600g: fix z/stencil texture creation v2 z or stencil texture should not be created with the z/stencil flags for surface creation as they are intended to be bound as texture. v2: remove broken code Signed-off-by: Jerome Glisse <jglisse@redhat.com>	2012-06-19 15:03:36 -04:00
Török Edwin	988ad7831c	radeon/llvm: Fix CR/LF in Processors.td Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-06-19 16:38:23 -04:00
Török Edwin	7c005d5687	radeon/llvm: Fix sin/cos codegen on R700 Based on https://bugs.freedesktop.org/show_bug.cgi?id=50317#c4 Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=50316 https://bugs.freedesktop.org/show_bug.cgi?id=50317 Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-06-19 16:38:13 -04:00
Fredrik Höglund	4e943c375b	docs: update GL3.txt for ARB_base_instance Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 07:57:22 -06:00
Fredrik Höglund	c4c8c7a8f9	st/mesa: Add support for GL_ARB_base_instance Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 07:57:22 -06:00
Fredrik Höglund	af372129e5	gallium: Add PIPE_CAP_START_INSTANCE Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 07:57:22 -06:00
Fredrik Höglund	ae5d7d5e89	mesa: Add support for GL_ARB_base_instance Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-19 07:57:22 -06:00
Vinson Lee	ee99647e02	scons: Do not build svga if using Solaris Studio C compiler. Solaris Studio C compiler does not support anonymous structs and anonymous unions. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-18 16:37:46 -07:00
Kenneth Graunke	5b83bdc154	i965: Fix brw_swap_cmod() for LE/GE comparisons. The idea here is to rewrite comparisons like 2 >= x with x <= 2; we want to simply exchange arguments, not negate the condition. If equality was part of the original comparison, it should remain part of the swapped version. This is the true cause of bug #50298. It didn't manifest itself on Sandybridge because we embed the conditional modifier in the IF instruction rather than emitting a CMP. All other platforms use CMP. It also didn't manifest itself on the master branch because commit `be5f27a84d` ("glsl: Refine the loop instruction counting.") papered over the problem. NOTE: This is a candidate for stable release branches. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50298 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-18 15:25:31 -07:00
Brian Paul	6f7834ad36	docs: start release notes file for 8.1	2012-06-18 12:39:34 -06:00
Tom Stellard	7fab4b648b	radeon/llvm: Update comment in AMDGPU.td	2012-06-18 18:30:36 -04:00
Tom Stellard	984ad0788c	radeon/llvm: Remove unused AMDIL TableGen definitons	2012-06-18 18:30:36 -04:00
Tom Stellard	34ff22b75f	radeon/llvm: Eliminate getRegClassFromType() function We can use TargetLowering::getRegClassFor() instead.	2012-06-18 18:30:36 -04:00
Tom Stellard	440ab9ea02	radeon/llvm: Remove deadcode from AMDILISelLowering.cpp	2012-06-18 18:30:35 -04:00
Vinson Lee	cd62960a2e	gallium: Add support for Solaris Studio C++ compiler. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-18 10:01:26 -07:00
James Benton	f34e2f484b	llvmpipe: Implement cylindrical wrapping. Tested against mesa demos cylwrap and dx9 DCT address.exe which now passes 100%. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-06-18 17:55:05 +01:00
Vinson Lee	d1acae2bdc	st/glx: Do not undefine _R, _G, and _B. Fixes build error on Cygwin and Solaris. _R, _G, and _B are used in ctype.h on those platforms. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-18 09:42:08 -07:00
Brian Paul	8ae93c68ea	svga: fix synchronization bug between sampler views and surfaces This fixes a bug where a sampler view was using stale texture/resource data when the texture was modified through a surface (render to texture). Bumping the texture and layer ages triggers sampler view revalidation. Fixes piglit fbo-blit failure. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-06-18 10:22:59 -06:00
Kristian Høgsberg	2d7b2d7a87	gles2: Add GL_NV_read_buffer extension This lets us select the front buffer for reading under GLES2. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-18 11:53:18 -04:00
Kristian Høgsberg	e841a2426e	get.c: Rename EXTRA_VERSION_ES2 to EXTRA_API_ES2 This extra condition checks the API not the version of the API, so rename to reflect that. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-18 11:50:53 -04:00
Andreas Boll	1692d3ad94	docs/relnotes: comment out bug template Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-18 08:21:47 -06:00
Andreas Boll	fb918727ef	docs/relnotes: replace tbd with release date Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-18 08:21:47 -06:00
Andreas Boll	b9fad90350	docs/relnotes: fix validation errors Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-18 08:21:47 -06:00
Andreas Boll	207d52eb46	docs/relnotes: consolidate html header Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-18 08:21:47 -06:00
José Fonseca	e48d26bf40	draw: Ensure that the vertex_header type size matches expectation. This is failing sometimes, probably because TargetData keeps a structure layout cache, which can becomes bogus, ever since the InvalidateStructLayoutInfo API was removed in LLVM r135245. This change merely makes the problem easier to diagnose (an assertion failure instead of a random crash).	2012-06-18 12:06:23 +01:00
Marek Olšák	6e7756db14	r600g: enable streamout by default on r7xx and DRM 2.17.0 Now that it's in Linus's tree. Has anyone had a chance to test streamout on Cayman recently?	2012-06-17 18:28:32 +02:00
Marek Olšák	7c3786d780	st/mesa: properly allocate MSAA renderbuffers Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-16 14:20:27 +02:00
Marek Olšák	c760283159	st/mesa: make unsupported renderbuffer formats always fail as FBO incomplete instead of failing to allocate a renderbuffer. This also fixes piglit/get-renderbuffer-internalformat with non-renderable formats. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-16 14:20:27 +02:00
Marek Olšák	e4b2e6b527	st/mesa: separate sw renderbuffer allocation from hw one Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-16 14:20:26 +02:00
Marek Olšák	a82227ce4a	mesa: if AllocStorage doesn't choose a format, report FRAMEBUFFER_UNSUPPORTED This allows drivers not to do any allocation in AllocStorage if the storage cannot be allocated because of an unsupported internalformat + samples combo. The little ugliness is that AllocStorage is expected to return TRUE in this case. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-16 14:20:26 +02:00
Stéphane Marchesin	841eee5d44	i915g: More ops commute. This allows using the optimizations more broadly.	2012-06-15 20:22:26 -07:00
Marek Olšák	cb4d1d377d	r600g: fix lockups with streamout on r7xx This requires the latest streamout kernel patches. Streamout is disabled by default on r7xx, so this patch is safe for regular users. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-15 22:43:00 +02:00
Marek Olšák	f01594be0e	r600g: compute CS space for streamout correctly, add comments SET_CONTEXT_REG was not counted in. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-15 22:43:00 +02:00
Marek Olšák	bb07e25131	r600g: set SMX_ACTION_ENA to fix streamout cache flushes on some chipsets It helps on R7xx. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-15 22:42:59 +02:00
Alexey Shvetsov	f56f03428d	clover: Fix build with LLVM libs installed to non-standard directories Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Tom Stellard <thomas.stellard@amd.com>	2012-06-15 13:22:16 -04:00
Marek Olšák	5e7e7d96b3	st/mesa: don't do srgb->linear conversion in decompress_with_blit This fixes piglit/getteximage-formats on r600g. NOTE: This is a candidate for stable branches. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-15 18:54:14 +02:00
Paul Berry	4d9c3cbce9	glsl: Use ir_unop_f2u to convert floats to uints. Fixes piglit tests spec/glsl-1.30/execution/{vs,fs}-float-uint-conversion on i965. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	9d57d483cb	gallium: Add TGSI_OPCODE_F2U to gallivm backend. Note: for the moment TGSI_OPCODE_F2U is implemented using lp_build_itrunc() (the same function used to implement TGSI_OPCODE_F2I). In the long run, we should create an lp_build_utrunc() function to do the proper conversion. But this should allow us to limp along with mostly correct behaviour for now.	2012-06-15 08:58:55 -07:00
Paul Berry	1be7661110	gallium: Add support for ir_unop_f2u to tgsi backend. Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	fa584c50cf	ir_to_mesa: Add support for ir_unop_f2u to ir_to_mesa backend. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	11a7b93592	i965: Add support for ir_unop_f2u to i965 backend. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	613a8170ae	glsl: Add support for ir_unop_f2u to constant folding. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	8e31f961e6	glsl: Add unary operation ir_unop_f2u. Previously, we performed conversions from float->uint by a two step process: float->int->uint. However, on platforms that use saturating conversions (e.g. i965), this didn't work, because if the source value was larger than the maximum representable int (0x7fffffff), then converting it to an int would clamp it to 0x7fffffff. This patch just adds the new opcode; further patches will adapt optimization passes and back-ends to use it, and then finally the ast_to_hir logic will be modified to emit the new opcode. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-15 08:58:55 -07:00
Paul Berry	75f409d75c	i965/blorp: Implement source clipping. This patch modifies blorp blits (which are used for MSAA) to properly account for clipping of source coordinates. Previously, if we detected the possibility of source clipping, we would fall back to the blit meta-op, which doesn't support MSAA and is very slow for depth and stencil buffers. Fixes piglit tests "EXT_framebuffer_multisample/clip-and-scissor-blit" on i965/Gen6+. Also substantially speeds up the Humble Bundle V game "Psychonauts" on Gen6+ (without this patch, the game's depth buffer blits use the slow blit meta-op). Reviewed-by: Carl Worth <cworth@cworth.org>	2012-06-15 08:58:54 -07:00
Brian Paul	4d9f263d7c	scons: add st_atom_array.c to the build	2012-06-15 09:31:33 -06:00
Christian König	92af184690	winsys/radeon: enable IB submission to compute rings v2 This allows to submit things to the compute only rings on cayman+ v2: rebased on current master and actually make use of the new flag in evergreen_compute.c Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-06-15 09:52:38 +02:00
Marek Olšák	b4753dafcc	st/mesa: atomize vertex array state This moves the state validation to where all the other states are validated.	2012-06-15 03:15:50 +02:00
Maarten Lankhorst	6bb0151f1f	winsys/radeon: Remove unnecessary pipe_thread_destroy in radeon_drm_cs_destroy Fixes crash bug introduced with `210ddf0819` fd.o #49198 pthread_detach after a pthread_join is unneeded. Signed-off-by: Maarten Lankhorst <m.b.lankhorst@gmail.com> Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-06-15 03:01:23 +02:00
Marcin Slusarz	fc782bcbf0	nv50,nvc0: fix stream output target buffer leak It manifests at exit as: "WARNING: destroying GPU memory cache with some buffers still in use"	2012-06-14 23:38:28 +02:00
Christoph Bumiller	169a0ae40a	nv50: disable stream output before reconfiguring it If we don't, the GPU will just throw an ILLEGAL_OPERATION error.	2012-06-14 23:30:49 +02:00
Christoph Bumiller	ef51ce522b	nv50/ir: handle NEG,ABS modifiers for short RCP encoding	2012-06-14 23:25:48 +02:00
Brian Paul	f677954e07	st/mesa: fix glDrawPixels(GL_DEPTH_COMPONENT) color output When drawing a depth image the fragment shader also needs to emit the current raster color. The new piglit drawpix-z test exercises this. NOTE: This is a candiate for the 8.0 branch.	2012-06-14 14:37:31 -06:00
Brian Paul	8031aa134e	docs: add info about shortlog_mesa.sh script	2012-06-14 14:37:31 -06:00
Paul Berry	4b7b4c46c5	glx/tests and mesa/tests: Update .gitignore files. This patch updates .gitignore files to account for the new build artifacts introduced by the following commits: `ae376f0` glx/tests: Rename test as glx-test `8fecdcc` mesa/tests: Add tests for _mesa_lookup_enum_by_{name,nr} functions `a29ad2b` mesa/tests: Add tests for the generated dispatch table	2012-06-14 10:08:57 -07:00
Christian König	eb024c7488	st/vdpau: fix YCbCr down/up-loads for buffers larger than requested When the video buffer turns out to be larger than requested by the application we shouldn't upload or download more data into / from it original requested. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=39309 Signed-off-by: Christian König <deathsimple@vodafone.de>	2012-06-14 17:54:04 +02:00
Alexander von Gluck IV	cb3054c849	scons: Fix Haiku binary optimizations Haiku targets the Pentium or higher processor. To ensure compatibility we can do march 586 and mtune 686. Mesa will still use sse however if the cpu supports it (and the stack is properly aligned). These flags only effect the internal compiler optimizations.	2012-06-14 08:08:17 -07:00
Andreas Boll	c1dcf9665c	mesa: fix html in shortlog_mesa.sh script Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-14 08:25:42 -06:00
Brian Paul	51c9c67a2f	mesa: added Ian's shortlog_mesa.sh script in bin/	2012-06-14 08:22:54 -06:00
Brian Paul	5234b8902c	svga: make svga_surface_needs_propagation() surface const	2012-06-14 08:20:40 -06:00
Brian Paul	92b65637ab	svga: add svga_surface_const() cast wrapper	2012-06-14 08:20:40 -06:00
Brian Paul	bffb3997c3	svga: fix comment typo	2012-06-14 08:20:40 -06:00
Aaron Watry	fc3bac8a40	rbug: fix make process on Linux Mint 13 x64. Previously, rbug_.c would fail to compile with incomplete prototype errors when make was run from the command line on my machine. My IDE always built fine, and still does after this patch (Netbeans 7.1.2). Most of the includes from files in gallium/auxiliary/rbug/ were assuming an rbug/ subdirectory, while the headers are actually in the same directory as the .c files. The build error was also previously a problem for me on Ubuntu 11.10 and Mint 12. Fixes build for the following configuration: ./autogen.sh --enable-debug --enable-texture-float --with-gallium-drivers=r600 --with-dri-drivers=radeon --enable-r600-llvm-compiler Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-14 08:14:59 -06:00
José Fonseca	93a42d1314	windows/gdi: Remove GL_NV_register_combiners and GL_NV_vertex_array_range exports	2012-06-14 12:02:03 +01:00
Ian Romanick	4bfdc83135	glsl: Fix pi/2 constant in acos built-in function In single precision, 1.5707963 becomes 1.5707962513 which is too small. However, 1.5707964 becomes 1.5707963705 which is just right. The value 1.5707964 is already used in asin.ir. NOTE: This is a candidate for stable release branches. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Paul Berry <stereotype441@gmail.com>	2012-06-13 18:26:11 -07:00
Ian Romanick	f18d3fe0cb	glapi: Remove GL_NV_vertex_array_range from the dispatch table There is no GLX protocol for these functions. Open-source Linux driver have not supported this extension for many years, and it seems unlikely at this point that this support will return. There's no reason to have slots for these functions in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:56 -07:00
Ian Romanick	69d1851757	glapi: Remove GL_NV_fence from the dispatch table There is no GLX protocol for these functions. No open-source Linux driver has ever supported this extension, and it seems unlikely at this point that one ever will. There's no reason to have slots for these functions in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:56 -07:00
Ian Romanick	6db7cf29b5	glapi: Remove GL_NV_register_combiners from the dispatch table There is no GLX protocol for these functions. No open-source Linux driver has ever supported this extension, and it seems unlikely at this point that one ever will. There's no reason to have slots for these functions in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:56 -07:00
Ian Romanick	a6002909a3	glapi: Remove GL_APPLE_texture_range from the dispatch table There is no GLX protocol for these functions, and no Linux driver has ever supported this extension. There's no reason to have slots for these functions in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:54 -07:00
Ian Romanick	e62c4c765c	glapi: Remove GL_SGIX_pixel_texture from the dispatch table There is no GLX protocol for this function. Open-source Linux driver have not supported this extension for many years, and it seems unlikely at this point that this support will return. There's no reason to have slots for this function in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:54 -07:00
Ian Romanick	933714aabe	glapi: Remove GL_SGIS_pixel_texture from the dispatch table There is no GLX protocol for these functions, and no Linux driver has ever supported this extension. There's no reason to have slots for these functions in the dispatch table. The unit tests (GetProcAddress::TableDidntShrink and others) are also updated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:54 -07:00
Ian Romanick	a29ad2b421	mesa/tests: Add tests for the generated dispatch table Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:17:53 -07:00
Ian Romanick	8fecdcc587	mesa/tests: Add tests for _mesa_lookup_enum_by_{name,nr} functions Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 13:14:22 -07:00
Ian Romanick	e08f9080ff	glapi: Add missing GL_EXT_texture_sRGB_decode enums Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	1c25984b23	glapi: Add missing GL_EXT_framebuffer_sRGB enums Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	75c516c959	glapi: Add missing GL_EXT_packed_float enums Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	ffbccb8cef	glapi: Add missing framebuffer sRGB enum Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	2d8d85d7fb	glapi: Add uniform buffer object enums These are from OpenGL 3.1 and ARB_uniform_buffer_object. I only added them to 3.1 because that required the least work. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	c5071825b0	glapi: Add missing enums for GL_NV_fragment_program Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	2485a1332e	glapi: Add missing enums for GL_ARB_occlusion_query2 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:26 -07:00
Ian Romanick	22cdd7d817	glapi: Remove extraneous GL_ from TEXTURE_IMMUTABLE_FORMAT Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	21af1e9a0e	glapi: Add missing enums for GL_ATI_fragment_shader Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	502449d71f	glapi: Add texture swizzle enums These are from OpenGL 3.3, ARB_texture_swizzle, and EXT_texture_swizzle (with different names). I only added them to 3.3 because that required the least work. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	a4a0c1f09d	glapi: Add a couple missing 3.0 enums Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	cc1e74bd19	glapi: Add missing _NV extension on COMBINE4 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	78b30938cc	glapi: Add missing enums for GL_EXT_vertex_array Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	8fcec14417	glapi: Add missing enums for GL_EXT_compiled_vertex_array Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:25 -07:00
Ian Romanick	3c22f79412	glx/tests: Add unit tests for generated code in indirect_init.c Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:24 -07:00
Ian Romanick	4c270f9c6b	glx/tests: Add unit tests for generated code in indirect_size.c Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:24 -07:00
Ian Romanick	ae376f0567	glx/tests: Rename test as glx-test This matches the existing test in src/glsl/tests. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:24 -07:00
Ian Romanick	2e8c866f10	glx: Move tests from tests/glx to src/glx/tests This matches the organization of other unit tests in Mesa. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-13 11:50:24 -07:00
Brian Paul	f68ab0398b	util: add some comments, fix indentation	2012-06-13 08:52:40 -06:00
Matt Turner	ae419a0159	glsl: Transform dot product by a basis vector into a swizzle Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-12 18:51:25 -04:00
Matt Turner	9aa3fbcc2e	glsl: Add is_basis function Determines whether it's a basis vector, i.e., a vector with one element equal to 1 and all other elements equal to 0. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-12 18:51:25 -04:00
Matt Turner	d7bef19c7f	glsl: Check for zero vectors in ir_binop_dot Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-12 18:51:25 -04:00
Brian Paul	82ce93a8fd	mesa: move variable declaration out of loop to fix MSVC build	2012-06-12 16:31:36 -06:00
Stéphane Marchesin	a74c4fb89d	mesa: Fix bool-int mismatch Also include stdbool for windows.	2012-06-12 15:22:48 -07:00
Antoine Labour	3c9fab8822	mesa: Fix hash table leak When a value was replaced, the new key was strdup'd and leaked. To fix this, we modify the hash table implementation to return whether the value was replaced and free() the (now useless) duplicate string.	2012-06-12 14:42:22 -07:00
Antoine Labour	e2e9b4b10f	mesa: Free uniforms correclty. This is an array of uniforms, not a single one. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> NOTE: This is a candidate for the 8.0 branch.	2012-06-12 14:42:22 -07:00
Antoine Labour	53feb8ecdc	meta: Cleanup the resources we allocate. When we have multiple shared contexts, and one of them is long-running, this will lead to never freeing those resources since they are shared. Instead, free them right away on context destruction since we know the other context isn't using them. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> NOTE: This is a candidate for the 8.0 branch.	2012-06-12 14:42:22 -07:00
Stéphane Marchesin	0256edd709	glx: Handle a null reply in QueryVersion. Works around crashes when X connections break. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> NOTE: This is a candidate for the 8.0 branch.	2012-06-12 14:42:22 -07:00
Michel Dänzer	1657dec72d	radeonsi: Don't always re-compile shaders after they're bound.	2012-06-12 20:18:24 +02:00
Dave Airlie	6d289390ec	st/xorg: Fix crash on startup. Signed-off-by: Dave Airlie <airlied@redhat.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com>	2012-06-12 18:48:28 +02:00
Michel Dänzer	90c6eacdb4	radeonsi: Use linear instead of constant interpolation for now. Constant interpolation still hangs the GPU for some reason.	2012-06-12 18:48:28 +02:00
Thomas Stellard	4c418cf1a3	radeonsi: Handle SUB_f32. Signed-off-by: Thomas Stellard <tom.stellard@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>	2012-06-12 18:48:16 +02:00
Michel Dänzer	4c4ef9c29a	radeonsi: Only dump shaders with environment variable RADEON_DUMP_SHADERS=1.	2012-06-12 18:33:54 +02:00
Eric Anholt	7b11051a28	mesa: Build git_sha1.h before computing dependencies. Otherwise, version.c doesn't get a dependency on it in a clean build, and then it doesn't necessarily get generated before version.c is compiled. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50976 Reviewed-by: Jakob Bornecrantz jakob@vmware.com	2012-06-12 08:10:41 -07:00
Andreas Boll	fd64b39727	docs: whitespaces cleanup Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	5dc59455f9	docs: remove some superfluous <p> tags Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	8155ed37a1	docs: remove unused table styles Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	908f788503	docs: remove unused anchor links Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	210a27d8c3	docs: prefer lowercase html tags Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	cc4188895b	docs: use id instead of <a name> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	f85d23cea4	docs/subset-A.html: fix markup fixes tidy warnings: line 11 column 1 - Warning: <center> isn't allowed in <h1> elements line 10 column 1 - Info: <h1> previously mentioned line 11 column 34 - Warning: discarding unexpected </center> line 14 column 1 - Warning: <center> isn't allowed in <h2> elements line 13 column 1 - Info: <h2> previously mentioned line 13 column 1 - Warning: missing </h2> before <h3> line 18 column 1 - Warning: discarding unexpected </center> line 19 column 1 - Warning: discarding unexpected </h2> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	2d7f319a0a	docs/news.html: use proper markup fixes tidy warnings: line 1227 column 9 - Warning: missing <li> line 1228 column 17 - Warning: missing <li> line 1235 column 25 - Warning: missing <li> line 1259 column 17 - Warning: missing <li> line 1267 column 9 - Warning: missing <li> line 1359 column 9 - Warning: missing <li> line 1361 column 55 - Warning: discarding unexpected </i> line 1354 column 1 - Warning: trimming empty <p> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	df2be226d9	docs: fix html end/start tags for more well-formed html Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:30 -06:00
Andreas Boll	703a662c15	docs: escape special html chars Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:29 -06:00
Andreas Boll	ecd5c7ceb8	docs: consolidate html header and footer add doctype add character encoding add missing <head> tag unify html header and footer Signed-off-by: Brian Paul <brianp@vmware.com>	2012-06-12 08:03:29 -06:00
Kenneth Graunke	45c21f852e	mesa: Unbind GL_TEXTURE_BUFFER on DeleteBuffers. Fixes oglconform's tbo/basic.buffer.delete test. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-11 13:30:32 -07:00
Kenneth Graunke	bbb67c3efc	mesa: Make glPrimitiveRestartIndex execute immediately in display lists. From the GL_NV_primitive_restart spec: "PrimitiveRestartIndexNV is not compiled into display lists, but is executed immediately." Prior to this patch, calls to glPrimitiveRestartIndex would hit the noop dispatch stub. +2 oglconforms. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-11 13:28:23 -07:00
Kenneth Graunke	a75e704326	mesa: Check for a negative "size" parameter in glCopyBufferSubData(). From the GL_ARB_copy_buffer spec: "An INVALID_VALUE error is generated if any of readoffset, writeoffset, or size are negative [...]" Fixes oglconform's copybuffer/negative.CNNegativeValues test. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-06-11 13:27:36 -07:00
Kenneth Graunke	4a5d020ee3	automake: Add AM_PROG_AR before LT_INIT to silence a lot of warnings. The warnings appear to occur with newer automake (probably 1.12). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-11 13:27:36 -07:00
José Fonseca	ea606ee7b4	scons: Fix scons build.	2012-06-11 19:38:07 +01:00
Brad King	f3cdcb839f	configure.ac: Add --with-(gl\|glu\|osmesa)-lib-name options These allow one to mangle the library names, without also mangling the symbol names, to make them distinct from other GL libraries on the system. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	337d9c955b	glsl: Put a bunch of optimization visitors under anonymous namespaces. Because these classes are used entirely from their own source files and not from separate DSOs, the linker gets to produce massively less code. This cuts about 13k of text in the libdricore case. In the non-libdricore case, the additional linkage information allows the compiler to inline some code, so libglsl.a size actually increases by about 300 bytes. For a dricore build, improves shader_runner runtime on glsl-fs-copy-propagation-texcoords-1 by 0.21% +/- 0.03% (n=353574, outliers removed). No statistically significant difference with n=322 on glslparsertest on a yofrankie shader intended to test compiler performance. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	279efce8bb	automake: Merge the dricore libglsl build into libdricore. Now we have just one library of "all of Mesa core" instead of both libdricore and libglsl that drivers link against. I did this change in a sort of nonrecursive make fashion: the generated files are still produced in the non-automake build, like the rest of dricore, but the GLSL files are stuffed into libdricore without building a convenience library in src/glsl (even though we could now). This would make a bit more sense if glsl was just another dir under src/mesa, because right now I had to contort the prefix variable name to look another ../ level up.	2012-06-11 09:28:00 -07:00
Eric Anholt	446faee094	automake: Add a prefix variable for libglsl sources. See `e86c40a84d` for reasoning. In the process I did s/:=/=/ to shut up automake about nonportable make syntax.	2012-06-11 09:28:00 -07:00
Eric Anholt	7edbf4b323	automake: Convert src/Makefile to automake. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	07abd913b6	automake: Move top-level makefile to automake. This is part of a series to fix our build issues in the automake case by hooking up the automatic Makefile regeneration support. The extract_git_sha1 is moved into src/mesa/Makefile so that we get correct dependency generation. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	743e505315	automake: Globally add stub automake targets to the old Makefiles. I tried to update all the old Makefiles that included the default config to be sure they had a default target if they didn't previously have one, since this new all target will always point at it. Almost everything had one. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	4038dda6cd	mesa: Move the version information right into configure.ac. Nothing else called version.mk. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Eric Anholt	0cc216676c	automake: Remove the old static configs system. With the incremental automake conversion, we'd broken those that included glx or egl. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Tapani Pälli	d5c1801a01	android: fix the build Some more of the files are now autogenerated, this caused build breakage, patch adds generation of these missing files. Patch also changes existing make so that the files are created to be part of the local source (not intermediate directory, this causes several problems). Signed-off-by: Tapani Pälli <tapani.palli@intel.com>	2012-06-11 09:27:59 -07:00
Michael Karcher	e2c08e824b	i915g: Fix depth/stencil glClear This patch fixes a copy/paste error and masking of depth/stencil (stencil is in the top 8 bits), and makes glean/readPixSanity happy. Both the stencil and the depth buffer piglit test also pass if glClear(DEPTH \| STENCIL) is executed instead of glClear(DEPTH)/glClear(STENCIL). Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Tested-by: Christopher Egert <cme3000@gmail.com> Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-06-10 16:33:42 +02:00
Kenneth Graunke	306c9f0c57	mesa: Fix "glCopyBuffserSubData" typos in error messages and comments. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-08 22:04:34 -07:00
Eric Anholt	a018747ac8	glsl: Clean up warnings about deleting classes without virtual destructors. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-08 12:42:38 -07:00
Marcin Slusarz	ea055e19c2	glsl: fix deref_hash memory leak in constant_expression_value Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-08 21:00:40 +02:00
Andreas Boll	ca9977d5c6	glcpp: .gitignore cleanup .o, .lo and *~ are already in toplevel .gitignore Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-08 11:18:55 -07:00
Andreas Boll	6224e90247	glapi: .gitignore cleanup remove archaic .cvsignore .pyo is already in toplevel .gitignore .pyc is already in toplevel .gitignore Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-08 11:18:38 -07:00
Roland Scheidegger	dfbb18bdb5	gallivm: Fix calculating rho for 3d textures for the single-quad case Discovered by accident, this looks like a very old typo bug. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-06-08 17:46:57 +01:00
Kenneth Graunke	529476b5e4	i965: Add forgotten bitcast operations in brw_fs_channel_expressions. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 11:22:11 -07:00
Paul Berry	9fd0e76a19	i965/blorp: allow all buffer formats provided src and dst match. Previously, blits using the "blorp" mechanism only worked for 8-bit RGBA color buffers, 24-bit depth buffers, and 8 bit stencil buffers. This was not enough, because the blorp mechanism must be used for blitting whenever MSAA is in use. This patch allows all formats to be used, provided the source and destination formats match. So far I have confirmed that the following formats work properly with MSAA: - GL_RGB - GL_RGBA - GL_ALPHA - GL_ALPHA4 - GL_ALPHA8 - GL_R3_G3_B2 - GL_RGB4 - GL_RGB5 - GL_RGB8 - GL_RGB10 - GL_RGB12 - GL_RGB16 - GL_RGBA2 - GL_RGBA4 - GL_RGB5_A1 - GL_RGBA8 - GL_RGB10_A2 - GL_RGBA12 - GL_RGBA16 Fixes piglit tests "EXT_framebuffer_multisample/formats {2,4}" on Sandy Bridge and Ivy Bridge. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 11:03:15 -07:00
Paul Berry	530bda2aac	i965/blorp: Implement logic for additional buffer formats. Previously the blorp engine only supported RGBA8 color buffers and 24-bit depth buffers. This patch adds support for any color buffer format that is supported as a render target, and for 16-bit and 32-bit depth buffers. This required threading the brw_context struct through into brw_blorp_surface_info::set() so that it can consult the brw->render_target_format array. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 11:03:15 -07:00
Paul Berry	9dbd0b6778	i965/blorp: De-virtualize brw_blorp_{mip,surface}_info::set() function. Even though brw_blorp_surface_info is derived from brw_blorp_mip_info, this function doesn't need to be virtual, because it is never accessed through a base class pointer. Making the function non-virtual will allow it to take additional parameters in the brw_blorp_surface_info case. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 11:03:15 -07:00
Paul Berry	040d015734	i965/blorp: Refactor surface format determination. This patch moves the responsibility for deciding on the format of the source and destination surfaces from the gen{6,7}_blorp_emit_surface_state() functions to brw_blorp_surface_info::set(), which is shared between Gen6 and Gen7. This will make it possible to add support for more surface formats without code duplication. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 11:03:15 -07:00
Kenneth Graunke	05790746df	i965: Enable the GL_ARB_shader_bit_encode extension. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:17:21 -07:00
Olivier Galibert	a83be8b6d7	st/mesa: Finally activate the ARB_shader_bit_encoding extension. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:17:14 -07:00
Olivier Galibert	e16b0a51be	glsl: Bitwise conversion operator support in the software renderers. TGSI doesn't need an opcode, since registers are untyped (but beware once doubles come into the scene). Mesa IR doesn't handle native integers, so trying to handle them there is worthless, the case entries are only added for warning reasons. It was only tested with softpipe, since llvmpipe doesn't support glsl 1.3 yet. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:18 -07:00
Olivier Galibert	abe9767553	glsl: Bitwise conversion operator support in ir_constant_expression. A "test_out = floatBitsToUint(-1.0);" fired through the GLSL compiler gives a correct "(assign (x) (var_ref test_out) (constant uint (3212836864)))" Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:18 -07:00
Olivier Galibert	1b8a3aad09	glsl: Bitwise conversion operator support in ir_validate. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:18 -07:00
Olivier Galibert	4fab150559	glsl: Bitwise conversion operator support in ir_expression. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:17 -07:00
Olivier Galibert	500dcbb1aa	glsl: New unary opcodes for ARB_shader_bit_encoding support. The opcodes are bitcast_f2u, bitcast_f2i, bitcast_i2f and bitcast_u2f. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:17 -07:00
Olivier Galibert	199771bc32	glsl: Scaffolding for ARB_shader_bit_encoding. That adds support for activating the extension. It doesn't actually do anything yet, of course. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:06:00 -07:00
Kenneth Graunke	f8d40deea5	mesa: Return 8 bits for GL_TEXTURE_RED_SIZE on RGTC formats. From the issues section of the GL_ARB_texture_compression_rgtc extension: 15) What should glGetTexLevelParameter return for GL_TEXTURE_GREEN_SIZE and GL_TEXTURE_BLUE_SIZE for the RGTC1 formats? What should glGetTexLevelParameter return for GL_TEXTURE_BLUE_SIZE for the RGTC2 formats? RESOLVED: Zero bits. These formats always return 0.0 for these respective components and have no bits devoted to these components. Returning 8 bits for red size of RGTC1 and the red and green sizes of RGTC2 makes sense because that's the maximum potential precision for the uncompressed texels. Thus, we need to return 8 bits for GL_TEXTURE_RED_SIZE on all RGTC formats and 8 bits for GL_TEXTURE_GREEN_SIZE on RGTC2 formats. BLUE should be 0. Fixes oglconform/rgtc/advanced.texture_fetch.tex_param. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-07 00:01:40 -07:00
Kenneth Graunke	3603fdcebf	glsl: Hook up loop_variable_state destructor to plug a memory leak. While ~loop_state() is already freeing the loop_variable_state objects via ralloc_free(this->mem_ctx), the ~loop_variable_state() destructor was never getting called, so the hash table inside loop_variable_state was never getting destroyed. Fixes a memory leak in any shader with loops. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-07 00:01:40 -07:00
Tom Stellard	5f3f63b76d	radeon/llvm: Emulate RECIP_UINT instruction on Cayman	2012-06-06 20:51:00 -04:00
Tom Stellard	0c9f5f22d5	radeon/llvm: Remove some duplicate code in the R600 CodeEmitter	2012-06-06 20:51:00 -04:00
Tom Stellard	9c46cb2368	radeon/llvm: Fix MULLO* instructions on Cayman On Cayman, the MULLO* instructions must fill all slots in an instruction group.	2012-06-06 20:50:36 -04:00
Tom Stellard	0c4b19ac63	r600g: Compute support for Cayman	2012-06-06 10:49:36 -04:00
Dave Airlie	2bb2e6a6e3	xorg: port to new compat API. Signed-off-by: Dave Airlie <airlied@redhat.com>	2012-06-06 15:22:50 +01:00
Brian Paul	ec19bdd16c	mesa: consolidate internal glCompressedTexSubImage1/2/3D code Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-06 07:56:00 -06:00
Brian Paul	e8fdd0e0d5	mesa: consolidate internal glCompressedTexImage1/2/3D code Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-06 07:56:00 -06:00
Brian Paul	cd9ab2584f	mesa: consolidate internal glCopyTexSubImage1/2/3D code Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-06 07:56:00 -06:00
Brian Paul	e42d00b3f4	mesa: consolidate internal glTexSubImage1/2/3D code Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-06 07:55:59 -06:00
Brian Paul	8f5fffe75d	mesa: consolidate internal glTexImage1/2/3D code The functions for handling 1D, 2D and 3D texture images were nearly identical. This folds them all together. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-06 07:55:59 -06:00
Brian Paul	3a62e8bcac	translate_test: add support for half floats Fixes assertion reported in https://bugs.freedesktop.org/show_bug.cgi?id=44519 but there's still failing cases.	2012-06-06 07:55:59 -06:00
Brian Paul	adc58e96d0	docs: remove documentation of old Makefile system It's going away in the near future.	2012-06-06 07:55:59 -06:00
Tom Stellard	d4942eb9fa	radeon/llvm: Remove obselete hooks for the ConvertToISA pass We can't remove this pass yet, because we need it to convert AMDIL registers in BRANCH* instructions, but we don't need it for instruction conversion any more.	2012-06-06 13:46:04 -04:00
Tom Stellard	edceed1b9a	radeon/llvm: Remove AMDIL MOVE* instructions	2012-06-06 13:46:04 -04:00
Tom Stellard	f81e4663a7	radeon/llvm: Add isMov() to AMDILInstrInfo This enables the CFGStructurizer to work without the AMDIL::MOV* instructions.	2012-06-06 13:46:04 -04:00
Tom Stellard	1777c99bff	radeon/llvm: Remove deadcode from the AMDILISelLowering class	2012-06-06 13:46:03 -04:00
Tom Stellard	8cc9b463de	radeon/llvm: Don't lower RETURN to S_ENDPGM on SI Instead create an S_ENDPGM instruction in the CodeEmitter and emit it after all the other instructions.	2012-06-06 13:46:03 -04:00
Tom Stellard	de7366701d	radeon/llvm: Remove AMDIL VCREATE* instructions This obsoletes the AMDGPULowerInstruction pass.	2012-06-06 13:46:03 -04:00
Tom Stellard	8d53ddb375	radeon/llvm: Remove AMDIL LOADCONST* instructions This obsoletes the R600LowerInstruction and SIPropagateImmReads passes.	2012-06-06 13:46:03 -04:00
Marcin Slusarz	17e047242e	nouveau: fix scratch buffer leak ...and create common function for destroying nouveau_context	2012-06-05 23:58:43 +02:00
Marcin Slusarz	3232a86efe	nv50: fix nv50_stream_output_state leak	2012-06-05 23:58:43 +02:00
Marcin Slusarz	cfa7cb991c	nv50: fix symbol table memory leak	2012-06-05 23:58:43 +02:00
Kenneth Graunke	2f18698220	i965/fs: Fix user-defined FS outputs with less than four components. OpenGL allows you to declare user-defined fragment shader outputs with less than four components: out ivec2 color; This makes sense if you're rendering to an RG format render target. Previously, we assumed that all color outputs had four components (like the built-in gl_FragColor/gl_FragData variables). This caused us to call emit_color_write for invalid indices, incrementing the output virtual GRF's reg_offset beyond the size of the register. This caused cascading failures: split_virtual_grfs would allocate new size-1 registers based on the virtual GRF size, but then proceed to rewrite the out-of-bounds accesses assuming that it had allocated enough new (contiguously numbered) registers. This resulted in instructions that accessed size-1 GRFs which register numbers beyond virtual_grf_next (i.e. registers that were never allocated). Finally, this manifested as live variable analysis and instruction scheduling accessing their temporary array with an out of bounds index (as they're all sized based on virtual_grf_next), and the program would segfault. It looks like the hardware's Render Target Write message requires you to send four components, even for RT formats such as RG or RGB. This patch continues to use all four MRFs, but doesn't bother to fill any data for the last few, which should be unused. +2 oglconforms. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-05 14:41:34 -07:00
Kenneth Graunke	cb18472eca	i965/vs: Fix texelFetchOffset() on pre-Gen7. Commit `4650aea7a5` fixed texelFetchOffset() on Ivybridge, but didn't update the Ironlake/Sandybridge code. +18 piglits on Sandybridge. NOTE: This and `4650aea7a5` are both candidates for stable branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-05 14:41:34 -07:00
Kenneth Graunke	217b62bf00	i965/fs: Fix texelFetchOffset() on pre-Gen7. Commit `f41ecade7b` fixed texelFetchOffset() on Ivybridge, but didn't update the Ironlake/Sandybridge code. +15 piglits on Sandybridge. NOTE: This and `f41ecade7b` are both candidates for stable branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-05 14:41:34 -07:00
Kenneth Graunke	7fde071f04	meta: Fix GL_RENDERBUFFER binding in decompress_texture_image(). This isn't saved/restored by _mesa_meta_begin, so we need to do it manually (like we do for the read/draw framebuffers). Additionally, we neglected to re-bind before the glRenderbufferStorage call. +13 oglconforms. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-05 14:41:34 -07:00
Kenneth Graunke	3edd2ba22b	mesa: Unbind ARB_transform_feedback2 binding points on Delete too. DeleteBuffer needs to unbind from these binding points as well, based on the same rationale as the previous patch. +51 oglconforms (together with the last patch). NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-05 14:41:34 -07:00
Kenneth Graunke	05b086ce93	mesa: Support BindBuffer{Base,Offset,Range} with a buffer of 0. _mesa_lookup_bufferobj returns NULL for 0, which caused us to say "there's no such buffer object" and raise an error, rather than correctly binding the shared NullBufferObj. Now you can unbind your buffers. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-05 14:41:33 -07:00
Kenneth Graunke	cb8ed93dd0	mesa: Unbind ARB_copy_buffer and transform feedback buffers on delete. According to the GL 3.1 spec, section 2.9 ("Buffer Objects"): "If a buffer object is deleted while it is bound, all bindings to that object in the current context (i.e. in the thread that called DeleteBuffers) are reset to zero." The code already checked for a number of cases, but neglected these newer binding points. +21 oglconforms. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-06-05 14:41:33 -07:00
Kenneth Graunke	25edfbfccf	glsl/builtins: Fix textureGrad() for Array samplers. We were incorrectly assuming that the coordinate's dimensionality is equal to the gradient's dimensionality. For array types, the coordinate has one more component. Fixes 12 subcases of oglconform's glsl-bif-tex-grad test. NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-05 14:41:33 -07:00
Kristian Høgsberg	2c4f6ceeb4	configure.ac: Fail if egl x11 platform dependencies are not available Currently, if you pass --with-egl-platforms=x11 but xcb-dri2 isn't available we just silently fail and disables building the EGL DRI2 driver. This commit cleans up the EGL platfrom checking and fails if a selected platform can't find its required dependencies. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-05 14:34:33 -04:00
Alex Deucher	75f9d24ac4	r600g: add new Trinity PCI ids Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-05 10:16:42 -04:00
Alex Deucher	6ce298f9ce	r600g: add new Sumo, Palm, BTC pci ids Note this is a candidate for the stable branch. Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-05 10:15:16 -04:00
Alex Deucher	01b7eb7c74	radeonsi: add new SI pci ids Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2012-06-05 10:12:21 -04:00
Paul Berry	555e00fdc3	Fix .gitignore for ralloc-test Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-04 18:11:43 -07:00
Vinson Lee	105f307d90	st/mesa: Fix uninitialized members in glsl_to_tgsi_visitor constructor. Fix uninitialized scalar field defects reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org>	2012-06-02 13:18:40 -07:00
Kenneth Graunke	adbfc4a09a	i965: Implement texture buffer objects on Gen6. Commit `a07cf3397e` added support for TBOs on Gen7, but missed Gen6. Passes piglit -t texture_buffer and oglconform's buffermapping basic.read.texture tests. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-02 12:02:42 -07:00
Kenneth Graunke	608c3d2083	mesa: Restore depth texture state on glPopAttrib(GL_TEXTURE_BIT). According to Table 6.17 in the GL 2.1 specification, DEPTH_TEXTURE_MODE, TEXTURE_COMPARE_MODE, and TEXTURE_COMPARE_FUNC need to be restored on glPopAttrib(GL_TEXTURE_BIT). Makes a number of oglconform tests happier. v2: Make restoration conditional on the ARB_shadow and ARB_depth_texture extensions, as suggested by Brian. I'm not sure that any implementations still remain that don't support those, but why not? NOTE: This is a candidate for stable release branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-02 12:02:42 -07:00
Eric Anholt	775ba11dcd	automake: Connect the libdricore target to make clean. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50480 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-01 16:25:39 -07:00
Tapani Pälli	a9cfd95c24	automake: use -m32 in CCASFLAGS when using --enable-32-bit this fixes libdricore directory build with --enable-32-bit on a x86_64 system Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-06-01 16:25:39 -07:00
Tom Stellard	0ebf2318b3	radeon/llvm: Fix VTX_READ patterns The VTX_READ instructions were using the ADDRParam ComplexPattern which allows a load instruction's offset to be a register, but VTX_READ instructions can only handle an immediate offset. Also, the load_param pattern fragment had an erroneous return true; statement that was causing it to match the wrong load instructions.	2012-06-01 16:52:26 -04:00
Tom Stellard	c108831d44	radeon/llvm: Emit 2 bytes for vertex fetch offsets	2012-06-01 16:52:26 -04:00
Tom Stellard	85a68814ee	radeon/llvm: Only use indirect (vertex fetch) parameters for kernels Kernel parameters can only be retrieved via vertex fetchs. Direct parameters (i.e parameters stored in the constant buffer) are not supported yet.	2012-06-01 16:52:26 -04:00
Kenneth Graunke	fb79ecb62d	intel: Change vendor string to "Intel Open Source Technology Center". Tungsten Graphics has not existed for several years, and the majority of ongoing development and support is done by Intel. I chose to include "Open Source Technology Center" to distinguish it from, say, the closed source Windows OpenGL driver. The one downside to this patch is that applications that pattern match against "Intel" may start applying workarounds meant for the Windows driver. However, it does seem like the right thing to do. This does change oglconform behavior. Acked-by: Eric Anholt <eric@anholt.net> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Eugeni Dodonov <eugeni.dodonov@intel.com> Acked-by: Keith Packard <keithp@keithp.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-01 14:24:57 -07:00
Ian Romanick	adfe531841	glsl: Remove spurious printf messages These look like debug messages from the switch-statement development. NOTE: This is a candidate for the 8.0 release branch. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-06-01 12:27:04 -07:00
Tom Stellard	d6c2d3722d	radeon/llvm: Eliminate CFGStructurizer dependency on AMDIL instructions Add some hooks to the R600,SI InstrInfo and RegisterInfo classes, so that the CFGStructurizer pass can run without any relying on AMDIL instructions.	2012-06-01 11:28:11 -04:00
Tom Stellard	65917004d9	radeon/llvm: Change prefix on tablegen files to AMDGPU	2012-06-01 11:28:11 -04:00
Tom Stellard	afea59bf65	radeon/llvm: Remove deadcode from the R600LowerInstructions pass	2012-06-01 11:28:10 -04:00
Tom Stellard	883a0af53a	radeon/llvm: Remove AMDIL GLOBALSTORE* instructions	2012-06-01 11:28:10 -04:00
Tom Stellard	f2781271c7	radeon/llvm: Remove AMDIL GLOBALLOAD* instructions	2012-06-01 11:28:10 -04:00
Adam Rak	6a829a1b72	r600g: compute support for evergreen Tom Stellard: - Updated for gallium interface changes - Fixed a few bugs: + Set the loop counter + Calculate the correct number of pipes - Added hooks into the LLVM compiler	2012-06-01 11:28:10 -04:00
Tom Stellard	46a13b3b11	clover: Add function for building a clover::module for non-TGSI targets v6 v2: -Separate IR type and LLVM triple -Do the OpenCL C->LLVM IR and linking steps for all PIPE_SHADER_IR types. v3: - Coding style fixes - Removed compatibility code for LLVM < 3.1 - Split build_module_llvm() into three functions: compile(), link(), and build_module_llvm() v4: - Use struct pipe_compute_program v5: - Don't malloc memory for struct pipe_llvm_program v6: - Fix serialization of llvm bytecode Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:28:10 -04:00
Tom Stellard	f2606413ec	gallium: Add struct pipe_llvm_program_header v3 This structure is used as a header that precedes LLVM bytecode programs that are passed to the drivers. v2: - s/pipe_compute_program/pipe_llvm_program/ v3: - Rename to struct pipe_llvm_program_header - Drop the char * prog member Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:28:10 -04:00
Tom Stellard	741463e18d	clover: Remove target argument from compile_program_tgsi() Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:28:10 -04:00
Tom Stellard	d724190bce	clover: Add constructors to some of the module classes v3 This is for the llvm code that can't use extended initializers. v2: - Use const references for vector arguments - Move constructor defs before data members - Initialize all values in the default constructors v3: - Fix typo	2012-06-01 11:28:09 -04:00
Tom Stellard	5cc08efe8f	clover: Add necessary flags to libclllvm_la_CXXFLAGS $(LLVM_CFLAGS) for LLVM defines -DLIBCLC_PATH for libclc path -DCLANG_RESOURCE_DIR for clang includes $(DEFINES) for -DHAVE_LLVM	2012-06-01 11:28:09 -04:00
Tom Stellard	7a6b5d42d8	clover: Link to the necessary LLVM and Clang libs	2012-06-01 11:28:09 -04:00
Tom Stellard	d416780f39	configure.ac: Add variables LLVM_CPPFLAGS and LLVM_LIBDIR	2012-06-01 11:28:09 -04:00
Tom Stellard	c79e7668b2	configure.ac: Add option for libclc path	2012-06-01 11:28:09 -04:00
Tom Stellard	613323b256	clover: Add a function for retrieving a device's preferred ir v3 A device now has two function for getting information about the IR it needs to return. ir_format() => returns the preferred IR ir_target() => returns the triple for the target that is understood by clang/llvm. v2: - renamed ir_target() to ir_format() - renamed llvm_triple() to ir_target() v3: - Remove unnecessary include - Do proper conversion from std::vector<char> to std::string Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:28:09 -04:00
Francisco Jerez	c4c51153bc	gallium/compute: Add PIPE_COMPUTE_CAP_IR_TARGET v4 v2: Tom Stellard - Update CAP description v3: Tom Stellard - TGSI targets should pass an empty string for this CAP. v4: Tom Stellard - TGSI targets can ignore this CAP. Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:27:53 -04:00
Tom Stellard	1d118a2a76	gallium: Add PIPE_SHADER_IR_LLVM to enum pipe_shader_ir v2 v2: - s/PIPE_SHADER_IR_LLVM_R600/PIPE_SHADER_IR_LLVM/ Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:26:57 -04:00
Tom Stellard	d85e512374	configure.ac: Add HAVE_OPENCL AM_CONDITIONAL v2 v2: - Drop HAVE_OPENCL variable for non-automake builds - s/HAVE_OPENCL/HAVE_GALLIUM_COMPUTE Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2012-06-01 11:26:57 -04:00
Brian Paul	091a61a8d5	scons: generate the glapitable.h file too	2012-06-01 08:27:21 -06:00
Brian Paul	8009fca501	svga: fix saturated TEX instructions TEX instructions can't do saturation. Do the TEX into a temp reg w/out saturation, then do a MOV_SAT. Reviewed-by: Jakob Bornecrantz <jakob@vmware.com>	2012-05-31 12:54:04 -06:00
Brian Paul	dff36e900c	scons: add code to generate the various GL API files This fixes recent build breakage when we began building the generated API files from xml as part of the normal build process. Fixes http://bugs.freedesktop.org/show_bug.cgi?id=50475	2012-05-31 09:40:35 -06:00
Brian Paul	185ed21058	draw: simplify index buffer specification Replace draw_set_index_buffer() and draw_set_mapped_index_buffer() with draw_set_indexes() which simply takes a pointer and an index size.	2012-05-31 09:40:35 -06:00
Kenneth Graunke	151bf6e6cf	glsl/tests: Plumb $(PYTHON2) and $(PYTHON_FLAGS) into optimization-test. Some distributions (like Arch Linux) make /usr/bin/python Python 3, rather than Python 2. Since compare_ir uses /usr/bin/env python, such systems will fail to run optimization-test, causing 'make check' to always fail. Automake's TESTS_ENVIRONMENT variable provides a mechanism to run programs or set environment variables in the test environment. Ideally, I think we would want to use AM_TESTS_ENVIRONMENT, since TESTS_ENVIRONMENT is supposed to be user-overridable. However, it isn't supported using the default/serial test runner. Fixes 'make check' on Arch Linux and Gentoo. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Matt Turner <mattst88@gmail.com>	2012-05-30 21:49:41 -07:00
Kenneth Graunke	a44ccdc876	ralloc: Add some basic unit tests. I started writing unit tests for a new piece of code, and discovered they all failed due to a bug in ralloc. Clearly it needs a test suite. v2: Rename to 'ralloc-test' and fix copyright date. (idr review) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-30 21:49:41 -07:00
Kenneth Graunke	1559b2e2d7	ralloc: Fix ralloc_parent() of memory allocated out of the NULL context. If an object is allocated out of the NULL context, info->parent will be NULL. Using the PTR_FROM_HEADER macro would be incorrect: it would say that ralloc_parent(ralloc_context(NULL)) == sizeof(ralloc_header). Fixes the new "null_parent" unit test. NOTE: This is a candidate for the 7.9, 7.10, 7.11, and 8.0 branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-30 21:49:40 -07:00
Kenneth Graunke	2224fb6047	automake: Check for 'indent' and fall back to 'cat' if not found. The glapi generator code uses indent to produce more readable code. However, we don't want to make GNU indent a hard build dependency; check for it in configure.ac and fall back to 'cat' if it's not available. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50484 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Acked-by: Ben Widawsky <ben@bwidawsk.net>	2012-05-30 13:39:30 -07:00
Oliver McFadden	ff3eef1aff	mesa: don't compile integer clear shaders for unsupported APIs Discovered while running the Khronos conformance test suite and receiving "implementation error: meta program compile failed." This bug was recently introduced by the i965 clear patch set and would only be detected while using the ES2 API and only on gen6+ hardware. Signed-off-by: Oliver McFadden <oliver.mcfadden@linux.intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-30 15:20:34 +03:00
Paul Berry	47b64c9290	i965/blorp: Implement destination clipping and scissoring This patch implements clipping and scissoring of the destination rect for blits that use the blorp engine (e.g. MSAA blits).	2012-05-29 15:35:35 -07:00
Eric Anholt	6a15790632	mesa: Clean up some dricore-related detritus in the old Makefile. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:51 -07:00
Eric Anholt	f9d1562f35	automake: Convert dricore building to automake. This is performed in a subdirectory to avoid needing to convert all of src/mesa/Makefile in one go. I can now cherry-pick a commit containing glapi XML changes, do "(cd src/mapi/glapi/gen && make) && make", and get a working driver. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:48 -07:00
Eric Anholt	e86c40a84d	automake: Add a prefix variable to the common sources lists. In order to do the minimal change for libdricore conversion to automake, I need to put its Makefile.am in a subdirectory. Automake gets whiny/broken if you use GNU make features like "addprefix" or "$(FILES:%=../%)" to munge your *_SOURCES. So, use a plain old variable to be able to substitute in that "../" Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:45 -07:00
Eric Anholt	7d7fe1b037	automake: Rename variables in sources.mak to be automake compatible. *_SOURCES is reserved for files lists for particular automake targets. Also, "-" in the variable names is not allowed. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:42 -07:00
Eric Anholt	b284d4773b	mesa: Remove generated source files during make clean. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:40 -07:00
Eric Anholt	79273b1a7a	glapi: Enable silent rules for generation when used from automake. This variable won't be set when called from non-automake makefiles, but it cleans up shared-glapi's output. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:37 -07:00
Eric Anholt	559d592448	shared-glapi: Don't forget to clean our built file. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:33 -07:00
Eric Anholt	26eaee3245	mesa: Restore installing of libGL for non-dri builds. Reported-by: Sven Joachim <svenjoac@gmx.de> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 12:39:30 -07:00
Eric Anholt	0ce0f7c0c8	mesa: Remove the generated glapi from source control, and just build it. Mesa already always depends on python to build. The checked in changes are not reviewed (because any trivial change rewrites the world). We also have been pushing commits between xml change and regen where at-build-time xml-generated code disagrees with committed xml-generated code. And worst of all, sometimes we ("I") check in stale xml-generated code. Acked-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-29 11:51:57 -07:00
Kurt Roeckx	f92b2e5e90	i830: Fix crash for GL_STENCIL_TEST in i830Enable() commit `87f12bb2d9` tried to fix rb->mt being NULL, but change this case wrong. NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Kurt Roeckx <kurt@roeckx.be> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-29 11:33:02 -07:00
Marcin Slusarz	8924133627	nv50: hook up forgotten short constant buffer upload method Fixes crash in xorg st.	2012-05-29 20:24:45 +02:00
Tom Stellard	83169900fb	radeon/llvm: Update and fix some comments	2012-05-29 11:59:01 -04:00
Tom Stellard	89ece086bc	radeonsi: Remove use.sgpr* intrinsics, use load instructions instead We now model loading uses sgpr values with LLVM IR load instructions that use the USER_SGPR address space. The definition of the sgpr parameter to the use_sgpr() helper function in radeonsi_shader.c has changed so that you can pass raw sgpr values rather than having to divide the sgpr value you want to use by the dword width of the type you want to load.	2012-05-29 11:55:53 -04:00
Tom Stellard	467f51613e	radeonsi: Handle TGSI CONST registers We now emit LLVM load instructions for TGSI CONST register reads, which are lowered in the backend to S_LOAD_DWORD* instructions.	2012-05-29 11:55:52 -04:00
Tom Stellard	32b83e0366	radeon/llvm: Remove AMDILIntrinsicInfo::GetDeclaration fuction body This function was causing compile errors in the tablegen'd code for some intrinsic definitions. I don't think we really need this function, so I'm removing the function body just as a temporary solution. I'll look into removing the entire AMDILIntrinsicInfo class later.	2012-05-29 11:55:52 -04:00
Tom Stellard	49fb99bd13	radeon/llvm: Remove AMDILTargetMachine	2012-05-29 11:55:52 -04:00
Christoph Bumiller	94a25b216b	nouveau: unreference fences on resource destruction	2012-05-29 17:00:20 +02:00
Christoph Bumiller	1a21e36b68	nvc0: optimize blend cso by checking which by-RT data actually differs Can save about 200 bytes of command buffer space.	2012-05-29 17:00:18 +02:00
Christoph Bumiller	f09ee76c98	nvc0: don't upload UCPs if the shader doesn't use them	2012-05-29 17:00:15 +02:00
Christoph Bumiller	79eed0d224	nvc0/ir: allow 64-bit constant loads on nve4 Looks like only 128-bit access doesn't work.	2012-05-29 17:00:10 +02:00
Christoph Bumiller	40c224a573	nvc0/ir: fix texture barrier insertion to prevent WAW hazards Fixes, for instance, object highlighting in Diablo 3 (wine).	2012-05-29 15:01:41 +02:00
Christoph Bumiller	0d818cdacc	nvc0/ir: TEX doesn't support JOIN modifier either	2012-05-29 15:01:41 +02:00
Christoph Bumiller	f80c2874ec	gallium: add st_api feature mask to prevent advertising MS visuals v2: use a define for the maximum sample count v3: also test odd sample counts (r300 supports MS3) While multisample renderbuffers are supported by mesa, MS visuals are not, so we need a way to tell dri/st not to advertise them even if the gallium driver does support multisampled surfaces. Otherwise applications selecting these non-functional visuals would run into trouble ... Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-29 15:01:41 +02:00
Roy Spliet	6404095fba	nv30: Fix generic passing to fragment program in NV34.	2012-05-25 22:42:54 +02:00
Christoph Bumiller	384ef28cb3	nv30: handle user index buffers	2012-05-25 22:42:54 +02:00
Tom Stellard	704eac0916	radeon/llvm: Use a custom inserter for MASK_WRITE	2012-05-25 15:40:59 -04:00
Tom Stellard	4863477e22	radeon/llvm: Use tablegen pattern to lower bitconvert	2012-05-25 15:40:59 -04:00
Tom Stellard	667cdba211	radeon/llvm: Use a custom inserter to lower FNEG	2012-05-25 15:40:58 -04:00
Tom Stellard	d784bc7740	radeon/llvm: Use a custom inserter to lower CLAMP	2012-05-25 15:40:58 -04:00
Tom Stellard	17f8528923	radeon/llvm: Use a custom inserter to lower FABS	2012-05-25 15:40:58 -04:00
Kai Wasserbäch	2df2c31087	r600g: handle R16G16B16_FLOAT and R32G32B32_FLOAT in translate_colorswap Fixes https://bugs.freedesktop.org/show_bug.cgi?id=50318 Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org>	2012-05-25 20:41:01 +02:00
Brian Paul	1609efb418	draw: fix primitive restart bug by using the index buffer offset The code which scans the index buffer for restart indexes wasn't adding the index buffer offset so we were always starting at offset=0. The offset is usually zero so it wasn't noticed before. Fixes a failure in the piglit primitive-restart test when testing vertex data + index data in a single VBO. NOTE: This is a candidate for the 8.0 branch.	2012-05-25 10:02:22 -06:00
Brian Paul	93ea5cd80b	svga: remove the special zero-stride vertex array code This code actually hasn't been needed for some time now. We can just treat a zero-stride vertex array like any other non-zero-stride array.	2012-05-25 10:02:22 -06:00
Brian Paul	dcb4ec5ae1	gallium/docs: beef up the docs related to color clamping Reviewed-by: Marek Olšák <maraeo@gmail.com>	2012-05-25 10:02:22 -06:00
Brian Paul	9c85687439	util: add GALLIUM_LOG_FILE option for logging output to a file Useful for logging different runs to files and diffing, etc.	2012-05-25 10:02:21 -06:00
Paul Berry	ab014adaed	i965/msaa: Enable 4x MSAA on Gen7. Basic 4x MSAA support now works on Gen7. This patch enables it. As with Gen6, MSAA support is still fairly preliminary. In particular, the following are not yet supported: - 8x oversampling (Gen7 has hardware support for this, but we do not yet expose it). - Fully general blits between MSAA and non-MSAA buffers. - Formats other than RGBA8, DEPTH24, and STENCIL8. - Centrold interpolation. - Coverage parameters (glSampleCoverage, GL_SAMPLE_ALPHA_TO_COVERAGE, GL_SAMPLE_ALPHA_TO_ONE, GL_SAMPLE_COVERAGE, GL_SAMPLE_COVERAGE_VALUE, GL_SAMPLE_COVERAGE_INVERT). Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	4725ba03ca	i965/msaa: Implement manual blending operation for Gen7. On Gen6, the blending necessary to blit an MSAA surface to a non-MSAA surface could be accomplished with a single texturing operation. On Gen7, the WM program must fetch each sample and blend them together manually. From the Bspec (Shared Functions/Messages/Initiating Message/Message Types/sample): [DevIVB+]:Number of Multisamples on the associated surface must be MULTISAMPLECOUNT_1. This patch implements the manual blend operation. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	8b1f467cce	i965/msaa: Modify blorp code to account for Gen7 MSAA layouts. Since blorp uses color textures and render targets to do all its work (even when blitting stencil and depth data), it always has to configure the Gen7 GPU to use the new "sliced" MSAA layout. However, when blitting stencil or depth data, the actual MSAA layout is interleaved (as in Gen6). Therefore, blorp has to do extra coordinate transformation work to account for the interleaving manually. This patch causes blorp to perform the necessary extra coordinate transformations. It also modifies the blorp SURFACE_STATE setup code for Gen7, so that it does not try to correct the surface width and height to account for MSAA, since "sliced" MSAA layout doesn't affect the surface width or height. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	31f3dfd59b	i965/msaa: Validate Gen7 surface state constraints. When a Gen7 SURFACE_STATE is configured for MSAA, a number of additional constaints come in to play. This patch adds a function gen7_check_surface_setup() which verifies that all of those constraints are met. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	455ac56272	i965/msaa: Properly handle sliced layout for Gen7. Starting in Gen7, there are two possible layouts for MSAA surfaces: - Interleaved, in which additional samples are accommodated by scaling up the width and height of the surface. This is the only layout available in Gen6. On Gen7 it is used for depth and stencil surfaces only. - Sliced, in which the surface is stored as a 2D array, with array slice n containing all pixel data for sample n. On Gen7 this layout is used for color surfaces. The "Sliced" layout has an additional requirement: it must be used in ARYSPC_LOD0 mode, which means that the surface doesn't leave any extra room between array slices for miplevels other than 0. This patch modifies the surface allocation functions to use the correct layout when allocating MSAA surfaces in Gen7, and to set the array offsets properly when using ARYSPC_LOD0 mode. It also modifies the code that populates SURFACE_STATE structures to ensure that ARYSPC_LOD0 mode is selected in the appropriate circumstances. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	0e11b2c5af	i965/msaa: Add defines for Gen7. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	b08545199a	i965/blorp: Enable blorp blits on Gen7. Gen7 support for blorp (blits using the render bath) now works for non-MSAA purposes. This patch enables it. Since blorp operations re-use the logic for HiZ ops, this required adding a case to the switch statement in gen7_blorp_emit_wm_config(), to allow for the case where no HiZ op is being performed. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	1c73c705fa	i965/blorp: Implement proper texel fetch messages for Gen7. On Gen6, texel fetch is always accomplished using the SAMPLE_LD message, which accepts arguments (u, v, r, lod, si). On Gen7, there are two* texel fetch messages: SAMPLE_LD for non-MSAA surfaces, taking arguments (u, lod, v), and SAMPLE_LD2DSS for MSAA surfaces, taking arguments (si, u, v). *Technically, there are other texel fetch messages, but they are used for "compressed" MSAA surfaces, which we don't yet support. This patch adds the proper message types and argument orderings for Gen7. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	f2cdfa4c85	i965/blorp: Use 16 pixel dispatch on Gen7. Gen7 hardware requires us to enable at least one WM dispatch mode, even if there is no program being dispatched to. When this code was only used for HiZ operations (which don't use a WM program), we used 32-pixel dispatch, because it didn't matter. But blit programs are compiled for 16-pixel dispatch. So just enable 16-wide dispatch unconditionally. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> v2: Enable 16-wide dispatch unconditionally rather than add the unnecessary complication of using 32-wide dispatch when there is no WM program.	2012-05-25 08:45:11 -07:00
Paul Berry	f7df7917e0	i965/blorp: Allocate space for push constants on Gen7. On Gen7, push constants for shader programs are stored in the URB, so blorp code needs to set aside space for them. This was previously unnecessary because blorp code was based on HiZ operations, which don't require any shaders. This patch adds a call from gen7_blorp_exec() to gen7_allocate_push_constants(), to ensure that push constants are assigned the correct location in the URB. It also extracts a new function gen7_emit_urb_state() from gen7_upload_urb(), which is re-used by gen7_blorp_emit_urb_config() to ensure that the URB regions used by all the pipeline stages leave room for the push constants. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	de9752a4e5	i965/blorp: Set the dynamic state upper bound. We know from previous bug fixes (commits `c25e5300cb` and `b2ace06cbb`) that texture border color doesn't work if the dynamic state upper bound is set to 0. Although the blorp engine doesn't make use of texture borders, it seems like we ought to err on the safe side and set this value properly. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	f77959b2c9	i965/blorp: Factor gen6_blorp_emit_batch_head into separate functions. This patch separates out the portions of gen6_blorp_emit_batch_head() that emit 3DSTATE_MULTISAMPLE, 3DSTATE_SAMPLE_MASK, and STATE_BASE_ADDRESS. This paves the way for making the blorp code work on Gen7, where additional command packets (3DSTATE_PUSH_CONSTANT_ALLOC_VS and 3DSTATE_PUSH_CONSTANT_ALLOC_PS) need to be emitted before 3DSTATE_MULTISAMPLE. Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:11 -07:00
Paul Berry	34a5f12e35	i965/blorp: Use MSDISPMODE_PERSAMPLE rendering when necessary This patch modifies the "blorp" WM program so that it can be run in MSDISPMODE_PERSAMPLE (which means that every single sample of a multisampled render target is dispatched to the WM program, not just every pixel). Previously we were using the ugly hack of configuring multisampled destination surfaces as single-sampled, and generating sample indices other than zero by swizzling the pixel coordinates in the WM program. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-25 08:45:10 -07:00
Paul Berry	233c207e9e	i965/blorp: Emit sample index in SAMPLE_LD message when necessary This patch modifies the function brw_blorp_blit_program::texel_fetch() to emit the SI (sample index) argument to the SAMPLE_LD message when reading from a sample index other than zero. Previously we were using the ugly hack of configuring multisampled source surfaces as single-sampled, and accessing sample indices other than zero by swizzling the texture coordinates in the WM program. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:10 -07:00
Paul Berry	665dc82bdc	i965/blorp: Generalize sampling code in preparation for Gen7 This patch generalizes the function brw_blorp_blit_program::texture_lookup() so that it prepares the arguments to the sampler message based on a caller-provided array rather than assuming the argument order is always (u, v). This paves the way for the messages we will need to use in Gen7, which use argument orders (u, lod, v) and (si, u, v) (si=sample index). It will also will allow us to read from arbitrary sample indices on Gen6, by supplying the arguments (u, v, r, lod, si) to the SAMPLE_LD message instead of just (u, v). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-25 08:45:10 -07:00
Paul Berry	52fcc36f11	i965/msaa: Expand odd-sized MSAA surfaces to account for interleaving pattern. Gen6 MSAA buffers (and Gen7 MSAA depth/stencil buffers) interleave MSAA samples in a complex pattern that repeats every 2x2 pixel block. Therefore, when allocating an MSAA buffer, we need to make sure to allocate an integer number of 2x2 blocks; if we don't, then some of the samples in the last row and column will be cut off. Fixes piglit tests "EXT_framebuffer_multisample/unaligned-blit {2,4} color msaa" on i965/Gen6. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-25 08:45:10 -07:00
Thomas Gstädtner	93594f38be	gallium/targets: pass ldflags parameter to MKLIB Without passing the -ldflags parameter before $(LDFLAGS) in some cases flags will be passed to MKLIB which it does not understand. This might be -m64, -m32 or similar. NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Thomas Gstädtner <thomas@gstaedtner.net> Signed-off-by: Brian Paul <brianp@vmware.com>	2012-05-25 09:36:24 -06:00
Vadim Girlin	a1a0974401	Revert "r600g: set round_mode to truncate and get rid of tgsi_f2i on evergreen" This reverts commit `60bf0f05b4`. It seems round_mode behaves differently in some cases depending on the instruction/slot. Reverting it for now. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=50232 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:28:08 +04:00
Vadim Girlin	1c5c4243c9	radeon/llvm: add FLT_TO_UINT, UINT_TO_FLT instructions Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:27:46 +04:00
Vadim Girlin	5a1b59b4e6	radeon/llvm: prepare to revert the round mode state to default Use TRUNC before FLT_TO_INT on evergreen/cayman. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:27:33 +04:00
Vadim Girlin	7fa7c608cb	radeon/llvm: fix sampler index in llvm_emit_tex Sampler index isn't a second source operand for some tgsi texture instructions. Let's assume it's always the last. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=50230 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:27:23 +04:00
Vadim Girlin	029776753b	radeon/llvm: fix opcode for RECIP_UINT_r600 Fixes https://bugs.freedesktop.org/show_bug.cgi?id=50312 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Tested-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:23:06 +04:00
Vadim Girlin	6806f81fb4	radeon/llvm/loader: convert hardcoded gpu name to option Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:22:38 +04:00
Vadim Girlin	482041a538	r600g: add RECIP_INT, PRED_SETE_INT to r600_bytecode_get_num_operands Fixes https://bugs.freedesktop.org/show_bug.cgi?id=50315 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Tested-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-05-25 17:22:12 +04:00
Vinson Lee	35f302d97e	i915g: Check for geometry shader earlier in i915_set_constant_buffer. Fix resource leak defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-24 18:50:07 -07:00
Vinson Lee	5cf693266f	scons: Fix SCons build infrastructure for FreeBSD. This patch gets the FreeBSD SCons build working again. The build still fails though. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-24 18:49:40 -07:00
Tom Stellard	33e7db9a1d	radeon/llvm: Lower UDIV using the Selection DAG	2012-05-24 14:12:32 -04:00
Tom Stellard	d088da917b	radeon/llvm: Remove auto-generated AMDIL->ISA conversion code	2012-05-24 14:12:32 -04:00
Tom Stellard	662ccbfc21	radeon/llvm: Remove AMDIL instructions MULHI, SMUL	2012-05-24 14:12:32 -04:00
Tom Stellard	177b420283	radeon/llvm: Remove AMDIL bitshift instructions (SHL, SHR, USHR)	2012-05-24 14:12:32 -04:00
Tom Stellard	9d41a401dc	radeon/llvm: Remove AMDIL FTOI and ITOF instructions	2012-05-24 14:12:32 -04:00
Tom Stellard	a8ba697c1e	radeon/llvm: Remove AMDIL EXP* instructions	2012-05-24 14:12:31 -04:00
Tom Stellard	dd9927eb36	radeon/llvm: Remove AMDIL ADD instructions	2012-05-24 14:12:31 -04:00
Tom Stellard	1404e6b9fc	radeon/llvm: Remove AMDIL binary instrutions (OR, AND, XOR, NOT)	2012-05-24 14:12:31 -04:00
Tom Stellard	3059c075a7	radeon/llvm: Remove AMDILMachinePeephole pass	2012-05-24 14:12:31 -04:00
Tom Stellard	e9d8901a80	radeon/llvm: Remove AMDIL CMP instructions and associated lowering code	2012-05-24 14:12:31 -04:00
Tom Stellard	ea00632fe0	radeon/llvm: Remove AMDIL ROUND_NEAREST instruction	2012-05-24 14:12:31 -04:00
Tom Stellard	0bfa3b3e96	radeon/llvm: Remove AMDIL ROUND_POSINF instruction	2012-05-24 14:12:31 -04:00
Tom Stellard	d4984f3463	radeon/llvm: Add custom SDNode for FRACT	2012-05-24 14:12:30 -04:00
Tom Stellard	5523502ff9	radeon/llvm: Use -1 as true value for SET* integer instructions	2012-05-24 14:12:30 -04:00
Tom Stellard	86dfae1103	radeon/llvm: Handle SETGE_INT, SETGE_UINT, and SETGT_UINT opcodes Support for these was inadvertently dropped in commit `cee23ab246`	2012-05-24 14:12:30 -04:00
Tom Stellard	cc7a6d2691	radeon/llvm: Avoid error with SI in EmitInstrWithCustomInserter() We need to return immediately after inserting instructions that require S_WAITCNT so that the parent class' custom inserter won't try to insert them again.	2012-05-24 14:12:30 -04:00
Vinson Lee	0f6a3a7de3	tgsi: Initialize Padding struct fields. Fix uninitialized scalar variable defects report by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-23 21:58:37 -07:00
Kenneth Graunke	88128516d4	i965: Gut the separate OpenGL ES extension enabling. We should just set the bits of functionality that we support; the GL/ES1/ES2 flags in extensions.c will take care of advertising the appropriate extensions for the current API. This enables the GL_EXT_texture_compression_dxt1 extension on ES1/ES2 when libtxc_dxtn is installed or the force_s3tc driconf option is set. The main extension code set this up properly, but the ES-specific code failed to do so. Otherwise, the extension strings reported by es1_info, es2_info, and glxinfo all remain the same. This patch manually disables the ARB_framebuffer_object bit on ES to preserve the behavior of `1c0f5d8324`. v2: Rebase, fix the i915 Makefile, and unconditionally set the OES_draw_texture bit as core Mesa will only apply it to ES1 now. Tested-by: Daniel Charles <daniel.charles@intel.com> [v1] Reviewed-by: Chad Versace <chad.versace@linux.intel.com> [v1] Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 17:17:54 -07:00
Kenneth Graunke	d4667516b6	mesa: Remove the OES_draw_texture extension from ES2. This extension appears to be written against ES 1.0. In ES 2.0, you really want to be using FBOs instead. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 17:03:35 -07:00
Jordan Justen	dc50145253	i965: use cut index to handle primitive restart when possible If the primitive restart index and the primitive type can be handled by the cut index feature, then use the hardware to handle the primitive restart feature. The VBO module's software handling of primitive restart is used as a fall back. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-23 15:19:09 -07:00
Jordan Justen	f9389fbfb2	i965: add flag to enable cut_index When brw->prim_restart.enable_cut_index is set, the cut index will be enabled when uploading index_buffer commands. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-23 15:19:09 -07:00
Jordan Justen	df7d1323de	i965: create code path to handle primitive restart in hardware For newer hardware we disable the VBO module's software handling of primitive restart. We now handle primitive restarts in brw_handle_primitive_restart. The initial version of brw_handle_primitive_restart simply calls vbo_sw_primitive_restart, and therefore still uses the VBO module software primitive restart support. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-23 15:19:09 -07:00
Paul Berry	9f6932cb83	glsl/tests: Add .gitignore for uniform initialization unit test. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-23 14:24:33 -07:00
Paul Berry	aa173e16a0	glsl/constant propagation: kill whole var if LHS involves array indexing. When considering which components of a variable were killed by an assignment, constant propagation would previously just use the write mask of the assignment. This worked if the LHS of the assignment was simple, e.g.: v.xy = ...; // (assign (xy) (var_ref v) ...) But it did the wrong thing if the LHS of the assignment involved an array indexing operator, since in this case the write mask is always (x): v[i] = ...; // (assign (x) (deref_array (var_ref v) (var_ref i)) ...) In general, we can't predict which vector component will be selected by array indexing, so the only safe thing to do in this case is to kill the entire variable. Fixes piglit tests {fs,vs}-vector-indexing-kills-all-channels.shader_test. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2012-05-23 14:21:48 -07:00
Ian Romanick	b45052b3f7	glsl/tests: Add test for uniform initialization by the linker v2: Put unit tests in src/glsl/tests rather than tests/glsl. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:42:08 -07:00
Ian Romanick	49da2590c2	mesa: Use initializers to configure samplers Now that the linker handles initializers of samplers just like any other uniform, a bunch of this annoying code is unnecessary. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:42:08 -07:00
Ian Romanick	75dac69262	ir_to_mesa: Don't set initial uniform values again This work is now done by the linker, so we don't need to keep doing it here. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:42:08 -07:00
Ian Romanick	c343b980d6	ir_to_mesa: Propagate initial values in _mesa_associate_uniform_storage The linker may have set initial values for uniforms. Propagate these values to the driver's backing storage when it is first associated. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:42:08 -07:00
Ian Romanick	76027f5b5c	glsl: Propagate sampler uniform initializers to gl_shader_program::SamplerUnits Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:42:07 -07:00
Ian Romanick	b610881317	glsl: Initialize samplers to 0, propagate sampler values to the gl_program The spec requires that samplers be initialized to 0. Since this differs from the 1-to-1 mapping of samplers to texture units assumed by ARB assembly shaders (and the gl_program structure), be sure to propagate this date from the gl_shader_program to the gl_program. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> CC: Vadim Girlin <vadimgirlin@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=49088	2012-05-23 11:42:07 -07:00
Ian Romanick	a2e623054b	glsl: Set initial values for uniforms in the linker v2: Fix handling of arrays-of-structure. Thanks to Eric Anholt for pointing this out. v3: Minor comment change based on feedback from Ken. Fixes piglit glsl-1.20/execution/uniform-initializer/fs-structure-array and glsl-1.20/execution/uniform-initializer/vs-structure-array. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 11:41:49 -07:00
Eric Anholt	29362875f2	i965/gen6+: Add support for GL_ARB_blend_func_extended. v2: Add support for gen6, and don't turn it on if blending is disabled. (fixes GPU hang), and note it in docs/GL3.txt Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-05-23 10:46:15 -07:00
Eric Anholt	175ad8050e	mesa: Keep a computed value for dual source blend func with each buffer. The i965 driver needed this as well for hardware setup, so instead of duplicating the logic, just save it off. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Dave Airlie <airlied@redhat.com>	2012-05-23 10:45:43 -07:00
Eric Anholt	68216f3581	i965/gen6+: Add support for fast depth clears. Improves citybench high-res performance 3.0% +- 0.4%, n=10. Improves Lightsmark 1024x768 performance 0.74% +/- 0.20% (n=78). No significant difference on openarena (n=5, didn't fast clear) or nexuiz (n=3). Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:40:11 -07:00
Eric Anholt	5b248e5982	i965/gen6: Add CC viewport state setup to blorp code. While it doesn't have the same warning in the simulator as in gen7, let's emit it out of paranoia. We wouldn't want our resolves of some previous clear to get clamped to some current clamping value. Suggested-by: pretty much everyone	2012-05-23 10:39:45 -07:00
Eric Anholt	39a91be20d	i965/gen7: Add CC viewport setup to blorp code. When doing fast clears, a fulsim warning said that the batch was being emitted without the viewport set up. While the fast clear pass I was looking at doesn't use the clear value, the later resolves which also didn't set up the vieport would trigger the same. It's not obvious from the error message whether it meant "fast clear value gets clamped to something you haven't defined" or "fast clear value doesn't get clamped, and I saw it was out of the current (uninitialized) range, and you probably wanted it clamped to that (uninitialized) range". Be paranoid and assume the first case. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:27 -07:00
Eric Anholt	54308f78a2	i965: Drop a layer of indirection in doing HiZ resolves. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:27 -07:00
Eric Anholt	072634da4a	i965: Replace intel_need_resolve with the hiz ops it maps to. Having this enum separate caused us to need a bunch of helper functions to translate to the op to be executed. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:27 -07:00
Eric Anholt	5b226ad603	i965: Add an interface for doing hiz ops from C code. This required moving gen6_hiz_op, and I put it in intel_resolve_map.h for the next commit. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:27 -07:00
Eric Anholt	7da9795070	i965: Rename the clear function for this driver. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	3e1656567c	i965: Simplify the remaining clear logic by relying on the meta clear. The GLSL clear path doesn't need any buffer presence checks, since those are already handled in the normal drawing path code. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	7c3e88f1fc	i965: Switch blit color clears to tri clears on gen4/5. Our understanding is that the 3D engine is supposed to be faster anyway. We used to have more overhead in our tri clear path than we do today, which would have led to this choice. But given that we almost always see a depth clear along with a color clear, the path was hardly exercised anyway. Also, the color mask logic was broken in the presence of GL_EXT_draw_buffers2's per-buffer colormask. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	fa15b0f3f0	i965: Remove dead logic for non-tri depth/stencil clears. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	a3967ff441	i965: We always have GLSL, so always use it for tri clears. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	03c9044c2e	i915: Drop gen4+ code from the forked clear code. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Eric Anholt	11892ea986	intel: Fork the intel_clear.c file between i915 and i965. This logic is wasted on i965 when we want to just always do GLSL tri clears. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2012-05-23 10:18:26 -07:00
Vadim Girlin	c91b4edff9	st/mesa: set stObj->lastLevel in guess_and_alloc_texture Fixes lockups/asserts with depthstencil-render-miplevels tests and r600g. Should also fix https://bugs.freedesktop.org/show_bug.cgi?id=50033 NOTE: This is a candidate for the 8.0 branch. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-23 06:07:00 +04:00
Paul Berry	ea8e854b2c	i965: Completely annotate the batch bo when aub dumping. Previously, when the environment variable INTEL_DEBUG=aub was set, mesa would simply instruct DRM to start dumping data to an .aub file, but we would not provide DRM with any information about the format of the data in various buffers. As a result, a lot of the data in the generate .aub file would be unannotated, making further data analysis difficult. This patch causes the entire contents of each batch buffer to be annotated using the data in brw->state_batch_list (which was previously used only to annotate the output of INTEL_DEBUG=bat). This includes data that was allocated by brw_state_batch, such as binding tables, surface and sampler states, depth/stencil state, and so on. The new annotation mechanism requires DRM version 2.4.34. Reviewed-by: Eric Anholt <eric@anholt.net>	2012-05-22 15:19:00 -07:00
Paul Berry	1b87a93983	intel: When AUB dumping, flush before emitting final bitmap command. When we are generating an AUB dump, we make a final call to aub_dump_bmp() as the context is being destroyed, to ensure that any rendering performed before the application exits can be seen during a simulation run. However, we were doing this before flushing the batch buffer; as a result simulation runs would not always see the effect of all rendering commands. This patch flushes the batch buffer just before making the final call to aub_dump_bmp(), to ensure that all rendering is properly captured in the final bitmap.	2012-05-22 15:19:00 -07:00
José Fonseca	7a75e7d6e8	llvmpipe: Fix alpha testing precision on rgba8 formats. This is a long standing problem, that recently surfaced with the change to enable perspective correct color interpolation. A fix for all possible formats is left to the future. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-05-22 19:23:49 +01:00
Vinson Lee	e4fb332af1	scons: Do not build glx and egl on Cygwin. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-05-22 10:15:14 -07:00
Christoph Bumiller	89155ba71d	nv30: check for NULL vertex buffers in prevalidate_vbufs	2012-05-22 15:22:10 +02:00
Christoph Bumiller	a054fd8268	nv50: make unaligned index buffer offsets work again Messed up in `ef7bb28129`.	2012-05-22 12:50:12 +02:00
Christoph Bumiller	91fb5e0394	nvc0: don't set NEW_IDXBUF in nvc0_switch_pipe_context if none is bound	2012-05-22 12:45:19 +02:00
James Benton	8a933e36d1	llvmpipe: Added a error counter to lp_test_conv. Useful for keeping track of progress when fixing errors! Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:53 +01:00
James Benton	383c1b649b	llvmpipe: Changed known failures in lp_test_conv. To comply with the recent fixes to lp_bld_conv. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:51 +01:00
James Benton	4203a0b034	llvmpipe: Added fixed point types tests to lp_test_conv. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:49 +01:00
James Benton	a3d4af0c00	gallivm: Fixed erroneous optimisation in lp_build_min/max. Previously assumed normalised was 0 to 1, but it can be -1 to 1 if type is signed. Tested with lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:47 +01:00
James Benton	fdeb0394cb	gallivm: Compensate for lp_const_offset in lp_build_conv. Fixing a /FIXME/ to remove errors in integer conversion in lp_build_conv. Tested using lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:46 +01:00
James Benton	f89b1f4ba4	gallivm: Fixed overflow in lp_build_clamped_float_to_unsigned_norm. Tested with lp_test_conv and lp_test_format, reduced errors. Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 20:24:44 +01:00
Brian Paul	c286278481	docs: add link to 8.0.3 release notes	2012-05-21 09:26:04 -06:00
Paul Seidler	a0dffe8701	tests: include mesa headers else they will fail for fresh installs Signed-off-by: Brian Paul <brianp@vmware.com>	2012-05-21 08:42:19 -06:00
Lukas Rössler	6178b653c7	glu: fix two Clang warnings This patch removes two Clang warnings in GLU: The first one seems to be an actual bug in mapdesc.cc: Clang complains that sizeof(dest) will return the size of REAL*[MAXCOORDS], instead of the intended REAL[MAXCOORDS][MAXCOORDS]. The second one is just cosmetic because Clang doesn't like extra parentheses. NOTE: This is a candidate for the 8.0 branch Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-21 08:29:23 -06:00
Homer Hsing	ed9d1bef81	docs: fix a typo Signed-off-by: Brian Paul <brianp@vmware.com>	2012-05-21 08:07:20 -06:00
ojab	3d2bf91cc1	Filter out -Wcovered-switch-default from LLVM_CFLAGS Signed-off-by: José Fonseca <jfonseca@vmware.com>	2012-05-21 08:37:06 +01:00
Tom Stellard	cee23ab246	radeon/llvm: Handle selectcc DAG node R600 can now select instructions from the selectcc DAG node, which is typically lowered to one of the SET* instructions.	2012-05-20 16:27:31 -04:00
Brian Paul	239792fb22	st/mesa: use pipe_sampler_view_release() in st_destroy_context_priv() Fixes another case of sampler views being created by one context, shared by another, then deleted by the first, leaving a dangling pipe context pointer. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-05-19 08:28:57 -06:00
Brian Paul	c9cb9cf050	mesa: use F_TO_I() instead of IROUND() Use it where performance matters more and the exact method of float->int conversion/rounding isn't terribly important. There should no net change here since F_TO_I() is the new name of the old IROUND() function. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-05-19 08:28:57 -06:00
Brian Paul	699c1894ee	mesa: reimplement IROUND(), add F_TO_I() The different implementations of IROUND() behaved differently and in the case of fistp, depended on the current x86 FPU rounding mode. This caused some tests like piglit roundmode-pixelstore and roundmode-getintegerv to fail on 32-bit x86 but pass on 64-bit x86. Now IROUND() always rounds to the nearest integer (away from zero). The new F_TO_I function converts a float to an int by whatever means is fastest. We'll use this where we're more concerned with performance and not too worried to how the conversion is done. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-05-19 08:28:57 -06:00
Brian Paul	31d59c78f0	mesa: fix Z32_FLOAT -> uint conversion functions The IROUND converted all arguments to 0 or 1. That's not what we wanted. NOTE: This is a candidate for the 8.0 branch. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-05-19 08:28:57 -06:00
Brian Paul	c3991e1c57	st/mesa: remove unused pipe variable	2012-05-19 08:28:57 -06:00
Brian Paul	bd302f36c4	svga: whitespace, comments, formatting clean-ups	2012-05-19 08:28:57 -06:00
Brian Paul	6792969cbc	st/mesa: added st_print_current_vertex_program(), for debugging	2012-05-19 08:28:56 -06:00
Brian Paul	2786343896	svga: return PIPE_OK instead of 0 And fix the emit_rss() function's return type.	2012-05-19 08:28:56 -06:00
Brian Paul	fc71e0b4a8	svga: fix zero-stride vertex array bug For zero-stride vertex arrays, the svga driver copies the value into the constant value and uses that value in the shader. The recent gallium-userbuf changes caused a regression in this. An example symptom was per-primitive glColor3f() calls getting ignored. Where we copied the vertex value from the vertex buffer to the constant buffer we neglected to take into account the pipe_vertex_buffer::buffer_offset field. Adding that value to the source offset fixes the problem. Actually, it looks like we should have been doing this all along, but it never was an issue before for some reason.	2012-05-19 08:28:56 -06:00
Brian Paul	0161691f35	mesa: add GLSL_REPORT_ERRORS debug flag If the MESA_GLSL env var contains "errors", GLSL compilation and link errors will be reported to stderr. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-19 08:28:56 -06:00
Brian Paul	1c333745f3	mesa: add some comments on shaderapi.c functions	2012-05-19 08:28:56 -06:00
Vinson Lee	315140969d	mesa: Remove undefinition of _P symbol. IRIX isn't used anymore. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-18 23:24:33 -07:00
Ian Romanick	0c6f4cd335	Import release notes for 8.0.3, add news item Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2012-05-18 16:27:17 -07:00
Jeremy Huddleston	27b821bc95	darwin: Address a build failure on Leopard and earlier OS versions <https://trac.macports.org/ticket/34499> Regression-from: `51691f0767` Signed-off-by: Jeremy Huddleston <jeremyhu@apple.com>	2012-05-18 11:32:40 -07:00
Michel Dänzer	d59b2c4b53	radeonsi: Only honour point related rasterizer state when rendering points. Avoids hangs when not rendering points.	2012-05-18 18:13:56 +02:00
Michel Dänzer	dd9d619459	radeonsi: Fix parameter cache offsets for fragment shader inputs.	2012-05-18 15:01:10 +02:00
Vinson Lee	e8a86d36f3	gallium/tgsi/text: Ensure ret is initialized in parse_immediate_data. Fix uninitialized scalar variable defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2012-05-17 21:59:08 -07:00
Tom Stellard	c20e741799	radeon/llvm: Fix segfault while lowering lrp intrinsic	2012-05-17 20:42:16 -04:00
Tom Stellard	7e3cd8df18	radeon/llvm: Add DAG nodes for MIN instructions Also, remove the AMDIL MIN* instruction defs.	2012-05-17 20:42:16 -04:00
José Fonseca	3f7a5ffac7	llvmpipe: Avoid adding floating point zero to flat inputs. Which could clobber integer inputs, if the addition is not optimized away (e.g., if optimizations are disabled for debugging purposes).	2012-05-18 01:03:13 +01:00
José Fonseca	00eb74b275	Fix fetching integer inputs.	2012-05-18 00:55:13 +01:00
Olivier Galibert	5d10d75727	llvmpipe: Implement TXQ. Piglits test for fragment shaders pass, vertex shaders fail. The actual failure seems to be in the interpolators, and not the textureSize query. Signed-off-by: Olivier Galibert <galibert@pobox.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: José Fonseca <jose.r.fonseca@gmail.com>	2012-05-18 00:27:28 +01:00
Olivier Galibert	1ec421823b	llvmpipe: Don't mess with the provoking vertex when inverting a triangle. Fixes a bunch of piglit tests related to flat interpolation of floats. Signed-off-by: Olivier Galibert <galibert@pobox.com> Signed-off-by: José Fonseca <jose.r.fonseca@gmail.com>	2012-05-18 00:07:18 +01:00
Tom Stellard	c6c8a05c50	radeon/llvm: Lower lrp intrinsic during ISel	2012-05-17 14:48:10 -04:00
Tom Stellard	ef8e66bc16	radeon/llvm: Remove AMDIL MAD instruction defs	2012-05-17 14:48:10 -04:00
Tom Stellard	d07473fcf4	radeon/llvm: Remove AMDIL MUL_IEEE* instructions	2012-05-17 14:48:10 -04:00
Tom Stellard	5187948bc2	r600g: Handle MUL_IEEE in r600_bytecode_get_num_operands	2012-05-17 14:48:09 -04:00
Tom Stellard	1fe70c6ae1	radeon/llvm: Expand fsub during ISel	2012-05-17 14:48:09 -04:00
Tom Stellard	9916f2d2af	radeon/llvm: Remove AMDIL floating-point ADD instruction defs	2012-05-17 14:48:09 -04:00
Tom Stellard	91484de22d	radeon/llvm: Remove AMDIL CMOVLOG* instruction defs	2012-05-17 14:48:09 -04:00
Tom Stellard	9a020092ae	radeon/llvm: Move lowering of ABS_i32 to ISel	2012-05-17 14:48:09 -04:00
Tom Stellard	89b945591b	radeon/llvm: Remove sub patterns from AMDILInstrPatterns.td	2012-05-17 14:48:09 -04:00
Tom Stellard	431bb79a41	radeon/llvm: Add custom SDNodes for MAX We now lower the various intrinsics for max to SDNodes and then use tablegen patterns to lower the SDNodes to instructions.	2012-05-17 14:48:09 -04:00

1085 changed files with 46115 additions and 154617 deletions

									
										11

.dir-locals.el
									
										Normal file
									
												View File
												
				@@ -0,0 +1,11 @@

				((nil

				  (indent-tabs-mode . nil)

				  (tab-width . 8)

				  (c-basic-offset . 3)

				  (c-file-style . "stroustrup")

				  (fill-column . 78)

				  (eval . (progn

					    (c-set-offset 'innamespace '0)

					    (c-set-offset 'inline-open '0)))

				  )

				 )

10

.emacs-dirvars

View File

@@ -1,10 +0,0 @@
 ;; -*- emacs-lisp -*-
 ;;
 ;; This file is processed by the dirvars emacs package.  Each variable
 ;; setting below is performed when this dirvars file is loaded.
 ;;
 indent-tabs-mode: nil
 tab-width: 8
 c-basic-offset: 3
 kde-emacs-after-parent-string: ""
 evaluate: (c-set-offset 'inline-open '0)

1

.gitignore vendored

View File

@@ -40,3 +40,4 @@ Makefile.in
 .dir-locals.el
 .deps/
 .libs/
 /Makefile

									
										271

Makefile
									
												View File
											
				@@ -1,271 +0,0 @@

				# Top-level Mesa makefile

				TOP = .

				SUBDIRS = src

				# The git command below generates an empty string when we're not

				# building in a GIT tree (i.e., building from a release tarball).

				default: $(TOP)/configs/current

					@$(TOP)/bin/extract_git_sha1

					@for dir in $(SUBDIRS) ; do \

						if [ -d $$dir ] ; then \

							(cd $$dir && $(MAKE)) || exit 1 ; \

						fi \

					done

				all: default

				doxygen:

					cd doxygen && $(MAKE)

				check:

					make -C src/glsl/tests check

					make -C tests check

				clean:

					-@touch $(TOP)/configs/current

					-@for dir in $(SUBDIRS) ; do \

						if [ -d $$dir ] ; then \

							(cd $$dir && $(MAKE) clean) ; \

						fi \

					done

					-@test -s $(TOP)/configs/current || rm -f $(TOP)/configs/current

				realclean: clean

					-rm -rf lib*

					-rm -f $(TOP)/configs/current

					-rm -f $(TOP)/configs/autoconf

					-rm -rf autom4te.cache

					-find . '(' -name '*.o' -o -name '*.a' -o -name '*.so' -o \

					  -name depend -o -name depend.bak ')' -exec rm -f '{}' ';'

				distclean: realclean

				install:

					@for dir in $(SUBDIRS) ; do \

						if [ -d $$dir ] ; then \

							(cd $$dir && $(MAKE) install) || exit 1 ; \

						fi \

					done

				.PHONY: default doxygen clean realclean distclean install check

				# If there's no current configuration file

				$(TOP)/configs/current:

					@echo

					@echo

					@echo "Please choose a configuration from the following list:"

					@ls -1 $(TOP)/configs | grep -v "current\|default\|CVS\|autoconf.*"

					@echo

					@echo "Then type 'make <config>' (ex: 'make linux-x86')"

					@echo

					@echo "Or, run './configure' then 'make'"

					@echo "See './configure --help' for details"

					@echo

					@echo "(ignore the following error message)"

					@exit 1

				# Rules to set/install a specific build configuration

				aix \

				aix-64 \

				aix-64-static \

				aix-gcc \

				aix-static \

				autoconf \

				bluegene-osmesa \

				bluegene-xlc-osmesa \

				catamount-osmesa-pgi \

				darwin \

				darwin-fat-32bit \

				darwin-fat-all \

				freebsd \

				freebsd-dri \

				freebsd-dri-amd64 \

				freebsd-dri-x86 \

				hpux10 \

				hpux10-gcc \

				hpux10-static \

				hpux11-32 \

				hpux11-32-static \

				hpux11-32-static-nothreads \

				hpux11-64 \

				hpux11-64-static \

				hpux11-ia64 \

				hpux11-ia64-static \

				hpux9 \

				hpux9-gcc \

				irix6-64 \

				irix6-64-static \

				irix6-n32 \

				irix6-n32-static \

				irix6-o32 \

				irix6-o32-static \

				linux \

				linux-i965 \

				linux-alpha \

				linux-alpha-static \

				linux-debug \

				linux-dri \

				linux-dri-debug \

				linux-dri-x86 \

				linux-dri-x86-64 \

				linux-dri-ppc \

				linux-dri-xcb \

				linux-egl \

				linux-indirect \

				linux-fbdev \

				linux-ia64-icc \

				linux-ia64-icc-static \

				linux-icc \

				linux-icc-static \

				linux-llvm \

				linux-llvm-debug \

				linux-opengl-es \

				linux-osmesa \

				linux-osmesa-static \

				linux-osmesa16 \

				linux-osmesa16-static \

				linux-osmesa32 \

				linux-ppc \

				linux-ppc-static \

				linux-profile \

				linux-sparc \

				linux-sparc5 \

				linux-static \

				linux-ultrasparc \

				linux-tcc \

				linux-x86 \

				linux-x86-debug \

				linux-x86-32 \

				linux-x86-64 \

				linux-x86-64-debug \

				linux-x86-64-profile \

				linux-x86-64-static \

				linux-x86-profile \

				linux-x86-static \

				netbsd \

				openbsd \

				osf1 \

				osf1-static \

				solaris-x86 \

				solaris-x86-gcc \

				solaris-x86-gcc-static \

				sunos4 \

				sunos4-gcc \

				sunos4-static \

				sunos5 \

				sunos5-gcc \

				sunos5-64-gcc \

				sunos5-smp \

				sunos5-v8 \

				sunos5-v8-static \

				sunos5-v9 \

				sunos5-v9-static \

				sunos5-v9-cc-g++ \

				ultrix-gcc:

					@ if test -f configs/current -o -L configs/current; then \

						if ! cmp configs/$@ configs/current > /dev/null; then \

							echo "Please run 'make realclean' before changing configs" ; \

							exit 1 ; \

						fi ; \

					else \

						cd configs && rm -f current && ln -s $@ current ; \

					fi

					$(MAKE) default

				# Rules for making release tarballs

				PACKAGE_VERSION=8.1-devel

				PACKAGE_DIR = Mesa-$(PACKAGE_VERSION)

				PACKAGE_NAME = MesaLib-$(PACKAGE_VERSION)

				EXTRA_FILES = \

					aclocal.m4					\

					configure					\

					tests/Makefile.in				\

					tests/glx/Makefile.in				\

					src/glsl/glsl_parser.cpp			\

					src/glsl/glsl_parser.h				\

					src/glsl/glsl_lexer.cpp				\

					src/glsl/glcpp/glcpp-lex.c			\

					src/glsl/glcpp/glcpp-parse.c			\

					src/glsl/glcpp/glcpp-parse.h			\

					src/mesa/main/api_exec_es1.c			\

					src/mesa/main/api_exec_es1_dispatch.h		\

					src/mesa/main/api_exec_es1_remap_helper.h	\

					src/mesa/main/api_exec_es2.c			\

					src/mesa/main/api_exec_es2_dispatch.h		\

					src/mesa/main/api_exec_es2_remap_helper.h	\

					src/mesa/program/lex.yy.c			\

					src/mesa/program/program_parse.tab.c		\

					src/mesa/program/program_parse.tab.h

				IGNORE_FILES = \

					-x autogen.sh

				parsers: configure

					-@touch $(TOP)/configs/current

					$(MAKE) -C src/glsl glsl_parser.cpp glsl_parser.h glsl_lexer.cpp

					$(MAKE) -C src/glsl/glcpp glcpp-lex.c glcpp-parse.c glcpp-parse.h

					$(MAKE) -C src/mesa program/lex.yy.c program/program_parse.tab.c program/program_parse.tab.h

				# Everything for new a Mesa release:

				ARCHIVES = $(PACKAGE_NAME).tar.gz \

					$(PACKAGE_NAME).tar.bz2 \

					$(PACKAGE_NAME).zip \

				tarballs: md5

					rm -f ../$(PACKAGE_DIR) $(PACKAGE_NAME).tar

				# Helper for autoconf builds

				ACLOCAL = aclocal

				ACLOCAL_FLAGS =

				AUTOCONF = autoconf

				AC_FLAGS =

				aclocal.m4: configure.ac acinclude.m4

					$(ACLOCAL) $(ACLOCAL_FLAGS)

				configure: configure.ac aclocal.m4 acinclude.m4

					$(AUTOCONF) $(AC_FLAGS)

				manifest.txt: .git

					( \

						ls -1 $(EXTRA_FILES) ; \

						git ls-files $(IGNORE_FILES) \

					) | sed -e '/^\(.*\/\)\?\./d' -e "s@^@$(PACKAGE_DIR)/@" > $@

				../$(PACKAGE_DIR):

					ln -s $(PWD) $@

				$(PACKAGE_NAME).tar: parsers ../$(PACKAGE_DIR) manifest.txt

					cd .. ; tar -cf $(PACKAGE_DIR)/$(PACKAGE_NAME).tar -T $(PACKAGE_DIR)/manifest.txt

				$(PACKAGE_NAME).tar.gz: $(PACKAGE_NAME).tar ../$(PACKAGE_DIR)

					gzip --stdout --best $(PACKAGE_NAME).tar > $(PACKAGE_NAME).tar.gz

				$(PACKAGE_NAME).tar.bz2: $(PACKAGE_NAME).tar

					bzip2 --stdout --best $(PACKAGE_NAME).tar > $(PACKAGE_NAME).tar.bz2

				$(PACKAGE_NAME).zip: parsers ../$(PACKAGE_DIR) manifest.txt

					rm -f $(PACKAGE_NAME).zip ; \

					cd .. ; \

					zip -q -@ $(PACKAGE_NAME).zip < $(PACKAGE_DIR)/manifest.txt ; \

					mv $(PACKAGE_NAME).zip $(PACKAGE_DIR)

				md5: $(ARCHIVES)

					@-md5sum $(PACKAGE_NAME).tar.gz

					@-md5sum $(PACKAGE_NAME).tar.bz2

					@-md5sum $(PACKAGE_NAME).zip

				am--refresh:

				.PHONY: tarballs md5 am--refresh

									
										124

Makefile.am
									
										Normal file
									
												View File
												
				@@ -0,0 +1,124 @@

				# Copyright © 2012 Intel Corporation

				#

				# Permission is hereby granted, free of charge, to any person obtaining a

				# copy of this software and associated documentation files (the "Software"),

				# to deal in the Software without restriction, including without limitation

				# the rights to use, copy, modify, merge, publish, distribute, sublicense,

				# and/or sell copies of the Software, and to permit persons to whom the

				# Software is furnished to do so, subject to the following conditions:

				#

				# The above copyright notice and this permission notice (including the next

				# paragraph) shall be included in all copies or substantial portions of the

				# Software.

				#

				# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR

				# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,

				# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL

				# THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER

				# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING

				# FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS

				# IN THE SOFTWARE.

				SUBDIRS = src

				doxygen:

					cd doxygen && $(MAKE)

				check-local:

					$(MAKE) -C src/mapi/glapi/tests check

					$(MAKE) -C src/mesa/main/tests check

					$(MAKE) -C src/glsl/tests check

					$(MAKE) -C src/glx/tests check

				clean-local:

					-@touch $(top_builddir)/configs/current

					-@for dir in $(SUBDIRS) ; do \

						if [ -d $$dir ] ; then \

							(cd $$dir && $(MAKE) clean) ; \

						fi \

					done

					-@test -s $(top_builddir)/configs/current || rm -f $(top_builddir)/configs/current

				distclean-local:

					-rm -rf lib*

					-rm -f $(top_builddir)/configs/current

					-find . '(' -name '*.o' -o -name '*.a' -o -name '*.so' -o \

					  -name depend -o -name depend.bak ')' -exec rm -f '{}' ';'

				.PHONY: doxygen

				# Rules for making release tarballs

				PACKAGE_VERSION=8.1-devel

				PACKAGE_DIR = Mesa-$(PACKAGE_VERSION)

				PACKAGE_NAME = MesaLib-$(PACKAGE_VERSION)

				EXTRA_FILES = \

					aclocal.m4					\

					configure					\

					src/glsl/glsl_parser.cc				\

					src/glsl/glsl_parser.h				\

					src/glsl/glsl_lexer.cc				\

					src/glsl/glcpp/glcpp-lex.c			\

					src/glsl/glcpp/glcpp-parse.c			\

					src/glsl/glcpp/glcpp-parse.h			\

					src/mesa/main/api_exec_es1.c			\

					src/mesa/main/api_exec_es1_dispatch.h		\

					src/mesa/main/api_exec_es1_remap_helper.h	\

					src/mesa/main/api_exec_es2.c			\

					src/mesa/main/api_exec_es2_dispatch.h		\

					src/mesa/main/api_exec_es2_remap_helper.h	\

					src/mesa/program/lex.yy.c			\

					src/mesa/program/program_parse.tab.c		\

					src/mesa/program/program_parse.tab.h

				IGNORE_FILES = \

					-x autogen.sh

				parsers: configure

					-@touch $(top_builddir)/configs/current

					$(MAKE) -C src/glsl glsl_parser.cc glsl_parser.h glsl_lexer.cc

					$(MAKE) -C src/glsl/glcpp glcpp-lex.c glcpp-parse.c glcpp-parse.h

					$(MAKE) -C src/mesa program/lex.yy.c program/program_parse.tab.c program/program_parse.tab.h

				# Everything for new a Mesa release:

				ARCHIVES = $(PACKAGE_NAME).tar.gz \

					$(PACKAGE_NAME).tar.bz2 \

					$(PACKAGE_NAME).zip

				tarballs: md5

					rm -f ../$(PACKAGE_DIR) $(PACKAGE_NAME).tar

				manifest.txt: .git

					( \

						ls -1 $(EXTRA_FILES) ; \

						git ls-files $(IGNORE_FILES) \

					) | sed -e '/^\(.*\/\)\?\./d' -e "s@^@$(PACKAGE_DIR)/@" > $@

				../$(PACKAGE_DIR):

					ln -s $(PWD) $@

				$(PACKAGE_NAME).tar: parsers ../$(PACKAGE_DIR) manifest.txt

					cd .. ; tar -cf $(PACKAGE_DIR)/$(PACKAGE_NAME).tar -T $(PACKAGE_DIR)/manifest.txt

				$(PACKAGE_NAME).tar.gz: $(PACKAGE_NAME).tar ../$(PACKAGE_DIR)

					gzip --stdout --best $(PACKAGE_NAME).tar > $(PACKAGE_NAME).tar.gz

				$(PACKAGE_NAME).tar.bz2: $(PACKAGE_NAME).tar

					bzip2 --stdout --best $(PACKAGE_NAME).tar > $(PACKAGE_NAME).tar.bz2

				$(PACKAGE_NAME).zip: parsers ../$(PACKAGE_DIR) manifest.txt

					rm -f $(PACKAGE_NAME).zip ; \

					cd .. ; \

					zip -q -@ $(PACKAGE_NAME).zip < $(PACKAGE_DIR)/manifest.txt ; \

					mv $(PACKAGE_NAME).zip $(PACKAGE_DIR)

				md5: $(ARCHIVES)

					@-md5sum $(PACKAGE_NAME).tar.gz

					@-md5sum $(PACKAGE_NAME).tar.bz2

					@-md5sum $(PACKAGE_NAME).zip

				.PHONY: tarballs md5

1

bin/.gitignore vendored

View File

@@ -5,3 +5,4 @@ install-sh
 /missing
 ylwrap
 compile
 ar-lib

									
										48

bin/confdiff.sh
									
												View File
											
				@@ -1,48 +0,0 @@

				#!/bin/bash -e

				usage()

				{

					echo "Usage: $0 <target1> <target2>"

					echo "Highlight differences between Mesa configs"

					echo "Example:"

					echo "  $0 linux linux-x86"

				}

				die()

				{

					echo "$@" >&2

					return 1

				}

				case "$1" in

				-h|--help) usage; exit 0;;

				esac

				[ $# -lt 2 ] && die 2 targets needed. See $0 --help

				target1=$1

				target2=$2

				topdir=$(cd "`dirname $0`"/..; pwd)

				cd "$topdir"

				[ -f "./configs/$target1" ] || die Missing configs/$target1

				[ -f "./configs/$target2" ] || die Missing configs/$target2

				trap 'rm -f "$t1" "$t2"' 0

				t1=$(mktemp)

				t2=$(mktemp)

				make -f- -n -p <<EOF | sed '/^# Not a target/,/^$/d' > $t1

				TOP = .

				include \$(TOP)/configs/$target1

				default:

				EOF

				make -f- -n -p <<EOF | sed '/^# Not a target/,/^$/d' > $t2

				TOP = .

				include \$(TOP)/configs/$target2

				default:

				EOF

				diff -pu -I'^#' $t1 $t2

20

bin/extract_git_sha1

View File

@@ -1,20 +0,0 @@
 #!/bin/sh
 if [ ! -f src/mesa/main/git_sha1.h ]; then
 	touch src/mesa/main/git_sha1.h
 fi
 if [ ! -d .git ]; then
 	exit
 fi
 if which git > /dev/null; then
     # Extract the 7-digit "short" SHA1 for the current HEAD, convert
     # it to a string, and wrap it in a #define.  This is used in
     # src/mesa/main/version.c to put the GIT SHA1 in the GL_VERSION string.
     git log -n 1 --oneline |\
 	sed 's/^\([^ ]*\) .*/#define MESA_GIT_SHA1 "git-\1"/' \
 	> src/mesa/main/git_sha1.h.tmp
     if ! cmp -s src/mesa/main/git_sha1.h.tmp src/mesa/main/git_sha1.h; then
     	mv src/mesa/main/git_sha1.h.tmp src/mesa/main/git_sha1.h
     fi
 fi

									
										23

bin/shortlog_mesa.sh
									
										Executable file
									
												View File
												
				@@ -0,0 +1,23 @@

				#!/bin/bash

				# This script is used to generate the list of changes that

				# appears in the release notes files, with HTML formatting.

				typeset -i in_log=0

				git shortlog $* | while read l

				do

				    if [ $in_log -eq 0 ]; then

					echo '<p>'$l'</p>'

					echo '<ul>'

					in_log=1

				    elif echo "$l" | egrep -q '^$' ; then

					echo '</ul>'

					echo

					in_log=0

				    else

				        mesg=$(echo $l | sed 's/ (cherry picked from commit [0-9a-f]\+)//;s/\&/&amp;/g;s/</\&lt;/g;s/>/\&gt;/g')

					echo '  <li>'${mesg}'</li>'

				    fi

				done

									
										17

bin/version.mk
									
												View File
											
				@@ -1,17 +0,0 @@

				#!/usr/bin/make -sf

				# Print the various Mesa version fields. This is mostly used to add the

				# version to configure.

				# This reflects that this script is usually called from the toplevel

				TOP = .

				include $(TOP)/configs/default

				version:

					@echo $(MESA_VERSION)

				major:

					@echo $(MESA_MAJOR)

				minor:

					@echo $(MESA_MINOR)

				tiny:

					@echo $(MESA_TINY)

									
										2

common.py
									
												View File
												
				@@ -89,7 +89,7 @@ def AddOptions(opts):

					opts.Add(EnumOption('machine', 'use machine-specific assembly code', default_machine,

															 allowed_values=('generic', 'ppc', 'x86', 'x86_64')))

					opts.Add(EnumOption('platform', 'target platform', host_platform,

															 allowed_values=('linux', 'windows', 'darwin', 'cygwin', 'sunos', 'freebsd8', 'haiku')))

															 allowed_values=('cygwin', 'darwin', 'freebsd', 'haiku', 'linux', 'sunos', 'windows')))

					opts.Add(BoolOption('embedded', 'embedded build', 'no'))

					opts.Add('toolchain', 'compiler toolchain', default_toolchain)

					opts.Add(BoolOption('gles', 'EXPERIMENTAL: enable OpenGL ES support', 'no'))

27

configs/aix

View File

@@ -1,27 +0,0 @@
 # Configuration for AIX, dynamic libs
 include $(TOP)/configs/default
 CONFIG_NAME = aix
 # Compiler and flags
 CC = cc
 CXX = xlC
 CFLAGS = -O -DAIXV3 -DPTHREADS
 CXXFLAGS = -O -DAIXV3 -DPTHREADS
 # Misc tools and flags
 MKLIB_OPTIONS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 GL_LIB_DEPS = -lX11 -lXext -lpthread -lm
 GLU_LIB_DEPS = -L$(TOP)/lib -l$(GL_LIB) -lm -lC
 GLW_LIB_DEPS = -L$(TOP)/lib -l$(GL_LIB) -lXm -lXt -lX11
 OSMESA_LIB_DEPS = -L$(TOP)/lib -l$(GL_LIB)

24

configs/aix-64

View File

@@ -1,24 +0,0 @@
 # Configuration for AIX 64-bit, dynamic libs
 include $(TOP)/configs/default
 CONFIG_NAME = aix-64
 # Compiler and flags
 CC = xlc
 CXX = xlC
 CFLAGS = -q64 -qmaxmem=16384 -O -DAIXV3 -DPTHREADS
 CXXFLAGS = -q64 -qmaxmem=16384 -O -DAIXV3 -DPTHREADS
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 GL_LIB_DEPS = -lX11 -lXext -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm -lC
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lXm -lXt -lX11

21

configs/aix-64-static

View File

@@ -1,21 +0,0 @@
 # Configuration for AIX, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = aix-64-static
 # Compiler and flags
 CC = cc
 CXX = xlC
 CFLAGS = -q64 -O -DAIXV3 -DPTHREADS
 CXXFLAGS = -q64 -O -DAIXV3 -DPTHREADS
 MKLIB_OPTIONS = -static
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

21

configs/aix-gcc

View File

@@ -1,21 +0,0 @@
 # Configuration for AIX with gcc
 include $(TOP)/configs/default
 CONFIG_NAME = aix-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O2 -DAIXV3
 CXXFLAGS = -O2 -DAIXV3
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 MKLIB_OPTIONS = -arch aix-gcc
 GL_LIB_DEPS = -lX11 -lXext -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm

20

configs/aix-static

View File

@@ -1,20 +0,0 @@
 # Configuration for AIX, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = aix-static
 # Compiler and flags
 CC = cc
 CXX = xlC
 CFLAGS = -O -DAIXV3 -DPTHREADS
 CXXFLAGS = -O -DAIXV3 -DPTHREADS
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

31

configs/bluegene-osmesa

View File

@@ -1,31 +0,0 @@
 # Configuration for building only libOSMesa on BlueGene, no Xlib driver
 # This doesn't really have a lot of dependencies, so it should be usable
 # on other (gcc-based) systems too.
 # It uses static linking and disables multithreading.
 include $(TOP)/configs/default
 CONFIG_NAME = bluegene-osmesa
 # Compiler and flags
 CC = /bgl/BlueLight/ppcfloor/blrts-gnu/bin/powerpc-bgl-blrts-gnu-gcc
 CXX = /bgl/BlueLight/ppcfloor/blrts-gnu/bin/powerpc-bgl-blrts-gnu-g++
 CFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 CXXFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURC
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 MKLIB_OPTIONS = -static
 OSMESA_LIB_NAME = libOSMesa.a
 # Directories
 SRC_DIRS = mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

27

configs/bluegene-xlc-osmesa

View File

@@ -1,27 +0,0 @@
 # Configuration for building only libOSMesa on BlueGene using the IBM xlc compiler
 # This doesn't really have a lot of dependencies, so it should be usable
 # on similar systems too.
 # It uses static linking and disables multithreading.
 include $(TOP)/configs/default
 CONFIG_NAME = bluegene-osmesa
 # Compiler and flags
 CC = /opt/ibmcmp/vacpp/bg/8.0/bin/blrts_xlc
 CXX = /opt/ibmcmp/vacpp/bg/8.0/bin/blrts_xlC
 CFLAGS = -O3 -pedantic -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 CXXFLAGS = -O3 -pedantic -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 MKLIB_OPTIONS = -static
 OSMESA_LIB_NAME = libOSMesa.a
 # Directories
 SRC_DIRS = mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

30

configs/catamount-osmesa-pgi

View File

@@ -1,30 +0,0 @@
 # Configuration for building only libOSMesa on Cray Xt3
 # for the compute nodes running Catamount using the
 # Portland Group compiler. The Portland Group toolchain has to be
 # enabled before using "module switch PrgEnv-gnu PrgEnv-pgi" .
 # This doesn't really have a lot of dependencies, so it should be usable
 # on other similar systems too.
 # It uses static linking and disables multithreading.
 include $(TOP)/configs/default
 CONFIG_NAME = catamount-osmesa-pgi
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -target=catamount -fastsse -O3 -Mnontemporal -Mprefetch=distance:8,nta   -fPIC -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 CXXFLAGS = -target=catamount -fastsse -O3 -Mnontemporal -Mprefetch=distance:8,nta -fPIC -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 MKLIB_OPTIONS = -static
 OSMESA_LIB_NAME = libOSMesa.a
 # Directories
 SRC_DIRS = mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

22

configs/autoconf.in → configs/current.in

View File

@@ -11,13 +11,12 @@ CC = @CC@
 CXX = @CXX@
 OPT_FLAGS = @OPT_FLAGS@
 ARCH_FLAGS = @ARCH_FLAGS@
 ASM_FLAGS = @ASM_FLAGS@
 PIC_FLAGS = @PIC_FLAGS@
 DEFINES = @DEFINES@
 API_DEFINES = @API_DEFINES@
 SHARED_GLAPI = @SHARED_GLAPI@
 CFLAGS_NOVISIBILITY = @CPPFLAGS@ @CFLAGS@ \
 	$(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(ASM_FLAGS) $(DEFINES)
 	$(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 CXXFLAGS_NOVISIBILITY = @CPPFLAGS@ @CXXFLAGS@ \
 	$(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 CFLAGS = $(CFLAGS_NOVISIBILITY) @VISIBILITY_CFLAGS@
@@ -34,21 +33,20 @@ X11_LIBS = @X11_LIBS@
 X11_CFLAGS = @X11_CFLAGS@
 LLVM_BINDIR = @LLVM_BINDIR@
 LLVM_CFLAGS = @LLVM_CFLAGS@
 LLVM_CPPFLAGS = @LLVM_CPPFLAGS@
 LLVM_CXXFLAGS = @LLVM_CXXFLAGS@
 LLVM_LDFLAGS = @LLVM_LDFLAGS@
 LLVM_LIBDIR = @LLVM_LIBDIR@
 LLVM_LIBS = @LLVM_LIBS@
 LLVM_INCLUDEDIR = @LLVM_INCLUDEDIR@
 GLW_CFLAGS = @GLW_CFLAGS@
 GLX_TLS = @GLX_TLS@
 DRI_CFLAGS = @DRI_CFLAGS@
 DRI_CXXFLAGS = @DRI_CXXFLAGS@
 # dlopen
 DLOPEN_LIBS = @DLOPEN_LIBS@
 # Source selection
 MESA_ASM_SOURCES = @MESA_ASM_SOURCES@
 GLAPI_ASM_SOURCES = @GLAPI_ASM_SOURCES@
 MESA_ASM_FILES = @MESA_ASM_FILES@
 # Misc tools and flags
 MAKE = @MAKE@
@@ -64,6 +62,10 @@ NM = @NM@
 # Perl
 PERL = @PERL@
 # Indent (used for generating dispatch tables)
 INDENT = @INDENT@
 INDENT_FLAGS = @INDENT_FLAGS@
 # Python and flags (generally only needed by the developers)
 PYTHON2 = @PYTHON2@
 PYTHON_FLAGS = -t -O -O
@@ -119,9 +121,6 @@ GALLIUM_DRIVERS = $(foreach DIR,$(GALLIUM_DRIVERS_DIRS),$(TOP)/src/gallium/drive
 # Driver specific build vars
 DRI_DIRS = @DRI_DIRS@
 DRICORE_GLSL_LIBS = @DRICORE_GLSL_LIBS@
 DRICORE_LIBS = @DRICORE_LIBS@
 DRICORE_LIB_DEPS = @DRICORE_LIB_DEPS@
 EGL_PLATFORMS = @EGL_PLATFORMS@
 EGL_CLIENT_APIS = @EGL_CLIENT_APIS@
@@ -147,8 +146,8 @@ VG_LIB_DEPS = $(EXTRA_LIB_PATH) @VG_LIB_DEPS@
 GLAPI_LIB_DEPS = $(EXTRA_LIB_PATH) @GLAPI_LIB_DEPS@
 # DRI dependencies
 MESA_MODULES = @MESA_MODULES@
 DRI_LIB_DEPS = $(EXTRA_LIB_PATH) @DRI_LIB_DEPS@
 GALLIUM_DRI_LIB_DEPS = $(EXTRA_LIB_PATH) @GALLIUM_DRI_LIB_DEPS@
 LIBDRM_CFLAGS = @LIBDRM_CFLAGS@
 LIBDRM_LIB = @LIBDRM_LIBS@
 DRI2PROTO_CFLAGS = @DRI2PROTO_CFLAGS@
@@ -187,6 +186,9 @@ VA_LIB_INSTALL_DIR=@VA_LIB_INSTALL_DIR@
 # Xorg driver install directory (for xorg state-tracker)
 XORG_DRIVER_INSTALL_DIR = @XORG_DRIVER_INSTALL_DIR@
 # Path to OpenCL C library libclc
 LIBCLC_PATH = @LIBCLC_PATH@
 # pkg-config substitutions
 GL_PC_REQ_PRIV = @GL_PC_REQ_PRIV@
 GL_PC_LIB_PRIV = @GL_PC_LIB_PRIV@

61

configs/darwin

View File

@@ -1,61 +0,0 @@
 # Configuration for Darwin / MacOS X, making dynamic libs
 include $(TOP)/configs/default
 CONFIG_NAME = darwin
 INSTALL_DIR = /usr/X11
 X11_DIR = $(INSTALL_DIR)
 # Compiler and flags
 CC = $(shell xcrun -find cc)
 CXX = $(shell xcrun -find c++)
 PIC_FLAGS = -fPIC
 DEFINES =  -D_DARWIN_C_SOURCE -DPTHREADS -D_GNU_SOURCE \
 	   -DGLX_ALIAS_UNSUPPORTED \
 	   -DGLX_DIRECT_RENDERING -DGLX_USE_APPLEGL
 # -DGLX_INDIRECT_RENDERING \
 # -D_GNU_SOURCE          - for src/mesa/main ...
 # -DGLX_DIRECT_RENDERING - pulls in libdrm stuff in glx
 # -DGLX_USE_APPLEGL      - supposed to be used with GLX_DIRECT_RENDERING to use AGL rather than DRM, but doesn't compile
 # -DIN_DRI_DRIVER
 ARCH_FLAGS += $(RC_CFLAGS)
 INCLUDE_FLAGS = -I$(INSTALL_DIR)/include -I$(X11_DIR)/include
 OPT_FLAGS = -g3 -gdwarf-2 -Os -ffast-math -fno-strict-aliasing
 WARN_FLAGS = -Wall -Wmissing-prototypes
 CFLAGS = -std=c99 -fvisibility=hidden \
 	$(OPT_FLAGS) $(WARN_FLAGS) $(INCLUDE_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(ASM_FLAGS) $(DEFINES) $(EXTRA_CFLAGS)
 CXXFLAGS = -fvisibility=hidden \
 	$(OPT_FLAGS) $(WARN_FLAGS) $(INCLUDE_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(ASM_FLAGS) $(DEFINES) $(EXTRA_CFLAGS)
 # Library names (actual file names)
 GL_LIB_NAME = lib$(GL_LIB).dylib
 GLU_LIB_NAME = lib$(GLU_LIB).dylib
 GLW_LIB_NAME = lib$(GLW_LIB).dylib
 OSMESA_LIB_NAME = lib$(OSMESA_LIB).dylib
 VG_LIB_NAME = lib$(VG_LIB).dylib
 # globs used to install the lib and all symlinks
 GL_LIB_GLOB = lib$(GL_LIB).*dylib
 GLU_LIB_GLOB = lib$(GLU_LIB).*dylib
 GLW_LIB_GLOB = lib$(GLW_LIB).*dylib
 OSMESA_LIB_GLOB = lib$(OSMESA_LIB).*dylib
 VG_LIB_GLOB = lib$(VG_LIB).*dylib
 GL_LIB_DEPS = -L$(INSTALL_DIR)/$(LIB_DIR) -L$(X11_DIR)/$(LIB_DIR) -lX11-xcb -lxcb -lX11 -lXext $(EXTRA_LDFLAGS)
 OSMESA_LIB_DEPS = $(EXTRA_LDFLAGS)
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) $(EXTRA_LDFLAGS)
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -L$(INSTALL_DIR)/$(LIB_DIR) -L$(X11_DIR)/$(LIB_DIR) -lX11 -lXt $(EXTRA_LDFLAGS)
 SRC_DIRS = glsl mapi/glapi mapi/vgapi glx/apple mesa glu
 GLU_DIRS = sgi
 DRIVER_DIRS = osmesa
 #DRIVER_DIRS = dri
 DRI_DIRS = swrast
 #GALLIUM_DRIVERS_DIRS = softpipe trace rbug noop identity galahad
 #GALLIUM_DRIVERS_DIRS += llvmpipe

7

configs/darwin-fat-32bit

View File

@@ -1,7 +0,0 @@
 # Configuration for Darwin / MacOS X, making 32bit fat dynamic libs
 RC_CFLAGS=-arch ppc -arch i386
 include $(TOP)/configs/darwin
 CONFIG_NAME = darwin-fat-32bit

7

configs/darwin-fat-all

View File

@@ -1,7 +0,0 @@
 # Configuration for Darwin / MacOS X, making 32bit and 64bit fat dynamic libs
 RC_CFLAGS=-arch ppc -arch i386 -arch ppc64 -arch x86_64
 include $(TOP)/configs/darwin
 CONFIG_NAME = darwin-fat-all

7

configs/darwin-fat-intel

View File

@@ -1,7 +0,0 @@
 # Configuration for Darwin / MacOS X, making 32bit and 64bit fat dynamic libs for intel
 RC_CFLAGS=-arch i386 -arch x86_64
 include $(TOP)/configs/darwin
 CONFIG_NAME = darwin-fat-intel

20

configs/default

View File

@@ -19,11 +19,9 @@ DRM_SOURCE_PATH=$(TOP)/../drm
 # Compiler and flags
 CC = cc
 CXX = CC
 HOST_CC = $(CC)
 CFLAGS = -O
 CXXFLAGS = -O
 LDFLAGS =
 HOST_CFLAGS = $(CFLAGS)
 GLU_CFLAGS =
 GLX_TLS = no
@@ -85,11 +83,8 @@ GLESv2_LIB_GLOB = $(GLESv2_LIB_NAME)*
 VG_LIB_GLOB = $(VG_LIB_NAME)*
 GLAPI_LIB_GLOB = $(GLAPI_LIB_NAME)*
 DRI_CFLAGS = $(CFLAGS)
 DRI_CXXFLAGS = $(CXXFLAGS)
 # Optional assembly language optimization files for libGL
 MESA_ASM_SOURCES =
 MESA_ASM_FILES =
 # GLw widget sources (Append "GLwMDrawA.c" here and add -lXm to GLW_LIB_DEPS in
 # order to build the Motif widget too)
@@ -172,3 +167,16 @@ GLESv2_PC_CFLAGS =
 VG_PC_REQ_PRIV =
 VG_PC_LIB_PRIV =
 VG_PC_CFLAGS =
 # default targets
 # this helps reduce the mismatch between our automake Makefiles and the old
 # custom Makefiles while we transition.
 all: default
 am--refresh:
 distclean: clean
 check:
 test:

29

configs/freebsd

View File

@@ -1,29 +0,0 @@
 # Configuration for FreeBSD
 include $(TOP)/configs/default
 CONFIG_NAME = FreeBSD
 # Compiler and flags
 CC = cc
 CXX = c++
 MAKE = gmake
 OPT_FLAGS  = -O2
 PIC_FLAGS  = -fPIC
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_BSD_SOURCE -DUSE_XSHM \
 	-DHZ=100
 X11_INCLUDES = -I/usr/local/include
 CFLAGS += $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(DEFINES) $(X11_INCLUDES) -ffast-math -pedantic
 CXXFLAGS += $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(DEFINES) $(X11_INCLUDES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 EXTRA_LIB_PATH = -L/usr/local/lib

48

configs/freebsd-dri

View File

@@ -1,48 +0,0 @@
 # -*-makefile-*-
 # Configuration for freebsd-dri: FreeBSD DRI hardware drivers
 include $(TOP)/configs/freebsd
 CONFIG_NAME = freebsd-dri
 # Compiler and flags
 CC = gcc
 CXX = g++
 WARN_FLAGS = -Wall
 OPT_FLAGS = -O -g
 EXPAT_INCLUDES = -I/usr/local/include
 X11_INCLUDES = -I/usr/local/include
 DEFINES = -DPTHREADS -DUSE_EXTERNAL_DXTN_LIB=1 -DIN_DRI_DRIVER \
 	-DGLX_DIRECT_RENDERING -DGLX_INDIRECT_RENDERING \
 	-DHAVE_ALIAS
 CFLAGS = $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) -Wmissing-prototypes -std=c99 -Wundef -ffast-math \
 	$(ASM_FLAGS) $(X11_INCLUDES) $(DEFINES)
 CXXFLAGS = $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(DEFINES) -Wall -ansi -pedantic $(ASM_FLAGS) $(X11_INCLUDES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 ASM_SOURCES =
 MESA_ASM_SOURCES =
 # Library/program dependencies
 MESA_MODULES  = $(TOP)/src/mesa/libmesa.a
 LIBDRM_CFLAGS = `$(PKG_CONFIG) --cflags libdrm`
 LIBDRM_LIB = `$(PKG_CONFIG) --libs libdrm`
 DRI_LIB_DEPS = $(MESA_MODULES) -L/usr/local/lib -lm -pthread -lexpat $(LIBDRM_LIB)
 GL_LIB_DEPS = -L/usr/local/lib -lX11 -lXext -lXxf86vm -lXdamage -lXfixes \
 	-lm -pthread $(LIBDRM_LIB)
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -L/usr/local/lib -lGL -lXt -lX11
 # Directories
 SRC_DIRS = glx gallium mesa glu
 DRIVER_DIRS = dri
 DRM_SOURCE_PATH=$(TOP)/../drm

10

configs/freebsd-dri-amd64

View File

@@ -1,10 +0,0 @@
 # -*-makefile-*-
 # Configuration for freebsd-dri-amd64: FreeBSD DRI hardware drivers
 include $(TOP)/configs/freebsd-dri
 CONFIG_NAME = freebsd-dri-x86-64
 ASM_FLAGS = -DUSE_X86_64_ASM
 MESA_ASM_SOURCES = $(X86-64_SOURCES)
 GLAPI_ASM_SOURCES = $(X86-64_API)

13

configs/freebsd-dri-x86

View File

@@ -1,13 +0,0 @@
 # -*-makefile-*-
 # Configuration for freebsd-dri: FreeBSD DRI hardware drivers
 include $(TOP)/configs/freebsd-dri
 CONFIG_NAME = freebsd-dri-x86
 # Unnecessary on x86, generally.
 PIC_FLAGS =
 ASM_FLAGS = -DUSE_X86_ASM -DUSE_MMX_ASM -DUSE_3DNOW_ASM -DUSE_SSE_ASM
 MESA_ASM_SOURCES = $(X86_SOURCES)
 GLAPI_ASM_SOURCES = $(X86_API)

13

configs/hpux10

View File

@@ -1,13 +0,0 @@
 # Configuration for HPUX v10, shared libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux10
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DAportable +z -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM
 CXXFLAGS = -O +DAportable +Z -Ae -D_HPUX_SOURCE

18

configs/hpux10-gcc

View File

@@ -1,18 +0,0 @@
 # Configuration for HPUX v10, with gcc
 include $(TOP)/configs/default
 CONFIG_NAME = hpux10-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -ansi -O3 -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include  -DUSE_XSHM
 CXXFLAGS = -ansi -O3 -D_HPUX_SOURCE
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing

26

configs/hpux10-static

View File

@@ -1,26 +0,0 @@
 # Configuration for HPUX v10, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux10-static
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DAportable +z -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM
 CXXFLAGS = -O +DAportable +Z -Ae -D_HPUX_SOURCE
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies (static libs don't have dependencies)
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

27

configs/hpux11-32

View File

@@ -1,27 +0,0 @@
 # Configuration for HPUX v11
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-32
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = +z -Ae -O +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = +z -Ae -O +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies
 GL_LIB_DEPS = -L/usr/lib/X11R6/ -L/usr/contrib/X11R6/lib/ -lXext -lXt -lXi -lX11 -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm -lCsup -lcl
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) $(GL_LIB_DEPS)

25

configs/hpux11-32-static

View File

@@ -1,25 +0,0 @@
 # Configuration for HPUX v11, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-32-static
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DA2.0 -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -O +DA2.0 -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies

24

configs/hpux11-32-static-nothreads

View File

@@ -1,24 +0,0 @@
 # Configuration for HPUX v11, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-32-static
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DA2.0 -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM
 CXXFLAGS = -O +DA2.0 -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies

28

configs/hpux11-64

View File

@@ -1,28 +0,0 @@
 # Configuration for HPUX v11, 64-bit
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-64
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = +z -Ae +DD64 -O +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = +z -Ae +DD64 -O +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS =
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies
 GL_LIB_DEPS = -L/usr/lib/X11R6/pa20_64 -L/usr/contrib/X11R6/lib/pa20_64 -lXext -lXmu -lXt -lXi -lX11 -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm -lCsup -lcl
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) $(GL_LIB_DEPS)

25

configs/hpux11-64-static

View File

@@ -1,25 +0,0 @@
 # Configuration for HPUX v11, 64-bit, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-64-static
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DA2.0W -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -O +DA2.0W -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS = -static
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies

28

configs/hpux11-ia64

View File

@@ -1,28 +0,0 @@
 # Configuration for HPUX IA64 v11, 64-bit
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-ia64
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = +z -Ae +DD64 -O +DSmckinley -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = +z -Ae +DD64 -O +DSmckinley -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS =
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.so
 GLU_LIB_NAME = libGLU.so
 GLW_LIB_NAME = libGLw.so
 OSMESA_LIB_NAME = libOSMesa.so
 # Library/program dependencies
 GL_LIB_DEPS = -L/usr/lib/X11R6/ -L/usr/contrib/X11R6/lib/ -lXext -lXmu -lXt -lXi -lX11 -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm -lCsup -lcl
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) $(GL_LIB_DEPS)

25

configs/hpux11-ia64-static

View File

@@ -1,25 +0,0 @@
 # Configuration for HPUX v11, 64-bit, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux11-ia64-static
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DD64 -Ae -D_HPUX_SOURCE +DSmckinley -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -O +DD64 -Ae -D_HPUX_SOURCE +DSmckinley -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DPTHREADS
 MKLIB_OPTIONS = -static
 LIB_DIR = lib64
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies

15

configs/hpux9

View File

@@ -1,15 +0,0 @@
 # Configuration for HPUX v9, shared libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux9
 # Compiler and flags
 CC = cc
 # XXX fix this
 CXX = c++
 CFLAGS = +z -O +Olibcalls +ESlit -Ae +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R5 -DUSE_XSHM
 CXXFLAGS = +z -O +Olibcalls +ESlit -Ae +Onolimit -D_HPUX_SOURCE -I/usr/include/X11R5

13

configs/hpux9-gcc

View File

@@ -1,13 +0,0 @@
 # Configuration for HPUX v10, shared libs
 include $(TOP)/configs/default
 CONFIG_NAME = hpux9-gcc
 # Compiler and flags
 CC = cc
 CXX = aCC
 CFLAGS = -O +DAportable +z -Ae -D_HPUX_SOURCE -I/usr/include/X11R6 -I/usr/contrib/X11R6/include -DUSE_XSHM
 CXXFLAGS = -O +DAportable +Z -Ae -D_HPUX_SOURCE

16

configs/irix6-64

View File

@@ -1,16 +0,0 @@
 # Configuration for IRIX 6.x, make n64 DSOs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-64
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -64 -O3 -ansi -woff 1068,1069,1174,1185,1209,1474,1552 -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -64 -O3 -ansi -woff 1174 -DPTHREADS
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib64

24

configs/irix6-64-static

View File

@@ -1,24 +0,0 @@
 # Configuration for IRIX 6.x, make n64 static libs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-64-static
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -64 -O3 -ansi -woff 1068,1069,1174,1185,1209,1474,1552 -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -64 -O3 -ansi -woff 1174 -DPTHREADS
 MKLIB_OPTIONS = -static
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib64
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

16

configs/irix6-n32

View File

@@ -1,16 +0,0 @@
 # Configuration for IRIX 6.x, make n32 DSOs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-n32
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -n32 -mips3 -O3 -ansi -woff 1174,1521,1552 -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -n32 -mips3 -O3 -ansi -woff 1174,1552 -DPTHREADS
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib32

23

configs/irix6-n32-static

View File

@@ -1,23 +0,0 @@
 # Configuration for IRIX 6.x, make n32 static libs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-n32-static
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -n32 -mips2 -O2 -ansi -woff 1521,1552 -DUSE_XSHM -DPTHREADS
 CXXFLAGS = -n32 -mips2 -O2 -ansi -woff 3262,3666 -DPTHREADS
 MKLIB_OPTIONS = -static
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib32
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

17

configs/irix6-o32

View File

@@ -1,17 +0,0 @@
 # Configuration for IRIX 6.x, make o32 DSOs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-o32
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -32 -mips2 -O2 -ansi -woff 1521,1552 -DUSE_XSHM
 CXXFLAGS = -32 -mips2 -O2 -ansi -woff 3262,3666
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib32

23

configs/irix6-o32-static

View File

@@ -1,23 +0,0 @@
 # Configuration for IRIX 6.x, make o32 static libs
 include $(TOP)/configs/default
 CONFIG_NAME = irix6-o32-static
 # Compiler and flags
 CC = cc
 CXX = CC
 CFLAGS = -32 -mips2 -O2 -ansi -woff 1521,1552 -DUSE_XSHM
 CXXFLAGS = -32 -mips2 -O2 -ansi -woff 3262,3666
 MKLIB_OPTIONS = -static
 GLW_SOURCES = GLwDrawA.c GLwMDrawA.c
 LIB_DIR = lib32
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

37

configs/linux

View File

@@ -1,37 +0,0 @@
 # Configuration for generic Linux
 include $(TOP)/configs/default
 CONFIG_NAME = linux
 # Compiler and flags
 CC = gcc
 CXX = g++
 OPT_FLAGS  = -O3 -g
 PIC_FLAGS  = -fPIC
 # Add '-DGLX_USE_TLS' to ARCH_FLAGS to enable TLS support.  Add -m32
 # to build properly on 64-bit platforms.
 ARCH_FLAGS ?=
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE \
 	-D_BSD_SOURCE -D_GNU_SOURCE \
 	-DPTHREADS -DUSE_XSHM -DHAVE_POSIX_MEMALIGN
 X11_INCLUDES = -I/usr/X11R6/include
 CFLAGS = -Wall -Wmissing-prototypes -Wdeclaration-after-statement \
 	-Wpointer-arith $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) \
 	$(DEFINES) $(ASM_FLAGS) $(X11_INCLUDES) -std=c99 -ffast-math
 CXXFLAGS = -Wall -Wpointer-arith $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) \
 	$(DEFINES) $(X11_INCLUDES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 EXTRA_LIB_PATH = -L/usr/X11R6/lib

19

configs/linux-alpha

View File

@@ -1,19 +0,0 @@
 # Configuration for Linux on Alpha
 include $(TOP)/configs/default
 CONFIG_NAME = linux-alpha
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -mcpu=ev5 -ansi -mieee -pedantic -fPIC -D_XOPEN_SOURCE -DUSE_XSHM
 CXXFLAGS = -O3 -mcpu=ev5 -ansi -mieee -pedantic -fPIC -D_XOPEN_SOURCE
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 GL_LIB_DEPS = -L/usr/X11R6/lib -lX11 -lXext -lm -lpthread
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -L/usr/X11R6/lib -lXt -lX11

27

configs/linux-alpha-static

View File

@@ -1,27 +0,0 @@
 # Configuration for Linux on Alpha, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = linux-alpha-static
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -mcpu=ev5 -ansi -mieee -pedantic -D_XOPEN_SOURCE -DUSE_XSHM
 CXXFLAGS = -O3 -mcpu=ev5 -ansi -mieee -pedantic -D_XOPEN_SOURCE
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 GL_LIB_DEPS = -L/usr/X11R6/lib -lX11 -lXext -lm -lpthread
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -L/usr/X11R6/lib -lXt -lX11

9

configs/linux-debug

View File

@@ -1,9 +0,0 @@
 # Configuration for debugging on Linux
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-debug
 OPT_FLAGS = -g
 #CFLAGS += -pedantic
 DEFINES += -DDEBUG -DDEBUG_MATH

72

configs/linux-dri

View File

@@ -1,72 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/default
 CONFIG_NAME = linux-dri
 # Compiler and flags
 CC = gcc
 CXX = g++
 #MKDEP = /usr/X11R6/bin/makedepend
 #MKDEP = gcc -M
 #MKDEP_OPTIONS = -MF depend
 OPT_FLAGS  = -O2 -g
 PIC_FLAGS  = -fPIC
 # Add '-DGLX_USE_TLS' to ARCH_FLAGS to enable TLS support.
 ARCH_FLAGS ?=
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE \
 	-D_BSD_SOURCE -D_GNU_SOURCE \
 	-DPTHREADS -DUSE_EXTERNAL_DXTN_LIB=1 -DIN_DRI_DRIVER \
 	-DGLX_DIRECT_RENDERING -DGLX_INDIRECT_RENDERING \
 	-DHAVE_ALIAS -DHAVE_POSIX_MEMALIGN
 X11_INCLUDES = -I/usr/X11R6/include
 CFLAGS = -Wall -Wmissing-prototypes -std=c99 -ffast-math \
 	$(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES) $(ASM_FLAGS)
 CXXFLAGS = -Wall $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 MESA_ASM_SOURCES =
 # Library/program dependencies
 EXTRA_LIB_PATH=-L/usr/X11R6/lib
 MESA_MODULES  = $(TOP)/src/mesa/libmesa.a
 LIBDRM_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm)
 LIBDRM_LIB = $(shell $(PKG_CONFIG) --libs libdrm)
 DRI_LIB_DEPS  = $(MESA_MODULES) $(EXTRA_LIB_PATH) -lm -lpthread -lexpat -ldl $(LIBDRM_LIB)
 GL_LIB_DEPS   = $(EXTRA_LIB_PATH) -lX11 -lXext -lXxf86vm -lXdamage -lXfixes \
 		-lm -lpthread -ldl $(LIBDRM_LIB)
 # Directories
 SRC_DIRS := glx egl $(SRC_DIRS)
 DRIVER_DIRS = dri
 GALLIUM_WINSYS_DIRS = sw sw/xlib drm/vmware drm/intel svga/drm
 GALLIUM_TARGET_DIRS = dri-vmwgfx
 GALLIUM_STATE_TRACKERS_DIRS = egl dri
 DRI_DIRS = swrast
 INTEL_LIBS = $(shell $(PKG_CONFIG) --libs libdrm_intel)
 INTEL_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm_intel)
 NOUVEAU_LIBS = $(shell $(PKG_CONFIG) --libs libdrm_nouveau)
 NOUVEAU_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm_nouveau)
 RADEON_LIBS = $(shell $(PKG_CONFIG) --libs libdrm_radeon)
 RADEON_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm_radeon)
 RADEON_LDFLAGS = $(LIBDRM_RADEON_LIBS)

8

configs/linux-dri-debug

View File

@@ -1,8 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri-debug: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/linux-dri
 CONFIG_NAME = linux-dri-debug
 OPT_FLAGS  = -O0 -g
 ARCH_FLAGS = -DDEBUG

9

configs/linux-dri-ppc

View File

@@ -1,9 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/linux-dri
 CONFIG_NAME = linux-dri-ppc
 OPT_FLAGS = -Os -mcpu=603
 PIC_FLAGS = -fPIC

13

configs/linux-dri-x86

View File

@@ -1,13 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/linux-dri
 CONFIG_NAME = linux-dri-x86
 ARCH_FLAGS = -m32 -mmmx -msse -msse2
 ASM_FLAGS = -DUSE_X86_ASM -DUSE_MMX_ASM -DUSE_3DNOW_ASM -DUSE_SSE_ASM
 MESA_ASM_SOURCES = $(X86_SOURCES)
 GLAPI_ASM_SOURCES = $(X86_API)

17

configs/linux-dri-x86-64

View File

@@ -1,17 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/linux-dri
 CONFIG_NAME = linux-dri-x86-64
 ARCH_FLAGS = -m64
 ASM_FLAGS = -DUSE_X86_64_ASM
 MESA_ASM_SOURCES = $(X86-64_SOURCES)
 GLAPI_ASM_SOURCES = $(X86-64_API)
 LIB_DIR = lib64
 # Library/program dependencies
 EXTRA_LIB_PATH=-L/usr/X11R6/lib64

54

configs/linux-dri-xcb

View File

@@ -1,54 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/default
 CONFIG_NAME = linux-dri-xcb
 # Compiler and flags
 CC = gcc
 CXX = g++
 #MKDEP = /usr/X11R6/bin/makedepend
 #MKDEP = gcc -M
 #MKDEP_OPTIONS = -MF depend
 OPT_FLAGS  = -g
 PIC_FLAGS  = -fPIC
 # Add '-DGLX_USE_TLS' to ARCH_FLAGS to enable TLS support.
 ARCH_FLAGS ?=
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE \
 	-D_BSD_SOURCE -D_GNU_SOURCE \
 	-DPTHREADS -DUSE_EXTERNAL_DXTN_LIB=1 -DIN_DRI_DRIVER \
 	-DGLX_DIRECT_RENDERING -DGLX_INDIRECT_RENDERING \
         -DHAVE_ALIAS -DUSE_XCB -DHAVE_POSIX_MEMALIGN
 X11_INCLUDES = $(shell $(PKG_CONFIG) --cflags-only-I x11) $(shell $(PKG_CONFIG) --cflags-only-I xcb) $(shell $(PKG_CONFIG) --cflags-only-I x11-xcb) $(shell $(PKG_CONFIG) --cflags-only-I xcb-glx)
 CFLAGS = -Wall -Wmissing-prototypes $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) \
 	$(DEFINES) $(ASM_FLAGS) -std=c99 -ffast-math
 CXXFLAGS = -Wall $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 MESA_ASM_SOURCES =
 # Library/program dependencies
 EXTRA_LIB_PATH=$(shell $(PKG_CONFIG) --libs-only-L x11)
 MESA_MODULES  = $(TOP)/src/mesa/libmesa.a
 LIBDRM_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm)
 LIBDRM_LIB = $(shell $(PKG_CONFIG) --libs libdrm)
 DRI_LIB_DEPS  = $(MESA_MODULES) $(EXTRA_LIB_PATH) -lm -lpthread -lexpat -ldl $(LIBDRM_LIB)
 GL_LIB_DEPS   = $(EXTRA_LIB_PATH) -lX11 -lXext -lXxf86vm -lm -lpthread -ldl \
                 $(LIBDRM_LIB) $(shell $(PKG_CONFIG) --libs xcb) $(shell $(PKG_CONFIG) --libs x11-xcb) $(shell $(PKG_CONFIG) --libs xcb-glx)
 SRC_DIRS = glx gallium mesa glu
 DRIVER_DIRS = dri

58

configs/linux-egl

View File

@@ -1,58 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-dri: Linux DRI hardware drivers for XFree86 & others
 include $(TOP)/configs/default
 CONFIG_NAME = linux-dri
 # Compiler and flags
 CC = gcc
 CXX = g++
 #MKDEP = /usr/X11R6/bin/makedepend
 #MKDEP = gcc -M
 #MKDEP_OPTIONS = -MF depend
 OPT_FLAGS  = -O -g
 PIC_FLAGS  = -fPIC
 # Add '-DGLX_USE_TLS' to ARCH_FLAGS to enable TLS support.
 ARCH_FLAGS ?=
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE \
 	-D_BSD_SOURCE -D_GNU_SOURCE \
 	-DPTHREADS -DUSE_EXTERNAL_DXTN_LIB=1 -DIN_DRI_DRIVER \
 	-DGLX_DIRECT_RENDERING -DGLX_INDIRECT_RENDERING \
 	-DHAVE_ALIAS -DHAVE_POSIX_MEMALIGN
 X11_INCLUDES = -I/usr/X11R6/include
 CFLAGS = -Wall -Wmissing-prototypes -std=c99 -ffast-math \
 	$(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES) $(ASM_FLAGS)
 CXXFLAGS = -Wall $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 MESA_ASM_SOURCES =
 # Library/program dependencies
 EXTRA_LIB_PATH=-L/usr/X11R6/lib
 MESA_MODULES  = $(TOP)/src/mesa/libmesa.a
 LIBDRM_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm)
 LIBDRM_LIB = $(shell $(PKG_CONFIG) --libs libdrm)
 DRI_LIB_DEPS  = $(MESA_MODULES) $(EXTRA_LIB_PATH) -lm -lpthread -lexpat -ldl $(LIBDRM_LIB)
 GL_LIB_DEPS   = $(EXTRA_LIB_PATH) -lX11 -lXext -lXxf86vm -lXdamage -lXfixes \
 		-lm -lpthread -ldl \
                 $(LIBDRM_LIB)
 # Directories
 SRC_DIRS = gallium mesa gallium/winsys gallium/targets glu egl
 DRIVER_DIRS = dri
 GALLIUM_WINSYS_DIRS = egl_drm
 GALLIUM_TARGET_DIRS =
 DRI_DIRS = intel

18

configs/linux-ia64-icc

View File

@@ -1,18 +0,0 @@
 # Configuration for Linux with Intel C compiler
 include $(TOP)/configs/default
 CONFIG_NAME = linux-icc
 # Compiler and flags
 CC = icc
 CXX = icpc
 CFLAGS = -O3 -ansi -KPIC -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include
 CXXFLAGS = -O3 -ansi -KPIC -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include
 MKLIB_OPTIONS = -arch icc-istatic
 GL_LIB_DEPS = -L/usr/X11R6/lib -lX11 -lXext -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB)
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) $(GL_LIB_DEPS)

23

configs/linux-ia64-icc-static

View File

@@ -1,23 +0,0 @@
 # Configuration for Linux with Intel C compiler, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = linux-icc-static
 # Compiler and flags
 CC = icc
 CXX = icpc
 CFLAGS = -O3 -ansi -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include
 CXXFLAGS = -O3 -ansi -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include
 MKLIB_OPTIONS = -static -arch icc-istatic
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 GL_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

19

configs/linux-icc

View File

@@ -1,19 +0,0 @@
 # Configuration for Linux with Intel C compiler
 include $(TOP)/configs/default
 CONFIG_NAME = linux-icc
 # Compiler and flags
 CC = icc
 CXX = g++
 CFLAGS = -O3 -tpp6 -axK -KPIC -D_GCC_LIMITS_H_ -D__GNUC__ -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DUSE_X86_ASM -DUSE_MMX_ASM -DUSE_3DNOW_ASM -DUSE_SSE_ASM -DPTHREADS -I/usr/X11R6/include
 CXXFLAGS = -O3
 MKLIB_OPTIONS = -arch icc
 GL_LIB_DEPS = -L/usr/X11R6/lib -lX11 -lXext -lm -lpthread
 MESA_ASM_SOURCES = $(X86_SOURCES)
 GLAPI_ASM_SOURCES = $(X86_API)

23

configs/linux-icc-static

View File

@@ -1,23 +0,0 @@
 # Configuration for Linux with Intel C compiler, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = linux-icc-static
 # Compiler and flags
 CC = icc
 CXX = icpc
 CFLAGS = -O3 -tpp6 -axK -D_GCC_LIMITS_H_ -D__GNUC__ -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DUSE_X86_ASM -DUSE_MMX_ASM -DUSE_3DNOW_ASM -DUSE_SSE_ASM -DPTHREADS -I/usr/X11R6/include
 CXXFLAGS = -O3 -tpp6 -axK -DPTHREADS
 MKLIB_OPTIONS = -static -arch icc
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 GL_LIB_DEPS =
 MESA_ASM_SOURCES = $(X86_SOURCES)
 GLAPI_ASM_SOURCES = $(X86_API)

52

configs/linux-indirect

View File

@@ -1,52 +0,0 @@
 # -*-makefile-*-
 # Configuration for linux-indirect: Builds a libGL capable of indirect
 # rendering, but *NOT* capable of direct rendering.
 include $(TOP)/configs/default
 CONFIG_NAME = linux-dri
 # Compiler and flags
 CC = gcc
 CXX = g++
 #MKDEP = /usr/X11R6/bin/makedepend
 #MKDEP = gcc -M
 #MKDEP_OPTIONS = -MF depend
 WARN_FLAGS = -Wall
 OPT_FLAGS  = -O -g
 PIC_FLAGS  = -fPIC
 # Add '-DGLX_USE_TLS' to ARCH_FLAGS to enable TLS support.
 ARCH_FLAGS ?=
 DEFINES = -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE \
 	-D_BSD_SOURCE -D_GNU_SOURCE \
 	-DGLX_INDIRECT_RENDERING \
 	-DPTHREADS -DHAVE_ALIAS -DHAVE_POSIX_MEMALIGN
 X11_INCLUDES = -I/usr/X11R6/include
 CFLAGS   = $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES) \
 	$(ASM_FLAGS) -std=c99 -ffast-math
 CXXFLAGS = $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES)
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 MESA_ASM_SOURCES =
 # Library/program dependencies
 EXTRA_LIB_PATH=-L/usr/X11R6/lib
 MESA_MODULES  = $(TOP)/src/mesa/libmesa.a
 DRI_LIB_DEPS  = $(MESA_MODULES) $(EXTRA_LIB_PATH) -lm -lpthread -lexpat -ldl
 GL_LIB_DEPS   = $(EXTRA_LIB_PATH) -lX11 -lXext -lXxf86vm -lm -lpthread -ldl
 # Directories
 SRC_DIRS = glx glu
 DRIVER_DIRS =

47

configs/linux-llvm

View File

@@ -1,47 +0,0 @@
 # -*-makefile-*-
 # Configuration for Linux and LLVM with optimizations
 # Builds the llvmpipe gallium driver
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-llvm
 # Add llvmpipe driver
 GALLIUM_DRIVERS_DIRS += llvmpipe
 OPT_FLAGS = -O3 -ansi -pedantic
 ARCH_FLAGS = -mmmx -msse -msse2 -mstackrealign
 DEFINES += -DNDEBUG -DGALLIUM_LLVMPIPE
 # override -std=c99
 CFLAGS += -std=gnu99
 LLVM_VERSION := $(shell llvm-config --version)
 ifeq ($(LLVM_VERSION),)
   $(warning Could not find LLVM! Make Sure 'llvm-config' is in the path)
   MESA_LLVM=0
 else
   MESA_LLVM=1
   HAVE_LLVM := 0x0$(subst .,0,$(LLVM_VERSION:svn=))
   DEFINES += -DHAVE_LLVM=$(HAVE_LLVM)
 #  $(info Using LLVM version: $(LLVM_VERSION))
 endif
 ifeq ($(MESA_LLVM),1)
   LLVM_CFLAGS=`llvm-config --cppflags|sed 's/-DNDEBUG\>//g'`
   LLVM_CXXFLAGS=`llvm-config --cxxflags` -Wno-long-long
   LLVM_LDFLAGS = $(shell llvm-config --ldflags)
   LLVM_LIBS = $(shell llvm-config --libs)
   MKLIB_OPTIONS=-cplusplus
 else
   LLVM_CFLAGS=
   LLVM_CXXFLAGS=
 endif
 LD = g++
 GL_LIB_DEPS = $(LLVM_LDFLAGS) $(LLVM_LIBS) $(EXTRA_LIB_PATH) -lX11 -lXext -lm -lpthread -lstdc++
 # to allow the NV drivers to compile
 LIBDRM_CFLAGS = $(shell $(PKG_CONFIG) --cflags libdrm)

12

configs/linux-llvm-debug

View File

@@ -1,12 +0,0 @@
 # -*-makefile-*-
 # Configuration for Linux and LLVM with debugging info
 # Builds the llvmpipe gallium driver
 include $(TOP)/configs/linux-llvm
 CONFIG_NAME = linux-llvm-debug
 OPT_FLAGS = -g -ansi -pedantic
 DEFINES += -DDEBUG -UNDEBUG

27

configs/linux-opengl-es

View File

@@ -1,27 +0,0 @@
 # Configuration for OpenGL ES on Linux
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-opengl-es
 # Directories to build
 LIB_DIR = lib
 SRC_DIRS = egl glsl mapi/es1api mapi/es2api mesa/es \
 	gallium gallium/winsys gallium/targets
 # egl st needs this
 DEFINES += -DGLX_DIRECT_RENDERING
 # no mesa or egl drivers
 DRIVER_DIRS =
 GALLIUM_DRIVERS_DIRS = softpipe
 # build libGLES*.so
 GALLIUM_STATE_TRACKERS_DIRS = es
 # build egl_x11_{swrast,i915}.so
 GALLIUM_DRIVERS_DIRS += trace rbug i915
 GALLIUM_STATE_TRACKERS_DIRS += egl
 GALLIUM_WINSYS_DIRS += drm/intel
 GALLIUM_TARGET_DIRS += egl-swrast egl-i915

26

configs/linux-osmesa

View File

@@ -1,26 +0,0 @@
 # Configuration for building only libOSMesa on Linux, no Xlib driver
 # This doesn't really have any Linux dependencies, so it should be usable
 # on other (gcc-based) systems.
 include $(TOP)/configs/default
 CONFIG_NAME = linux-osmesa
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -g -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -D_GNU_SOURCE -DPTHREADS
 CXXFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Directories
 SRC_DIRS = mapi/glapi glsl mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm -lpthread -ldl
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

32

configs/linux-osmesa-static

View File

@@ -1,32 +0,0 @@
 # Configuration for building static libOSMesa.a on Linux, no Xlib driver
 # This doesn't really have any Linux dependencies, so it should be usable
 # on other (gcc-based) systems.
 include $(TOP)/configs/default
 CONFIG_NAME = linux-osmesa
 # Compiler and flags
 CC = gcc -m32
 CXX = g++ -m32
 CFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DPTHREADS
 CXXFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Directories
 SRC_DIRS = mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

29

configs/linux-osmesa16

View File

@@ -1,29 +0,0 @@
 # Configuration for 16 bits/channel OSMesa library on Linux
 include $(TOP)/configs/default
 CONFIG_NAME = linux-osmesa16
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include -DCHAN_BITS=16 -DDEFAULT_SOFTWARE_DEPTH_BITS=31
 CXXFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Library names
 OSMESA_LIB = OSMesa16
 OSMESA_LIB_NAME = libOSMesa16.so
 # Directories
 SRC_DIRS = mapi/glapi glsl mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

30

configs/linux-osmesa16-static

View File

@@ -1,30 +0,0 @@
 # Configuration for 16 bits/channel OSMesa library on Linux
 include $(TOP)/configs/default
 CONFIG_NAME = linux-osmesa16-static
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -ansi -pedantic -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include -DCHAN_BITS=16 -DDEFAULT_SOFTWARE_DEPTH_BITS=31
 CXXFLAGS = -O3 -ansi -pedantic -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Library names
 OSMESA_LIB = OSMesa16
 OSMESA_LIB_NAME = libOSMesa16.a
 # Directories
 SRC_DIRS = gallium mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm -lpthread

29

configs/linux-osmesa32

View File

@@ -1,29 +0,0 @@
 # Configuration for 32 bits/channel OSMesa library on Linux
 include $(TOP)/configs/default
 CONFIG_NAME = linux-osmesa32
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE -DUSE_XSHM -DPTHREADS -I/usr/X11R6/include -DCHAN_BITS=32 -DDEFAULT_SOFTWARE_DEPTH_BITS=31
 CXXFLAGS = -O3 -ansi -pedantic -fPIC -ffast-math -D_POSIX_SOURCE -D_POSIX_C_SOURCE=199309L -D_SVID_SOURCE -D_BSD_SOURCE
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Library names
 OSMESA_LIB = OSMesa32
 OSMESA_LIB_NAME = libOSMesa32.so
 # Directories
 SRC_DIRS = mapi/glapi glsl mesa glu
 DRIVER_DIRS = osmesa
 # Dependencies
 OSMESA_LIB_DEPS = -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(OSMESA_LIB)

9

configs/linux-ppc

View File

@@ -1,9 +0,0 @@
 # Configuration for Linux on PPC
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-ppc
 OPT_FLAGS = -O3 -mcpu=603 -fsigned-char -funroll-loops
 # FIXME: Use of PowerPC assembly should be enabled here.

14

configs/linux-ppc-static

View File

@@ -1,14 +0,0 @@
 # Configuration for Linux on PPC, static libs
 include $(TOP)/configs/linux-ppc
 CONFIG_NAME = linux-ppc-static
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

8

configs/linux-profile

View File

@@ -1,8 +0,0 @@
 # Configuration for profiling on Linux with gprof
 include $(TOP)/configs/linux-static
 CONFIG_NAME = linux-profile
 OPT_FLAGS = -pg -g -O2
 DEFINES += -DNDEBUG

9

configs/linux-sparc

View File

@@ -1,9 +0,0 @@
 # Configuration for Linux on Sparc
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-sparc
 #ASM_FLAGS = -DUSE_SPARC_ASM
 #MESA_ASM_SOURCES = $(SPARC_SOURCES)
 #GLAPI_ASM_SOURCES = $(SPARC_API)

7

configs/linux-sparc5

View File

@@ -1,7 +0,0 @@
 # Configuration for Linux on Sparc5
 include $(TOP)/configs/linux-sparc
 CONFIG_NAME = linux-sparc5
 ARCH_FLAGS += -mcpu=ultrasparc

23

configs/linux-static

View File

@@ -1,23 +0,0 @@
 # Configuration for generic Linux, making static libs
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-static
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies (static libs don't have dependencies)
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =
 # Need to specify all libraries we may need
 	-l$(GL_LIB) -lm -L/usr/X11R6/lib/ -lX11 -lXext -lXmu -lXi -lpthread

7

configs/linux-ultrasparc

View File

@@ -1,7 +0,0 @@
 # Configuration for Linux on UltraSparc
 include $(TOP)/configs/linux-sparc
 CONFIG_NAME = linux-ultrasparc
 ARCH_FLAGS += -mv8 -mtune=ultrasparc

11

configs/linux-x86

View File

@@ -1,11 +0,0 @@
 # Configuration for Linux with x86 optimizations
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-x86
 ARCH_FLAGS = -m32 -mmmx -msse -msse2
 ASM_FLAGS = -DUSE_X86_ASM -DUSE_MMX_ASM -DUSE_3DNOW_ASM -DUSE_SSE_ASM
 MESA_ASM_SOURCES = $(X86_SOURCES)
 GLAPI_ASM_SOURCES = $(X86_API)

7

configs/linux-x86-32

View File

@@ -1,7 +0,0 @@
 # To build Linux x86 32-bit in an x86-64 environment
 include $(TOP)/configs/linux-x86
 CONFIG_NAME = linux-x86-32
 ARCH_FLAGS += -m32

14

configs/linux-x86-64

View File

@@ -1,14 +0,0 @@
 # Configuration for Linux for 64-bit X86 (Opteron)
 include $(TOP)/configs/linux
 CONFIG_NAME = linux-x86-64
 ARCH_FLAGS = -m64
 MESA_ASM_SOURCES = $(X86-64_SOURCES)
 GLAPI_ASM_SOURCES = $(X86-64_API)
 ASM_FLAGS = -DUSE_X86_64_ASM
 LIB_DIR = lib64
 EXTRA_LIB_PATH = -L/usr/X11R6/lib64

8

configs/linux-x86-64-debug

View File

@@ -1,8 +0,0 @@
 # Configuration for Linux for 64-bit X86 (Opteron)
 include $(TOP)/configs/linux-x86-64
 CONFIG_NAME = linux-x86-64-debug
 OPT_FLAGS = -g
 DEFINES += -DDEBUG -DDEBUG_MATH

8

configs/linux-x86-64-profile

View File

@@ -1,8 +0,0 @@
 # Configuration for profiling on Linux for 64-bit X86 (Opteron) with gprof
 include $(TOP)/configs/linux-x86-64-static
 CONFIG_NAME = linux-x86-64-profile
 OPT_FLAGS = -pg -g -O2
 DEFINES += -DNDEBUG

21

configs/linux-x86-64-static

View File

@@ -1,21 +0,0 @@
 # Configuration for Linux for 64-bit X86 (Opteron), static libs
 include $(TOP)/configs/linux-x86-64
 CONFIG_NAME = linux-x86-64-static
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies (static libs don't have dependencies)
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

9

configs/linux-x86-debug

View File

@@ -1,9 +0,0 @@
 # Configuration for Linux with x86 code, but no gcc optimizations and
 # debugging enabled.
 include $(TOP)/configs/linux-x86
 CONFIG_NAME = linux-x86-debug
 OPT_FLAGS = -g
 DEFINES += -DDEBUG -DDEBUG_MATH

8

configs/linux-x86-profile

View File

@@ -1,8 +0,0 @@
 # Configuration for profiling on Linux with x86 optimizations with gprof
 include $(TOP)/configs/linux-x86-static
 CONFIG_NAME = linux-x86-profile
 OPT_FLAGS = -pg -g -O2
 DEFINES += -DNDEBUG

21

configs/linux-x86-static

View File

@@ -1,21 +0,0 @@
 # Configuration for Linux with x86 optimizations, static libs
 include $(TOP)/configs/linux-x86
 CONFIG_NAME = linux-x86-static
 MKLIB_OPTIONS = -static
 PIC_FLAGS =
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies (static libs don't have dependencies)
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

15

configs/netbsd

View File

@@ -1,15 +0,0 @@
 # Configuration for NetBSD
 include $(TOP)/configs/default
 CONFIG_NAME = netbsd
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O2 -fPIC -DUSE_XSHM -I/usr/X11R6/include -DHZ=100
 CXXFLAGS = -O2 -fPIC
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing

20

configs/openbsd

View File

@@ -1,20 +0,0 @@
 # Configuration for OpenBSD
 include $(TOP)/configs/default
 CONFIG_NAME = openbsd
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O2 -fPIC -I/usr/X11R6/include -DUSE_XSHM -DHZ=100
 CXXFLAGS = -O2 -fPIC -I/usr/X11R6/include -DHZ=100
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 GL_LIB_DEPS = -L/usr/X11R6/lib -lX11 -lXext -lm
 OSMESA_LIB_DEPS = -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB)

14

configs/osf1

View File

@@ -1,14 +0,0 @@
 # Configuration for OSF/1
 include $(TOP)/configs/default
 CONFIG_NAME = osf1
 # Compiler and flags
 CC = cc
 CXX = cxx
 CFLAGS = -O0 -std1 -ieee_with_no_inexact -DUSE_XSHM -DPTHREADS -D_REENTRANT
 CXXFLAGS = -O2 -std ansi -ieee -DPTHREADS -D_REENTRANT
 GL_LIB_DEPS = -lX11 -lXext -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm

15

configs/osf1-static

View File

@@ -1,15 +0,0 @@
 # Configuration for OSF/1
 include $(TOP)/configs/default
 CONFIG_NAME = osf1
 # Compiler and flags
 CC = cc
 CXX = cxx
 CFLAGS = -O2 -std1 -ieee_with_no_inexact -DUSE_XSHM -DPTHREADS -D_REENTRANT
 CXXFLAGS = -O2 -std ansi -ieee -DPTHREADS -D_REENTRANT
 MKLIB_OPTIONS = -static
 GL_LIB_DEPS =
 GLU_LIB_DEPS =

16

configs/solaris-x86

View File

@@ -1,16 +0,0 @@
 # Configuration for Solaris on x86
 include $(TOP)/configs/default
 CONFIG_NAME = solaris-x86
 # Compiler and flags
 CC = cc
 CFLAGS = -Xa -xO3 -xpentium -KPIC -I/usr/openwin/include -DUSE_XSHM
 MKLIB_OPTIONS = -static
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

18

configs/solaris-x86-gcc

View File

@@ -1,18 +0,0 @@
 # Configuration for Solaris on x86 with gcc, dynamic libs
 include $(TOP)/configs/default
 CONFIG_NAME = solaris-x86-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -march=i486 -fPIC -I/usr/openwin/include -DUSE_XSHM
 CXXFLAGS = -O3 -march=i486 -fPIC
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 GL_LIB_DEPS = -L/usr/openwin/lib -lX11 -lXext -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm

24

configs/solaris-x86-gcc-static

View File

@@ -1,24 +0,0 @@
 # Configuration for Solaris on x86 with gcc, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = solaris-x86-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -O3 -march=i486 -fPIC -I/usr/openwin/include -DUSE_XSHM
 CXXFLAGS = -O3 -march=i486 -fPIC
 MKLIB_OPTIONS = -static
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 GL_LIB_DEPS = -L/usr/openwin/lib -lX11 -lXext -lm -lpthread
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a

11

configs/sunos4

View File

@@ -1,11 +0,0 @@
 # Configuration for SunOS 4, shared libs
 include $(TOP)/configs/default
 CONFIG_NAME = sunos4
 # Compiler and flags
 CC = acc
 CFLAGS = -Kpic -O -I/usr/include/X11R5 -DUSE_XSHM -DSUNOS4

17

configs/sunos4-gcc

View File

@@ -1,17 +0,0 @@
 # Configuration for SunOS 4, with gcc, shared libs
 include $(TOP)/configs/default
 CONFIG_NAME = sunos4-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 CFLAGS = -fPIC -O3 -I/usr/openwin/include -I/usr/include/X11R5 -I/usr/include/X11R5 -DUSE_XSHM -DSUNOS4
 CXXFLAGS = -fPIC -O3 -I/usr/openwin/include -DSUNOS4
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing

22

configs/sunos4-static

View File

@@ -1,22 +0,0 @@
 # Configuration for SunOS 4, static libs
 include $(TOP)/configs/default
 CONFIG_NAME = sunos4-static
 # Compiler and flags
 CC = acc
 CFLAGS = -O -DUSE_XSHM -DSUNOS4
 MKLIB_OPTIONS = -static
 # Library names (actual file names)
 GL_LIB_NAME = libGL.a
 GLU_LIB_NAME = libGLU.a
 GLW_LIB_NAME = libGLw.a
 OSMESA_LIB_NAME = libOSMesa.a
 # Library/program dependencies (static libs don't have dependencies)
 GL_LIB_DEPS =
 OSMESA_LIB_DEPS =
 GLU_LIB_DEPS =
 GLW_LIB_DEPS =

15

configs/sunos5

View File

@@ -1,15 +0,0 @@
 # Configuration for SunOS 5
 include $(TOP)/configs/default
 CONFIG_NAME = sunos5
 # Compiler and flags
 CC = cc
 CXX = c++
 CFLAGS = -KPIC -Xa -O -I/usr/openwin/include -I/usr/dt/include -DUSE_XSHM
 CXXFLAGS = -KPIC -Xa -O -I/usr/openwin/include -I/usr/dt/include
 GL_LIB_DEPS = -L/usr/openwin/lib -L/usr/dt/lib -lX11 -lXext -lXmu -lXi -lm
 GLU_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -lm
 GLW_LIB_DEPS = -L$(TOP)/$(LIB_DIR) -l$(GL_LIB) -L/usr/openwin/lib -lXt -lX11

11

configs/sunos5-64-gcc

View File

@@ -1,11 +0,0 @@
 # Configuration for 64-bit SunOS 5, with gcc
 include $(TOP)/configs/sunos5-gcc
 CONFIG_NAME = sunos5-64-gcc
 # only set vars that differ from sunos5-gcc config
 OPT_FLAGS  = -O3 -m64 -mcpu=ultrasparc -mv8plus -mvis -g -fomit-frame-pointer -pipe
 ARCH_FLAGS = -m64

37

configs/sunos5-gcc

View File

@@ -1,37 +0,0 @@
 # Configuration for SunOS 5, with gcc
 include $(TOP)/configs/default
 CONFIG_NAME = sunos5-gcc
 # Compiler and flags
 CC = gcc
 CXX = g++
 WARN_FLAGS = -Wall
 OPT_FLAGS  = -O3 -g -fomit-frame-pointer -pipe
 PIC_FLAGS  = -fPIC
 ARCH_FLAGS ?=
 DEFINES = -D_REENTRANT -DUSE_XSHM
 MESA_ASM_SOURCES = $(SPARC_SOURCES)
 GLAPI_ASM_SOURCES = $(SPARC_API)
 ASM_FLAGS = -DUSE_SPARC_ASM
 CFLAGS   = $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES) \
 	$(ASM_FLAGS) -std=c99 -ffast-math -I/usr/openwin/include
 CXXFLAGS = $(WARN_FLAGS) $(OPT_FLAGS) $(PIC_FLAGS) $(ARCH_FLAGS) $(DEFINES) \
 	-I/usr/openwin/include
 # Work around aliasing bugs - developers should comment this out
 CFLAGS += -fno-strict-aliasing
 CXXFLAGS += -fno-strict-aliasing
 # Library/program dependencies
 EXTRA_LIB_PATH=-L/usr/openwin/lib
 GL_LIB_DEPS = $(EXTRA_LIB_PATH) -lX11 -lXext -lXmu -lXi -lm

Compare commits

948 Commits i965-primi ... core-conte

11 .dir-locals.el Normal file Unescape Escape View File

10 .emacs-dirvars Unescape Escape View File

1 .gitignore vendored Unescape Escape View File

271 Makefile Unescape Escape View File

124 Makefile.am Normal file Unescape Escape View File

1 bin/.gitignore vendored Unescape Escape View File

48 bin/confdiff.sh Unescape Escape View File

20 bin/extract_git_sha1 Unescape Escape View File

23 bin/shortlog_mesa.sh Executable file Unescape Escape View File

17 bin/version.mk Unescape Escape View File

2 common.py Unescape Escape View File

27 configs/aix Unescape Escape View File

24 configs/aix-64 Unescape Escape View File

21 configs/aix-64-static Unescape Escape View File

21 configs/aix-gcc Unescape Escape View File

20 configs/aix-static Unescape Escape View File

31 configs/bluegene-osmesa Unescape Escape View File

27 configs/bluegene-xlc-osmesa Unescape Escape View File

30 configs/catamount-osmesa-pgi Unescape Escape View File

22 configs/autoconf.in → configs/current.in Unescape Escape View File

61 configs/darwin Unescape Escape View File

7 configs/darwin-fat-32bit Unescape Escape View File

7 configs/darwin-fat-all Unescape Escape View File

7 configs/darwin-fat-intel Unescape Escape View File

20 configs/default Unescape Escape View File

29 configs/freebsd Unescape Escape View File

48 configs/freebsd-dri Unescape Escape View File

10 configs/freebsd-dri-amd64 Unescape Escape View File

13 configs/freebsd-dri-x86 Unescape Escape View File

13 configs/hpux10 Unescape Escape View File

18 configs/hpux10-gcc Unescape Escape View File

26 configs/hpux10-static Unescape Escape View File

27 configs/hpux11-32 Unescape Escape View File

25 configs/hpux11-32-static Unescape Escape View File

24 configs/hpux11-32-static-nothreads Unescape Escape View File

28 configs/hpux11-64 Unescape Escape View File

25 configs/hpux11-64-static Unescape Escape View File

28 configs/hpux11-ia64 Unescape Escape View File

25 configs/hpux11-ia64-static Unescape Escape View File

15 configs/hpux9 Unescape Escape View File

13 configs/hpux9-gcc Unescape Escape View File

16 configs/irix6-64 Unescape Escape View File

24 configs/irix6-64-static Unescape Escape View File

16 configs/irix6-n32 Unescape Escape View File

23 configs/irix6-n32-static Unescape Escape View File

17 configs/irix6-o32 Unescape Escape View File

23 configs/irix6-o32-static Unescape Escape View File

37 configs/linux Unescape Escape View File

19 configs/linux-alpha Unescape Escape View File

27 configs/linux-alpha-static Unescape Escape View File

9 configs/linux-debug Unescape Escape View File

72 configs/linux-dri Unescape Escape View File

8 configs/linux-dri-debug Unescape Escape View File

9 configs/linux-dri-ppc Unescape Escape View File

13 configs/linux-dri-x86 Unescape Escape View File

17 configs/linux-dri-x86-64 Unescape Escape View File

54 configs/linux-dri-xcb Unescape Escape View File

58 configs/linux-egl Unescape Escape View File

18 configs/linux-ia64-icc Unescape Escape View File

23 configs/linux-ia64-icc-static Unescape Escape View File

19 configs/linux-icc Unescape Escape View File

23 configs/linux-icc-static Unescape Escape View File

52 configs/linux-indirect Unescape Escape View File

47 configs/linux-llvm Unescape Escape View File

12 configs/linux-llvm-debug Unescape Escape View File

27 configs/linux-opengl-es Unescape Escape View File

26 configs/linux-osmesa Unescape Escape View File

32 configs/linux-osmesa-static Unescape Escape View File

29 configs/linux-osmesa16 Unescape Escape View File

30 configs/linux-osmesa16-static Unescape Escape View File

29 configs/linux-osmesa32 Unescape Escape View File

9 configs/linux-ppc Unescape Escape View File

14 configs/linux-ppc-static Unescape Escape View File

8 configs/linux-profile Unescape Escape View File

9 configs/linux-sparc Unescape Escape View File

7 configs/linux-sparc5 Unescape Escape View File

23 configs/linux-static Unescape Escape View File

7 configs/linux-ultrasparc Unescape Escape View File

948 Commits

i965-primi ... core-conte

11

.dir-locals.el Normal file

View File

10

.emacs-dirvars

View File

1

.gitignore vendored

View File

271

Makefile

View File

124

Makefile.am Normal file

View File

1

bin/.gitignore vendored

View File

48

bin/confdiff.sh

View File

20

bin/extract_git_sha1

View File

23

bin/shortlog_mesa.sh Executable file

View File

17

bin/version.mk

View File

2

common.py

View File

27

configs/aix

View File

24

configs/aix-64

View File

21

configs/aix-64-static

View File

21

configs/aix-gcc

View File

20

configs/aix-static

View File

31

configs/bluegene-osmesa

View File

27

configs/bluegene-xlc-osmesa

View File

30

configs/catamount-osmesa-pgi

View File

22

configs/autoconf.in → configs/current.in

View File

61

configs/darwin

View File

7

configs/darwin-fat-32bit

View File

7

configs/darwin-fat-all

View File

7

configs/darwin-fat-intel

View File

20

configs/default

View File

29

configs/freebsd

View File

48

configs/freebsd-dri

View File

10

configs/freebsd-dri-amd64

View File

13

configs/freebsd-dri-x86

View File

13

configs/hpux10

View File

18

configs/hpux10-gcc

View File

26

configs/hpux10-static

View File

27

configs/hpux11-32

View File

25

configs/hpux11-32-static

View File

24

configs/hpux11-32-static-nothreads

View File

28

configs/hpux11-64

View File

25

configs/hpux11-64-static

View File

28

configs/hpux11-ia64

View File

25

configs/hpux11-ia64-static

View File

15

configs/hpux9

View File

13

configs/hpux9-gcc

View File

16

configs/irix6-64

View File

24

configs/irix6-64-static

View File

16

configs/irix6-n32

View File

23

configs/irix6-n32-static

View File

17

configs/irix6-o32

View File

23

configs/irix6-o32-static

View File

37

configs/linux

View File

19

configs/linux-alpha

View File

27

configs/linux-alpha-static

View File

9

configs/linux-debug

View File

72

configs/linux-dri

View File

8

configs/linux-dri-debug

View File

9

configs/linux-dri-ppc

View File

13

configs/linux-dri-x86

View File

17

configs/linux-dri-x86-64

View File

54

configs/linux-dri-xcb

View File

58

configs/linux-egl

View File

18

configs/linux-ia64-icc

View File

23

configs/linux-ia64-icc-static

View File

19

configs/linux-icc

View File

23

configs/linux-icc-static

View File

52

configs/linux-indirect

View File

47

configs/linux-llvm

View File

12

configs/linux-llvm-debug

View File

27

configs/linux-opengl-es

View File

26

configs/linux-osmesa

View File

32

configs/linux-osmesa-static

View File

29

configs/linux-osmesa16

View File

30

configs/linux-osmesa16-static

View File

29

configs/linux-osmesa32

View File

9

configs/linux-ppc

View File

14

configs/linux-ppc-static

View File

8

configs/linux-profile

View File

9

configs/linux-sparc

View File

7

configs/linux-sparc5

View File

23

configs/linux-static

View File

7

configs/linux-ultrasparc

View File

11

configs/linux-x86

View File