flodavid/suyu

Author	SHA1	Message	Date
ReinUsesLisp	bb2cbdf704	texture_cache: Test format compatibility before copying Avoid illegal copies. This intercepts the last step of a copy to avoid generating validation errors or corrupting the driver on some instances. We can create views and emit copies accordingly in future commits and remove this last-step validation.	2020-06-26 20:52:22 -03:00
bunnei	3579db425e	Merge pull request #4144 from FernandoS27/tt-fix TextureCache: Fix case where layer goes off bound.	2020-06-26 19:02:39 -04:00
bunnei	78d3b54ea7	Merge pull request #4111 from ReinUsesLisp/preserve-contents-vk vk_rasterizer: Don't preserve contents on full screen clears	2020-06-26 18:48:12 -04:00
ReinUsesLisp	1d6be9febf	video_core/compatible_formats: Table to test if two formats are legal to view or copy Add a flat table to test if it's legal to create a texture view between two formats or copy betweem them. This table is based on ARB_copy_image and ARB_texture_view. Copies are more permissive than views.	2020-06-26 19:28:11 -03:00
ReinUsesLisp	6481d91e4a	gl_buffer_cache: Copy to buffers created as STREAM_READ before downloading After marking buffers as resident, Nvidia's driver seems to take a slow path. To workaround this issue, copy to a STREAM_READ buffer and then call GetNamedBufferSubData on it. This is a temporary solution until we have asynchronous flushing.	2020-06-26 16:58:40 -03:00
Rodrigo Locatti	5872fc21fe	Merge pull request #4151 from ReinUsesLisp/gl-invalidations gl_shader_cache: Avoid use after move for program size	2020-06-25 21:05:27 -03:00
David Marcec	a927d8be52	gl_device: Fix IsASTCSupported Other targets were never actually checked	2020-06-25 19:12:56 +10:00
ReinUsesLisp	bc8d3b8f82	gl_device: Enable NV_vertex_buffer_unified_memory on Turing devices Once we make sure not to corrupt Nvidia's driver, we can safely use resident buffers on Turing devices. See GitHub pull request #4156	2020-06-25 01:28:47 -03:00
bunnei	0e1268e507	Merge pull request #4105 from ReinUsesLisp/resident-buffers gl_rasterizer: Use NV_vertex_buffer_unified_memory for vertex buffer robustness	2020-06-24 11:40:30 -04:00
bunnei	2f2df9a4a7	Merge pull request #4083 from Morph1984/B10G11R11F decode/image: Implement B10G11R11F	2020-06-24 11:02:38 -04:00
Fernando Sahmkow	32343d820d	Merge pull request #4046 from ogniK5377/macro-hle-prod Add support for HLEing Macros	2020-06-24 09:01:00 -04:00
ReinUsesLisp	32a2dcd415	buffer_cache: Use buffer methods instead of cache virtual methods	2020-06-24 02:36:14 -03:00
ReinUsesLisp	39c97f1b65	gl_stream_buffer: Use InvalidateBufferData instead unmap and map Making the stream buffer resident increases GPU usage significantly on some games. This seems to be addressed invalidating the stream buffer with InvalidateBufferData instead of using a Unmap + Map (with invalidation flags).	2020-06-24 02:36:14 -03:00
ReinUsesLisp	41a4090320	gl_rasterizer: Use NV_vertex_buffer_unified_memory for vertex buffer robustness Switch games are allowed to bind less data than what they use in a vertex buffer, the expected behavior here is that these values are read as zero. At the moment of writing this only D3D12, OpenGL and NVN through NV_vertex_buffer_unified_memory support vertex buffer with a size limit. In theory this could be emulated on Vulkan creating a new VkBuffer for each (handle, offset, length) tuple and binding the expected data to it. This is likely going to be slow and memory expensive when used on the vertex buffer and we have to do it on all draws because we can't know without analyzing indices when a game is going to read vertex data out of bounds. This is not a problem on OpenGL's BufferAddressRangeNV because it takes a length parameter, unlike Vulkan's CmdBindVertexBuffers that only takes buffers and offsets (the length is implicit in VkBuffer). It isn't a problem on D3D12 either, because D3D12_VERTEX_BUFFER_VIEW on IASetVertexBuffers takes SizeInBytes as a parameter (although I am not familiar with robustness on D3D12). Currently this only implements buffer ranges for vertex buffers, although indices can also be affected. A KHR_robustness profile is not created, but Nvidia's driver reads out of bound vertex data as zero anyway, this might have to be changed in the future. - Fixes SMO random triangles when capturing an enemy, getting hit, or looking at the environment on certain maps.	2020-06-24 02:36:14 -03:00
ReinUsesLisp	32485917ba	gl_buffer_cache: Mark buffers as resident Make stream buffer and cached buffers as resident and query their address. This allows us to use GPU addresses for several proprietary Nvidia extensions.	2020-06-24 02:36:14 -03:00
ReinUsesLisp	73fb3a304b	gl_device: Expose NV_vertex_buffer_unified_memory except on Turing Expose NV_vertex_buffer_unified_memory when the driver supports it. This commit adds a function the determine if a GL_RENDERER is a Turing GPU. This is required because on Turing GPUs Nvidia's driver crashes when the buffer is marked as resident or on DeleteBuffers. Without a synchronous debug output (single threaded driver), it's likely that the driver will crash in the first blocking call.	2020-06-24 02:36:14 -03:00
ReinUsesLisp	00c66a7289	gl_stream_buffer: Always use a non-coherent buffer	2020-06-24 02:35:33 -03:00
ReinUsesLisp	da79ec9565	gl_stream_buffer: Always use persistent memory maps yuzu no longer supports platforms without persistent maps.	2020-06-24 02:35:33 -03:00
Rodrigo Locatti	b66ccaa376	Merge pull request #4129 from Morph1984/texture-shadow-lod-workaround gl_shader_decompiler: Workaround textureLod when GL_EXT_texture_shadow_lod is not available	2020-06-24 01:51:15 -03:00
David Marcec	f5e2aec422	addressed issues	2020-06-24 12:18:33 +10:00
David Marcec	52340e94ac	clear mme draw mode We already draw, so we can clear it	2020-06-24 12:09:04 +10:00
David Marcec	fabdf5d385	Addressed issues	2020-06-24 12:09:03 +10:00
David Marcec	74b4334d51	Fix constbuffer for 0217920100488FF7	2020-06-24 12:09:02 +10:00
David Marcec	6ce5f3120b	Macro HLE support	2020-06-24 12:09:01 +10:00
ReinUsesLisp	9f54cd4dad	gl_shader_cache: Avoid use after move for program size All programs had a size of zero due to this bug, skipping invalidations. While we are at it, remove some unused forward declarations.	2020-06-23 22:54:42 -03:00
bunnei	15aeae3dd3	Merge pull request #4127 from lioncash/dst-typo texture_cache: Fix incorrect address used in a DeduceSurface() call	2020-06-23 15:59:37 -04:00
ReinUsesLisp	39ab33ee1c	shader/half_set: Implement HSET2_IMM Add HSET2_IMM. Due to the complexity of the encoding avoid using BitField unions and read the relevant bits from the code itself. This is less error prone.	2020-06-22 20:51:18 -03:00
Fernando Sahmkow	544b15e8e4	TextureCache: Fix case where layer goes off bound. The returned layer is expected to be between 0 and the depth of the surface, anything larger is off bounds.	2020-06-22 11:37:40 -04:00
Rodrigo Locatti	406d298457	Merge pull request #4110 from ReinUsesLisp/direct-upload-sets vk_update_descriptor: Upload descriptor sets data directly	2020-06-22 05:02:13 -03:00
ReinUsesLisp	2f09c7ddd3	renderer_vulkan: Update validation layer name and test before enabling Update validation layer string to VK_LAYER_KHRONOS_validation. While we are at it, properly check for available validation layers before enabling them.	2020-06-22 04:10:45 -03:00
bunnei	14a1181a97	Merge pull request #4122 from lioncash/hide video_core: Eliminate some variable shadowing	2020-06-21 22:38:04 -04:00
bunnei	c27c76ed43	Merge pull request #4126 from lioncash/noexcept vulkan/wrapper: Remove noexcept from GetSurfaceCapabilitiesKHR()	2020-06-21 22:36:14 -04:00
Morph	f77c897b8d	gl_shader_decompiler: Enable GL_EXT_texture_shadow_lod if available Enable GL_EXT_texture_shadow_lod if available. If this extension is not available, such as on Intel/AMD proprietary drivers, use textureGrad as a workaround.	2020-06-20 23:02:29 -04:00
Morph	1e65da971b	gl_device: Check for GL_EXT_texture_shadow_lod	2020-06-20 22:14:32 -04:00
bunnei	f98bf1025f	Merge pull request #4120 from lioncash/arb gl_arb_decompiler: Avoid several string copies	2020-06-20 22:11:49 -04:00
MerryMage	c12eb814b4	macro_jit_x64: Use ecx for shift register shl/shr only accept cl as their second argument	2020-06-20 22:24:05 +01:00
Lioncash	ef53b2fd08	texture_cache: Fix incorrect address used in a DeduceSurface() call Previously the source was being deduced twice in a row.	2020-06-20 14:11:28 -04:00
merry	928e9c09aa	Merge pull request #4125 from lioncash/macro-shift macro_jit_x64: Amend readability of Compile_ExtractShiftLeftRegister()	2020-06-20 16:08:23 +01:00
merry	2bd903e021	Merge pull request #4123 from lioncash/unused-var macro_jit_x64: Remove unused variable	2020-06-20 16:07:58 +01:00
Morph	480e1fa987	decode/image: Implement B10G11R11F - Used by Kirby Star Allies	2020-06-20 00:28:30 -04:00
bunnei	7d1dca4c98	Merge pull request #4099 from MerryMage/macOS-build Fix compilation on macOS	2020-06-19 23:31:04 -04:00
Lioncash	5865a10885	gl_arb_decompiler: Avoid several string copies Variables that are marked as const cannot have the move constructor invoked when returning from a function (the move constructor requires a non-const variable so it can "steal" the resources from it.	2020-06-19 23:09:16 -04:00
Lioncash	a6e5b84d1f	vulkan/wrapper: Remove noexcept from GetSurfaceCapabilitiesKHR() Check() can throw an exception if the Vulkan result isn't successful. We remove the check so that std::terminate isn't outright called and allows for better debugging (should it ever actually fail).	2020-06-19 23:01:59 -04:00
Lioncash	5a4e89b901	macro_jit_x64: Correct readability of Compile_ExtractShiftLeftImmediate() Previously dst wasn't being used.	2020-06-19 22:57:23 -04:00
Lioncash	140f953b6a	macro_jit_x64: Correct readability of Compile_ExtractShiftLeftRegister() Previously dst wasn't being used.	2020-06-19 22:56:55 -04:00
Lioncash	8ea749c1ca	macro_jit_x64: Remove unused variable Removes a completely unused label and marks another variable as unused, given it seems like it has potential uses in the future.	2020-06-19 22:10:45 -04:00
Lioncash	479605b3e5	memory_manager: Eliminate variable shadowing Renames some variables to prevent ones in inner scopes from shadowing outer-scoped variables. The Copy* functions have no shadowing, but we rename them anyways to remain consistent with the other functions.	2020-06-19 22:02:58 -04:00
Lioncash	811bff009e	macro_jit_x64: Eliminate variable shadowing in Compile_ProcessResult() We can reduce the capture scope so that it's not possible for both "reg" variables to clash with one another. While we're at it, we can prevent unnecessary copies while we're at it.	2020-06-19 21:57:44 -04:00
Lioncash	4514b80b3e	buffer_cache: Eliminate local variable shadowing We can just make use of the instance in the scope above this one.	2020-06-19 21:55:02 -04:00
bunnei	7daea551c0	Merge pull request #4087 from MerryMage/macrojit-inline-Read macro_jit_x64: Inline Engines::Maxwell3D::GetRegisterValue	2020-06-19 21:32:07 -04:00
MerryMage	977ceb4056	macro_jit_x64: Remove unused function Read	2020-06-19 11:39:41 +01:00
bunnei	5a092fb61e	Merge pull request #4090 from MerryMage/macrojit-bugs macro_jit_x64: Optimization correctness	2020-06-18 22:28:17 -04:00
ReinUsesLisp	cf137ea40b	vk_rasterizer: Don't preserve contents on full screen clears There's no need to load contents from the CPU when a clear resets all the contents of the underlying memory. This is already implemented on OpenGL and the texture cache.	2020-06-18 18:18:33 -03:00
ReinUsesLisp	7d763f060e	vk_update_descriptor: Upload descriptor sets data directly Instead of copying to a temporary payload before sending the update task to the worker thread, insert elements to the payload directly.	2020-06-18 17:47:19 -03:00
MerryMage	69f38355ed	vk_rasterizer: BindTransformFeedbackBuffersEXT accepts a size of type VkDeviceSize	2020-06-18 15:47:44 +01:00
MerryMage	b1eada6079	renderer_vulkan: Fix macOS GetBundleDirectory reference	2020-06-18 15:47:44 +01:00
MerryMage	442e48ef4c	memory_util: boost hashes are size_t * boost::hash_value returns a size_t * boost::hash_combine takes a size_t& argument	2020-06-18 15:47:43 +01:00
MerryMage	8ae7154541	Rename PAGE_SHIFT to PAGE_BITS macOS header files #define PAGE_SHIFT	2020-06-18 15:47:43 +01:00
Morph	2f420618ea	vk_sampler_cache: Emulate GL_LINEAR/NEAREST minification filters Emulate GL_LINEAR/NEAREST minification filters using minLod = 0 and maxLod = 0.25 during sampler creation	2020-06-18 04:56:31 -04:00
Morph	be660e7749	maxwell_to_vk: Reorder filter cases and correct mipmap_filter=None maxwell_to_vk: Reorder filtering modes to start with None, then Nearest, then Linear. maxwell_to_vk: Logs filter modes under UNREACHABLE_MSG instead of UNIMPLEMENTED_MSG, since any unknown filter modes are invalid and not unimplemented. maxwell_to_vk: Return VK_SAMPLER_MIPMAP_MODE_NEAREST instead of VK_SAMPLER_MIPMAP_MODE_LINEAR when mipmap_filter is None with the description from the VkSamplerCreateInfo(3) man page.	2020-06-18 04:56:31 -04:00
Morph	8868fb745f	maxwell_to_gl: Miscellaneous changes maxwell_to_gl: Log unimplemented features under UNIMPLEMENTED_MSG instead of LOG_ERROR to bring into parity with maxwell_to_vk maxwell_to_gl: Deduplicate logging in VertexType(), merging them into one. maxwell_to_gl: Return GL_NEAREST instead of GL_LINEAR if an unknown texture filter mode is encountered. maxwell_to_gl: Log the mipmap filter mode if an unknown value is passed in. maxwell_to_gl: Reorder filtering modes to start with None, then Nearest, then Linear.	2020-06-18 04:56:31 -04:00
Rodrigo Locatti	edb2114bac	Merge pull request #4092 from Morph1984/image-bindings gl_device: Reserve 4 image bindings for fragment stage	2020-06-18 04:59:48 -03:00
MerryMage	44f10d9b9f	macro_jit_x64: Inline Engines::Maxwell3D::GetRegisterValue	2020-06-17 17:17:08 +01:00
bunnei	a8ac99b619	Merge pull request #4086 from MerryMage/abi xbyak_abi: Cleanup	2020-06-17 11:20:52 -04:00
MerryMage	c409722435	macro_jit_x64: Optimization implicitly assumes same destination	2020-06-17 10:36:36 +01:00
MerryMage	a6ddd7c382	macro_jit_x64: Should not skip zero registers for certain ALU ops The code generated for these ALU ops assume src_a and src_b are always valid.	2020-06-17 10:36:34 +01:00
bunnei	b660ef6c8a	Merge pull request #4089 from MerryMage/macrojit-cleanup-1 macro_jit_x64: Cleanup	2020-06-16 23:44:48 -04:00
bunnei	798ec003ce	Merge pull request #4041 from ReinUsesLisp/arb-decomp gl_arb_decompiler: Implement an assembly shader decompiler	2020-06-16 14:56:23 -04:00
Morph	e2f5d16540	gl_device: Reserve at least 4 image bindings for fragment stage Due to the limitation of GL_MAX_IMAGE_UNITS being low (8) on Intel's and Nvidia's proprietary drivers, we have to reserve an appropriate amount of image bindings for each of the stages. So far games have been observed to use 4 image bindings on the fragment stage (Kirby Star Allies) and 1 on the vertex stage (TWD series). No games thus far in my limited testing used more than 4 images concurrently and across all currently active programs. This fixes shader compilation errors on Kirby Star Allies on OpenGL (GLSL/GLASM)	2020-06-16 03:03:07 -04:00
Rodrigo Locatti	0bd9bc7201	Merge pull request #4066 from ReinUsesLisp/shared-ptr-buf buffer_cache: Avoid passing references of shared pointers and misc style changes	2020-06-15 22:29:32 -03:00
MerryMage	cf0aad7d6a	macro_jit_x64: Remove NEXT_PARAMETER Not required, as PARAMETERS can just be incremented directly.	2020-06-15 21:19:38 +01:00
MerryMage	1799f4e774	macro_jit_x64: Remove unused function Compile_WriteCarry	2020-06-15 21:19:38 +01:00
MerryMage	c09a9e5cc7	macro_jit_x64: Select better registers All registers are now callee-save registers. RBX and RBP selected for STATE and RESULT because these are most commonly accessed; this is to avoid the REX prefix. RBP not used for STATE because there are some SIB restrictions, RBX emits smaller code.	2020-06-15 21:19:38 +01:00
MerryMage	79aa7b3ace	macro_jit_x64: Remove REGISTERS Unnecessary since this is just an offset from STATE.	2020-06-15 21:00:59 +01:00
MerryMage	35db6e1c68	macro_jit_x64: Remove JITState::parameters This can be passed in as an argument instead.	2020-06-15 20:55:02 +01:00
MerryMage	389549b80d	macro_jit_x64: Remove METHOD_ADDRESS_64 Unnecessary variable.	2020-06-15 20:51:33 +01:00
MerryMage	a6a43a5ae0	macro_jit_x64: Remove RESULT_64 This Reg64 codepath has the exact same behaviour as the Reg32 one.	2020-06-15 20:35:08 +01:00
MerryMage	d563017dfe	xbyak_abi: Remove *GPS variants of stack manipulation functions	2020-06-15 18:59:54 +01:00
ReinUsesLisp	6e5d8aac4d	video_core/macro_jit_x64: Remove initializer in member variable Fix build time issues on gcc. Confirmed through asan that avoiding this initialization is safe.	2020-06-15 05:17:55 -03:00
bunnei	92021a344c	Merge pull request #4064 from ReinUsesLisp/invalidate-buffers gl_rasterizer: Mark vertex buffers as dirty after buffer cache invalidation	2020-06-14 00:29:16 -04:00
bunnei	c2ea1e1bcb	Merge pull request #4049 from ReinUsesLisp/separate-samplers shader/texture: Join separate image and sampler pairs offline	2020-06-13 13:48:27 -04:00
bunnei	5633887569	Merge pull request #3986 from ReinUsesLisp/shader-cache shader_cache: Implement a generic runtime shader cache	2020-06-12 23:14:48 -04:00
ReinUsesLisp	87011a97f9	gl_arb_decompiler: Implement FSwizzleAdd	2020-06-11 22:12:07 -03:00
ReinUsesLisp	a63a0daa5e	gl_arb_decompiler: Implement an assembly shader decompiler Emit code compatible with NV_gpu_program5. This should emit code compatible with Fermi, but it wasn't tested on that architecture. Pascal has some issues not present on Turing GPUs.	2020-06-11 22:12:07 -03:00
bunnei	83e3b77ed7	Merge pull request #4027 from ReinUsesLisp/3d-slices texture_cache: Implement rendering to 3D textures	2020-06-09 21:52:15 -04:00
ReinUsesLisp	6508cdd003	buffer_cache: Avoid passing references of shared pointers and misc style changes Instead of using as template argument a shared pointer, use the underlying type and manage shared pointers explicitly. This can make removing shared pointers from the cache more easy. While we are at it, make some misc style changes and general improvements (like insert_or_assign instead of operator[] + operator=).	2020-06-09 18:30:49 -03:00
ReinUsesLisp	7646f2c21d	gl_rasterizer: Mark vertex buffers as dirty after buffer cache invalidation Vertex buffers bindings become invalid after the stream buffer is invalidated. We were originally doing this, but it got lost at some point. - Fixes Animal Crossing: New Horizons, but it affects everything.	2020-06-08 20:24:16 -03:00
ReinUsesLisp	6e122f0b2c	buffer_cache: Return stream buffer invalidation in Map instead of Unmap We have to invalidate whatever cache is being used before uploading the data, hence it makes more sense to return this on Map instead of Unmap.	2020-06-08 20:22:31 -03:00
bunnei	3626254f48	Merge pull request #4040 from ReinUsesLisp/nv-transform-feedback gl_rasterizer: Use NV_transform_feedback for XFB on assembly shaders	2020-06-08 16:18:33 -04:00
bunnei	98d2461529	Merge pull request #4052 from ReinUsesLisp/debug-output renderer_opengl: Only enable DEBUG_OUTPUT when graphics debugging is enabled	2020-06-08 10:16:41 -04:00
ReinUsesLisp	bd43c05470	texture_cache: Port original code management for 2D vs 3D textures Handle blits to images as 2D, even when they have block depth. - Fixes rendering issues on Luigi's Mansion 3	2020-06-08 05:02:22 -03:00
ReinUsesLisp	c99f5d405b	texture_cache: Simplify blit code	2020-06-08 05:01:44 -03:00
ReinUsesLisp	3c2ae53b4c	texture_cache: Handle 3D texture blits with one layer	2020-06-08 05:01:00 -03:00
ReinUsesLisp	c95c254f3e	texture_cache: Implement rendering to 3D textures This allows rendering to 3D textures with more than one slice. Applications are allowed to render to more than one slice of a texture using gl_Layer from a VTG shader. This also requires reworking how 3D texture collisions are handled, for now, this commit allows rendering to slices but not to miplevels. When a render target attempts to write to a mipmap, we fallback to the previous implementation (copying or flushing as needed). - Fixes color correction 3D textures on UE4 games (rainbow effects). - Allows Xenoblade games to render to 3D textures directly.	2020-06-08 05:01:00 -03:00
Rodrigo Locatti	2293e8a11a	Merge pull request #4034 from ReinUsesLisp/storage-texels vk_rasterizer: Implement storage texels and atomic image operations	2020-06-07 18:43:24 -03:00
ReinUsesLisp	abcea1bb18	rasterizer_cache: Remove files and includes The rasterizer cache is no longer used. Each cache has its own generic implementation optimized for the cached data.	2020-06-07 04:32:57 -03:00
ReinUsesLisp	678f95e4f8	vk_pipeline_cache: Use generic shader cache Trivial port the generic shader cache to Vulkan.	2020-06-07 04:32:57 -03:00
ReinUsesLisp	b96f65b62b	gl_shader_cache: Use generic shader cache Trivially port the generic shader cache to OpenGL.	2020-06-07 04:32:57 -03:00
ReinUsesLisp	dc27252352	shader_cache: Implement a generic shader cache Implement a generic shader cache for fast lookups and invalidations. Invalidations are cheap but expensive when a shader is invalidated. Use two mutexes instead of one to avoid locking invalidations for lookups and vice versa. When a shader has to be removed, lookups are locked as expected.	2020-06-07 04:32:32 -03:00
ReinUsesLisp	e78d681a6c	gl_device: Black list NVIDIA 443.24 for fast buffer uploads Skip fast buffer uploads on Nvidia 443.24 Vulkan beta driver on OpenGL. This driver throws the following error when calling BufferSubData or BufferData on buffers that are candidates for fast constant buffer uploads. This is the equivalens to push constants on Vulkan, except that they can access the full buffer. The error: Unknown internal debug message. The NVIDIA OpenGL driver has encountered an out of memory error. This application might behave inconsistently and fail. If this error persists on future drivers, we might have to look deeper into this issue. For now, we can black list it and log it as a temporary solution.	2020-06-06 02:56:42 -03:00

1 2 3 4 5 ...

4574 commits