Linuxydable/suyu

Author	SHA1	Message	Date
ReinUsesLisp	36abf67e79	shader/image: Implement SUATOM and fix SUST	2019-09-10 20:22:31 -03:00
ReinUsesLisp	4e35177e23	shader_ir: Implement VOTE Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers.	2019-08-21 14:50:38 -03:00
Fernando Sahmkow	11f4e739bd	Shader_Ir: Implement F16 Variants of F2F, F2I, I2F. This commit takes care of implementing the F16 Variants of the conversion instructions and makes sure conversions are done.	2019-07-20 17:38:25 -04:00
ReinUsesLisp	45c162444d	shader/half_set_predicate: Fix HSETP2 implementation	2019-07-19 22:21:22 -03:00
Fernando Sahmkow	1bdb59fc6e	Merge pull request #2695 from ReinUsesLisp/layer-viewport gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders	2019-07-15 16:28:07 -04:00
bunnei	bb67091c77	Merge pull request #2609 from FernandoS27/new-scan Implement a New Shader Scanner, Decompile Flow Stack and implement BRX BRA.CC	2019-07-11 17:36:23 -04:00
bunnei	7fb7054bc8	Merge pull request #2686 from ReinUsesLisp/vk-scheduler vk_scheduler: Drop execution context in favor of views	2019-07-10 16:35:48 -04:00
Fernando Sahmkow	8a6fc529a9	shader_ir: Implement BRX & BRA.CC	2019-07-09 08:14:37 -04:00
ReinUsesLisp	c9d886c84e	gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders This commit implements gl_ViewportIndex and gl_Layer in vertex and geometry shaders. In the case it's used in a vertex shader, it requires ARB_shader_viewport_layer_array. This extension is available on AMD and Nvidia devices (mesa and proprietary drivers), but not available on Intel on any platform. At the moment of writing this description I don't know if this is a hardware limitation or a driver limitation. In the case that ARB_shader_viewport_layer_array is not available, writes to these registers on a vertex shader are ignored, with the appropriate logging.	2019-07-07 20:42:55 -03:00
Lioncash	cbdd6cd1c0	vk_sampler_cache: Remove unused includes These are no longer used within this header, so they can be removed.	2019-07-07 13:40:36 -04:00
Lioncash	4b27680639	video_core: Add missing override specifiers	2019-07-07 13:38:39 -04:00
ReinUsesLisp	86a874a2fc	vk_scheduler: Drop execution context in favor of views Instead of passing by copy an execution context through out the whole Vulkan call hierarchy, use a command buffer view and fence view approach. This internally dereferences the command buffer or fence forcing the user to be unable to use an outdated version of it on normal usage. It is still possible to keep store an outdated if it is casted to VKFence& or vk::CommandBuffer. While changing this file, add an extra parameter for Flush and Finish to allow releasing the fence from this calls.	2019-07-07 03:30:22 -03:00
ReinUsesLisp	06c4ce8645	shader: Decode SUST and implement backing image functionality	2019-06-20 21:38:33 -03:00
Zach Hilman	c0e7b91145	Merge pull request #2538 from ReinUsesLisp/ssy-pbk shader: Split SSY and PBK stack	2019-06-15 20:30:13 -04:00
Zach Hilman	de33ad25f5	Merge pull request #2514 from ReinUsesLisp/opengl-compat video_core: Drop OpenGL core in favor of OpenGL compatibility	2019-06-07 17:23:25 -04:00
ReinUsesLisp	fe8e6618f2	shader: Split SSY and PBK stack Hardware testing revealed that SSY and PBK push to a different stack, allowing code like this: SSY label1; PBK label2; SYNC; label1: PBK; label2: EXIT;	2019-06-07 02:18:27 -03:00
ReinUsesLisp	bf4dfb3ad4	shader: Use shared_ptr to store nodes and move initialization to file Instead of having a vector of unique_ptr stored in a vector and returning star pointers to this, use shared_ptr. While changing initialization code, move it to a separate file when possible. This is a first step to allow code analysis and node generation beyond the ShaderIR class.	2019-06-05 20:41:52 -03:00
bunnei	a20ba09bfd	Merge pull request #2520 from ReinUsesLisp/vulkan-refresh vk_device,vk_shader_decompiler: Miscellaneous changes	2019-06-05 18:10:00 -04:00
ReinUsesLisp	a89cc0bafc	maxwell_to_gl: Use GL_CLAMP to emulate Clamp wrap mode	2019-05-30 13:21:01 -03:00
ReinUsesLisp	f424b46036	vk_device: Let formats array type be deduced	2019-05-26 03:09:06 -03:00
ReinUsesLisp	a4c5e3e339	vk_shader_decompiler: Misc fixes Fix missing OpSelectionMerge instruction. This caused devices loses on most hardware, Intel didn't care. Fix [-1;1] -> [0;1] depth conversions. Conditionally use VK_EXT_scalar_block_layout. This allows us to use non-std140 layouts on UBOs. Update external Vulkan headers.	2019-05-26 01:48:04 -03:00
ReinUsesLisp	dec3c981d0	vk_device: Enable features when available and misc changes Keeps track of native ASTC support, VK_EXT_scalar_block_layout availability and SSBO range. Check for independentBlend and vertexPipelineStorageAndAtomics as a required feature. Always enable it. Use vk::to_string format to log Vulkan enums. Style changes.	2019-05-26 01:41:34 -03:00
ReinUsesLisp	9c3461604c	shader: Implement S2R Tid{XYZ} and CtaId{XYZ}	2019-05-20 16:36:49 -03:00
bunnei	d49efbfb4a	Merge pull request #2441 from ReinUsesLisp/al2p shader: Implement AL2P and ALD.PHYS	2019-05-19 14:02:58 -04:00
Mat M	dadcf317dc	Merge pull request #2461 from lioncash/unused-var video_core: Remove a few unused variables and functions	2019-05-14 06:36:26 -04:00
Rodrigo Locatti	940a71089d	Merge pull request #2413 from FernandoS27/opt-gpu Rasterizer Cache: refactor flushing & optimize memory usage of surfaces	2019-05-13 23:01:59 -03:00
Lioncash	e3c45b4338	renderer_vulkan/vk_shader_decompiler: Remove unused variable from DeclareInternalFlags()	2019-05-09 18:47:48 -04:00
ReinUsesLisp	06b363c9b5	shader: Remove unused AbufNode Ipa mode	2019-05-02 21:46:25 -03:00
bunnei	c52233ec8b	Merge pull request #2322 from ReinUsesLisp/wswitch video_core: Silent -Wswitch warnings	2019-04-28 22:24:58 -04:00
Fernando Sahmkow	4c36b78567	Rasterizer Cache: Use a temporal storage for Surfaces loading/flushing. This PR should heavily reduce memory usage since temporal buffers are no longer stored per Surface but instead managed by the Rasterizer Cache.	2019-04-21 11:42:07 -04:00
bunnei	650d9b1044	Merge pull request #2409 from ReinUsesLisp/half-floats shader_ir/decode: Miscellaneous fixes to half-float decompilation	2019-04-19 21:31:52 -04:00
Fernando Sahmkow	a3eb91ed8c	RasterizerCache Redesign: Flush flushing is now responsability of children caches instead of the cache object. This change will allow the specific cache to pass extra parameters on flushing and will allow more flexibility.	2019-04-19 20:44:56 -04:00
ReinUsesLisp	fbe8d1ceaa	video_core: Silent -Wswitch warnings	2019-04-18 15:54:39 -03:00
bunnei	4294062516	Merge pull request #2318 from ReinUsesLisp/sampler-cache gl_sampler_cache: Port sampler cache to OpenGL	2019-04-17 21:45:56 -04:00
ReinUsesLisp	ef8245bed2	vk_shader_decompiler: Add missing operations	2019-04-15 21:32:57 -03:00
ReinUsesLisp	f43995ec53	shader_ir/decode: Fix half float pre-operations and remove MetaHalfArithmetic Operations done before the main half float operation (like HAdd) were managing a packed value instead of the unpacked one. Adding an unpacked operation allows us to drop the per-operand MetaHalfArithmetic entry, simplifying the code overall.	2019-04-15 21:16:10 -03:00
ReinUsesLisp	64613db605	shader_ir/decode: Implement half float saturation	2019-04-15 21:16:10 -03:00
ReinUsesLisp	5c280e6ff0	shader_ir: Implement STG, keep track of global memory usage and flush	2019-04-14 00:25:32 -03:00
ReinUsesLisp	75d23a3679	vk_shader_decompiler: Implement flow primitives	2019-04-10 14:20:25 -03:00
ReinUsesLisp	58ad8dfac6	vk_shader_decompiler: Implement most common texture primitives	2019-04-10 14:20:25 -03:00
ReinUsesLisp	4667ed8e22	vk_shader_decompiler: Implement texture decompilation helper functions	2019-04-10 14:20:25 -03:00
ReinUsesLisp	676172e20d	vk_shader_decompiler: Implement Assign and LogicalAssign	2019-04-10 14:20:25 -03:00
ReinUsesLisp	d316d248ab	vk_shader_decompiler: Implement non-OperationCode visits	2019-04-10 14:20:25 -03:00
ReinUsesLisp	b758c861b0	vk_shader_decompiler: Implement OperationCode decompilation interface	2019-04-10 14:20:25 -03:00
ReinUsesLisp	fec4eb9776	vk_shader_decompiler: Implement Visit	2019-04-10 14:20:25 -03:00
ReinUsesLisp	ca51f99840	vk_shader_decompiler: Implement labels tree and flow	2019-04-10 14:20:25 -03:00
ReinUsesLisp	13aa664f3f	vk_shader_decompiler: Implement declarations	2019-04-10 14:20:25 -03:00
ReinUsesLisp	ad53b233c5	vk_shader_decompiler: Declare and stub interface for a SPIR-V decompiler	2019-04-10 14:20:25 -03:00
Lioncash	26223f8124	video_core/engines: Remove unnecessary inclusions where applicable Replaces header inclusions with forward declarations where applicable and also removes unused headers within the cpp file. This reduces a few more dependencies on core/memory.h	2019-04-05 18:26:32 -04:00
bunnei	7931a68d4e	Merge pull request #2302 from ReinUsesLisp/vk-swapchain vk_swapchain: Implement a swapchain manager	2019-04-03 11:50:05 -04:00
ReinUsesLisp	c5047540c9	video_core: Abstract vk_sampler_cache into a templated class	2019-04-02 15:54:11 -03:00
bunnei	1960164055	Merge pull request #2297 from lioncash/reorder video_core: Amend constructor initializer list order where applicable	2019-03-30 20:00:26 -04:00
ReinUsesLisp	746dab407e	vk_swapchain: Implement a swapchain manager	2019-03-29 00:00:51 -03:00
Lioncash	a5fa4b311e	video_core: Amend constructor initializer list order where applicable Specifies the members in the same order that initialization would take place in. This also silences -Wreorder warnings.	2019-03-27 12:37:53 -04:00
Lioncash	bbe700359d	video_core: Add missing override specifiers Ensures that the signatures will always match with the base class. Also silences a few compilation warnings.	2019-03-27 12:24:52 -04:00
bunnei	241563d15c	gpu: Move GPUVAddr definition to common_types.	2019-03-20 22:36:02 -04:00
bunnei	2eaf6c41a4	gpu: Use host address for caching instead of guest address.	2019-03-14 22:34:42 -04:00
Mat M	a3734d7e31	vk_sampler_cache: Use operator== instead of memcmp Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc>	2019-03-12 21:05:36 -03:00
ReinUsesLisp	aa59d77c3b	vk_sampler_cache: Implement a sampler cache	2019-03-12 20:20:57 -03:00
bunnei	1143923cdd	Merge pull request #2191 from ReinUsesLisp/maxwell-to-vk maxwell_to_vk: Initial implementation	2019-03-08 11:51:08 -05:00
Lioncash	f9ee0dc7ee	video_core/engines: Remove unnecessary includes Removes a few unnecessary dependencies on core-related machinery, such as the core.h and memory.h, which reduces the amount of rebuilding necessary if those files change. This also uncovered some indirect dependencies within other source files. This also fixes those.	2019-03-05 20:35:32 -05:00
ReinUsesLisp	1f6571b3de	maxwell_to_vk: Initial implementation	2019-03-04 04:06:05 -03:00
ReinUsesLisp	8e84e81e74	vk_buffer_cache: Fix clang-format	2019-03-02 02:16:45 -03:00
ReinUsesLisp	35c105a108	vk_buffer_cache: Implement a buffer cache This buffer cache is just like OpenGL's buffer cache with some minor style changes. It uses VKStreamBuffer.	2019-03-01 17:33:36 -03:00
bunnei	1b13859af8	Merge pull request #2152 from ReinUsesLisp/vk-stream-buffer vk_stream_buffer: Implement a stream buffer	2019-02-27 21:19:15 -05:00
Lioncash	16ea93c11e	vk_memory_manager: Reorder constructor initializer list in terms of member declaration order Reorders members in the order that they would actually be initialized in. Silences a -Wreorder warning.	2019-02-27 11:08:19 -05:00
ReinUsesLisp	730eb1dad7	vk_stream_buffer: Remove copy code path	2019-02-26 02:09:43 -03:00
ReinUsesLisp	33a0597603	vk_stream_buffer: Implement a stream buffer This manages two kinds of streaming buffers: one for unified memory models and one for dedicated GPUs. The first one skips the copy from the staging buffer to the real buffer, since it creates an unified buffer. This implementation waits for all fences to finish their operation before "invalidating". This is suboptimal since it should allocate another buffer or start searching from the beginning. There is room for improvement here. This could also handle AMD's "pinned" memory (a heap with 256 MiB) that seems to be designed for buffer streaming.	2019-02-24 04:27:51 -03:00
ReinUsesLisp	281a8bf259	vk_resource_manager: Minor VKFenceWatch changes	2019-02-24 04:19:04 -03:00
bunnei	f7090bacc5	Merge pull request #2146 from ReinUsesLisp/vulkan-scheduler vk_scheduler: Implement a scheduler	2019-02-23 23:32:43 -05:00
ReinUsesLisp	92050c4d86	vk_memory_manager: Fixup commit interval allocation VKMemoryCommitImpl was using as the end of its interval "begin + end". That ended up wasting memory.	2019-02-24 01:04:41 -03:00
ReinUsesLisp	f546fb35ed	vk_scheduler: Implement a scheduler The scheduler abstracts command buffer and fence management with an interface that's able to do OpenGL-like operations on Vulkan command buffers. It returns by value a command buffer and fence that have to be used for subsequent operations until Flush or Finish is executed, after that the current execution context (the pair of command buffers and fences) gets invalidated a new one must be fetched. Thankfully validation layers will quickly detect if this is skipped throwing an error due to modifications to a sent command buffer.	2019-02-22 01:33:32 -03:00
ReinUsesLisp	b675c97cdd	vk_memory_manager: Implement memory manager A memory manager object handles the memory allocations for a device. It allocates chunks of Vulkan memory objects and then suballocates.	2019-02-19 03:42:28 -03:00
ReinUsesLisp	ae6c052ed9	vk_resource_manager: Implement a command buffer pool with VKFencedPool	2019-02-14 18:44:26 -03:00
ReinUsesLisp	a2b6de7e9f	vk_resource_manager: Add VKFencedPool interface Handles a pool of resources protected by fences. Manages resource overflow allocating more resources. This class is intended to be used through inheritance.	2019-02-14 18:44:26 -03:00
ReinUsesLisp	0ffdd0a683	vk_resource_manager: Implement VKResourceManager and fence allocator CommitFence iterates a pool of fences until one is found. If all fences are being used at the same time, allocate more.	2019-02-14 18:44:26 -03:00
ReinUsesLisp	aa0b6babda	vk_resource_manager: Implement VKFenceWatch A fence watch is used to keep track of the usage of a fence and protect a resource or set of resources without having to inherit from their handlers.	2019-02-14 18:44:26 -03:00
ReinUsesLisp	25c2fe1c6b	vk_resource_manager: Implement VKFence Fences take ownership of objects, protecting them from GPU-side or driver-side concurrent access. They must be commited from the resource manager. Their usage flow is: commit the fence from the resource manager, protect resources with it and use them, send the fence to an execution queue and Wait for it if needed and then call Release. Used resources will automatically be signaled when they are free to be reused.	2019-02-14 18:44:26 -03:00
ReinUsesLisp	33a4cebc22	vk_resource_manager: Add VKResource interface VKResource is an interface that gets signaled by a fence when it is free to be reused.	2019-02-14 18:36:15 -03:00
ReinUsesLisp	8beca060d1	vk_device: Abstract device handling into a class VKDevice contains all the data required to manage and initialize a physical device. Its intention is to be passed across Vulkan objects to query device-specific data (for example the logical device and the dispatch loader).	2019-02-12 21:43:02 -03:00
ReinUsesLisp	18fe910957	renderer_vulkan: Add declarations file This file is intended to be included instead of vulkan/vulkan.hpp. It includes declarations of unique handlers using a dynamic dispatcher instead of a static one (which would require linking to a Vulkan library).	2019-02-12 18:33:02 -03:00

... 2 3 4 5 6

281 commits