Linuxydable/suyu

Author	SHA1	Message	Date
ameerj	cd8427367e	gl_buffer_cache: Use unorm internal formats for snorm texture buffer views Fixes black textures in UE4 games	2021-07-22 21:51:35 -04:00
ReinUsesLisp	adb591a757	glasm: Use storage buffers instead of global memory when possible	2021-07-22 21:51:33 -04:00
ReinUsesLisp	d621e96d0d	shader: Initial OpenGL implementation	2021-07-22 21:51:30 -04:00
Fernando Sahmkow	b780d5b5c5	DMAEngine: Accelerate BufferClear	2021-07-13 03:49:47 +02:00
ReinUsesLisp	5ad62e7bfc	buffer_cache: Heuristically decide to skip cache on uniform buffers Some games benefit from skipping caches (Pokémon Sword), and others don't (Animal Crossing: New Horizons). Add an heuristic to decide this at runtime. The cache hit ratio has to be ~98% or better to not skip the cache. There are 16 frames of buffer.	2021-03-02 02:44:19 -03:00
ReinUsesLisp	0b631f22fc	renderer_opengl: Remove interop Remove unused interop code from the OpenGL backend.	2021-02-13 02:18:04 -03:00
ReinUsesLisp	3da87d3f12	gl_buffer_cache: Drop interop based parameter buffer workarounds Sacrify runtime performance to avoid generating kernel exceptions on Windows due to our abusive aliasing of interop buffer objects.	2021-02-13 02:17:24 -03:00
ReinUsesLisp	82c2601555	video_core: Reimplement the buffer cache Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache. - Bindings are cached, allowing to skip work when the game changes few bits between draws. - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation. - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData. - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot). - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecesarry work. This commit adds the necessary infrastructure to use Vulkan object from OpenGL. Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors, this are planned to be fixed in later commits.	2021-02-13 02:17:22 -03:00
ReinUsesLisp	9764c13d6d	video_core: Rewrite the texture cache The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage.The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage. This commit aims to address those issues.	2020-12-30 03:38:50 -03:00
Lioncash	f95602f152	video_core: Resolve more variable shadowing scenarios pt.3 Cleans out the rest of the occurrences of variable shadowing and makes any further occurrences of shadowing compiler errors.	2020-12-05 16:02:23 -05:00
ReinUsesLisp	9e87193725	video_core: Remove all Core::System references in renderer Now that the GPU is initialized when video backends are initialized, it's no longer needed to query components once the game is running: it can be done when yuzu is booting. This allows us to pass components between constructors and in the process remove all Core::System references in the video backend.	2020-09-06 05:28:48 -03:00
ReinUsesLisp	a8a2526128	gl_arb_decompiler: Use NV_shader_buffer_{load,store} on assembly shaders NV_shader_buffer_{load,store} is a 2010 extension that allows GL applications to use what in Vulkan is known as physical pointers, this is basically C pointers. On GLASM these is exposed through the LOAD/STORE/ATOM instructions. Up until now, assembly shaders were using NV_shader_storage_buffer_object. These work fine, but have a (probably unintended) limitation that forces us to have the limit of a single stage for all shader stages. In contrast, with NV_shader_buffer_{load,store} we can pass GPU addresses to the shader through local parameters (GLASM equivalent uniform constants, or push constants on Vulkan). Local parameters have the advantage of being per stage, allowing us to generate code without worrying about binding overlaps.	2020-07-18 01:59:57 -03:00
ReinUsesLisp	6481d91e4a	gl_buffer_cache: Copy to buffers created as STREAM_READ before downloading After marking buffers as resident, Nvidia's driver seems to take a slow path. To workaround this issue, copy to a STREAM_READ buffer and then call GetNamedBufferSubData on it. This is a temporary solution until we have asynchronous flushing.	2020-06-26 16:58:40 -03:00
ReinUsesLisp	32a2dcd415	buffer_cache: Use buffer methods instead of cache virtual methods	2020-06-24 02:36:14 -03:00
ReinUsesLisp	32485917ba	gl_buffer_cache: Mark buffers as resident Make stream buffer and cached buffers as resident and query their address. This allows us to use GPU addresses for several proprietary Nvidia extensions.	2020-06-24 02:36:14 -03:00
ReinUsesLisp	6508cdd003	buffer_cache: Avoid passing references of shared pointers and misc style changes Instead of using as template argument a shared pointer, use the underlying type and manage shared pointers explicitly. This can make removing shared pointers from the cache more easy. While we are at it, make some misc style changes and general improvements (like insert_or_assign instead of operator[] + operator=).	2020-06-09 18:30:49 -03:00
ReinUsesLisp	891236124c	buffer_cache: Use boost::intrusive::set for caching Instead of using boost::icl::interval_map for caching, use boost::intrusive::set. interval_map is intended as a container where the keys can overlap with one another; we don't need this for caching buffers and a std::set-like data structure that allows us to search with lower_bound is enough.	2020-05-21 16:44:00 -03:00
ReinUsesLisp	fe931ac976	{maxwell_3d,buffer_cache}: Implement memory barriers using 3D registers Drop MemoryBarrier from the buffer cache and use Maxwell3D's register WaitForIdle. To implement this on OpenGL we just call glMemoryBarrier with the necessary bits. Vulkan lacks this synchronization primitive, so we set an event and immediately wait for it. This is not a pretty solution, but it's what Vulkan can do without submitting the current command buffer to the queue (which ends up being more expensive on the CPU).	2020-04-28 02:18:12 -03:00
Fernando Sahmkow	131b342130	OpenGL: Guarantee writes to Buffers.	2020-04-22 11:36:18 -04:00
ReinUsesLisp	090fd3fefa	buffer_cache: Return handles instead of pointer to handles The original idea of returning pointers is that handles can be moved. The problem is that the implementation didn't take that in mind and made everything harder to work with. This commit drops pointer to handles and returns the handles themselves. While it is still true that handles can be invalidated, this way we get an old handle instead of a dangling pointer. This problem can be solved in the future with sparse buffers.	2020-04-16 02:33:34 -03:00
Fernando Sahmkow	7fcd0fee6d	Buffer Cache: Use vAddr instead of physical memory.	2020-04-06 09:23:06 -04:00
ReinUsesLisp	76ca2a5f82	gl_rasterizer: Upload constant buffers with glNamedBufferSubData Nvidia's OpenGL driver maps gl(Named)BufferSubData with some requirements to a fast. This path has an extra memcpy but updates the buffer without orphaning or waiting for previous calls. It can be seen as a better model for "push constants" that can upload a whole UBO instead of 256 bytes. This path has some requirements established here: http://on-demand.gputechconf.com/gtc/2014/presentations/S4379-opengl-44-scene-rendering-techniques.pdf#page=24 Instead of using the stream buffer, this commits moves constant buffers uploads to calls of glNamedBufferSubData and from my testing it brings a performance improvement. This is disabled when the vendor is not Nvidia since it brings performance regressions.	2019-11-02 05:05:34 -03:00
ReinUsesLisp	878adee0a3	gl_buffer_cache: Add missing include RasterizerInterface was considered an incomplete object by clang.	2019-08-29 22:02:52 +00:00
Fernando Sahmkow	6ce2c85047	Buffer_Cache: Implement flushing.	2019-08-21 12:14:26 -04:00
Fernando Sahmkow	862bec001b	Video_Core: Implement a new Buffer Cache	2019-08-21 12:14:22 -04:00
ReinUsesLisp	1fa21fa192	gl_buffer_cache: Implement with generic buffer cache	2019-07-06 00:37:55 -03:00
ReinUsesLisp	2bcae41a73	gl_buffer_cache: Remove global system getters	2019-07-06 00:37:55 -03:00
ReinUsesLisp	d14fbfb9b5	gl_buffer_cache: Implement flushing	2019-07-06 00:37:55 -03:00
ReinUsesLisp	345f852bdb	gl_rasterizer: Drop gl_global_cache in favor of gl_buffer_cache	2019-07-06 00:37:55 -03:00
ReinUsesLisp	8155b12d3d	gl_buffer_cache: Rework to support internalized buffers	2019-07-06 00:37:55 -03:00
ReinUsesLisp	f8ba72d491	gl_buffer_cache: Store in CachedBufferEntry the used buffer handle	2019-07-06 00:37:55 -03:00
ReinUsesLisp	b54fb8fc4c	gl_buffer_cache: Return used buffer from Upload function	2019-07-06 00:37:55 -03:00
Fernando Sahmkow	4705d1b523	rasterizer_cache: Protect inherited caches from submission level	2019-07-01 04:32:01 -04:00
ReinUsesLisp	6ac4490751	gl_buffer_cache: Remove unused ReserveMemory method	2019-05-30 13:21:01 -03:00
Lioncash	fbf452ab0e	video_core/texures/texture: Remove unnecessary includes Nothing in this header relies on common_funcs or the memory manager. This gets rid of reliance on indirect inclusions in the OpenGL caches.	2019-04-06 00:03:35 -04:00
Lioncash	3fd5998d84	video_core/renderer_opengl: Remove unnecessary includes Quite a few unused includes have built up over time, particularly on core/memory.h. Removing these includes means the source files including those files will no longer need to be rebuilt if they're changed, making compilation slightly faster in this scenario.	2019-04-04 12:00:46 -04:00
Lioncash	a5fa4b311e	video_core: Amend constructor initializer list order where applicable Specifies the members in the same order that initialization would take place in. This also silences -Wreorder warnings.	2019-03-27 12:37:53 -04:00
bunnei	241563d15c	gpu: Move GPUVAddr definition to common_types.	2019-03-20 22:36:02 -04:00
bunnei	574e89d924	video_core: Refactor to use MemoryManager interface for all memory access. # Conflicts: # src/video_core/engines/kepler_memory.cpp # src/video_core/engines/maxwell_3d.cpp # src/video_core/morton.cpp # src/video_core/morton.h # src/video_core/renderer_opengl/gl_global_cache.cpp # src/video_core/renderer_opengl/gl_global_cache.h # src/video_core/renderer_opengl/gl_rasterizer_cache.cpp	2019-03-16 00:38:48 -04:00
bunnei	2eaf6c41a4	gpu: Use host address for caching instead of guest address.	2019-03-14 22:34:42 -04:00
ReinUsesLisp	2bdbb90af7	video_core: Assert on invalid GPU to CPU address queries	2019-02-03 04:58:40 -03:00
ReinUsesLisp	5933b3ea96	gl_stream_buffer: Use DSA for buffer management	2019-01-06 16:49:24 -03:00
Markus Wick	97f5c4ffd3	gl_rasterizer: Skip VB upload if the state is clean.	2018-11-17 14:28:54 +01:00
Lioncash	9046f764bf	rasterizer_cache: Remove reliance on the System singleton Rather than have a transparent dependency, we can make it explicit in the interface. This also gets rid of the need to put the core include in a header.	2018-11-08 06:16:38 -05:00
Frederic L	7a5eda5914	global: Use std::optional instead of boost::optional (#1578 ) * get rid of boost::optional * Remove optional references * Use std::reference_wrapper for optional references * Fix clang format * Fix clang format part 2 * Adressed feedback * Fix clang format and MacOS build	2018-10-30 00:03:25 -04:00
ReinUsesLisp	3e2380327a	gl_rasterizer: Implement quads topology	2018-10-04 00:03:44 -03:00
fearlessTobi	63c2e32e20	Port #4182 from Citra: "Prefix all size_t with std::"	2018-09-15 15:21:06 +02:00
Lioncash	8d685a29bc	gl_buffer_cache: Make GetHandle() a const member function GetHandle() internally calls GetHandle() on the stream_buffer instance, which is a const member function, so this can be made const as well.	2018-09-06 15:07:15 -04:00
Lioncash	14230fe2af	gl_buffer_cache: Remove unnecessary includes	2018-09-06 15:05:52 -04:00
Markus Wick	50a806ea67	renderer_opengl: Implement a buffer cache. The idea of this cache is to avoid redundant uploads. So we are going to cache the uploaded buffers within the stream_buffer and just reuse the old pointers. The next step is to implement a VBO cache on GPU memory, but for now, I want to check the overhead of the cache management. Fetching the buffer over PCI-E should be quite fast.	2018-09-05 08:03:50 +02:00

50 commits