Linuxydable/suyu

Author	SHA1	Message	Date
ReinUsesLisp	6481d91e4a	gl_buffer_cache: Copy to buffers created as STREAM_READ before downloading After marking buffers as resident, Nvidia's driver seems to take a slow path. To workaround this issue, copy to a STREAM_READ buffer and then call GetNamedBufferSubData on it. This is a temporary solution until we have asynchronous flushing.	2020-06-26 16:58:40 -03:00
ReinUsesLisp	32a2dcd415	buffer_cache: Use buffer methods instead of cache virtual methods	2020-06-24 02:36:14 -03:00
ReinUsesLisp	32485917ba	gl_buffer_cache: Mark buffers as resident Make stream buffer and cached buffers as resident and query their address. This allows us to use GPU addresses for several proprietary Nvidia extensions.	2020-06-24 02:36:14 -03:00
ReinUsesLisp	6508cdd003	buffer_cache: Avoid passing references of shared pointers and misc style changes Instead of using as template argument a shared pointer, use the underlying type and manage shared pointers explicitly. This can make removing shared pointers from the cache more easy. While we are at it, make some misc style changes and general improvements (like insert_or_assign instead of operator[] + operator=).	2020-06-09 18:30:49 -03:00
ReinUsesLisp	891236124c	buffer_cache: Use boost::intrusive::set for caching Instead of using boost::icl::interval_map for caching, use boost::intrusive::set. interval_map is intended as a container where the keys can overlap with one another; we don't need this for caching buffers and a std::set-like data structure that allows us to search with lower_bound is enough.	2020-05-21 16:44:00 -03:00
ReinUsesLisp	fe931ac976	{maxwell_3d,buffer_cache}: Implement memory barriers using 3D registers Drop MemoryBarrier from the buffer cache and use Maxwell3D's register WaitForIdle. To implement this on OpenGL we just call glMemoryBarrier with the necessary bits. Vulkan lacks this synchronization primitive, so we set an event and immediately wait for it. This is not a pretty solution, but it's what Vulkan can do without submitting the current command buffer to the queue (which ends up being more expensive on the CPU).	2020-04-28 02:18:12 -03:00
Fernando Sahmkow	131b342130	OpenGL: Guarantee writes to Buffers.	2020-04-22 11:36:18 -04:00
ReinUsesLisp	090fd3fefa	buffer_cache: Return handles instead of pointer to handles The original idea of returning pointers is that handles can be moved. The problem is that the implementation didn't take that in mind and made everything harder to work with. This commit drops pointer to handles and returns the handles themselves. While it is still true that handles can be invalidated, this way we get an old handle instead of a dangling pointer. This problem can be solved in the future with sparse buffers.	2020-04-16 02:33:34 -03:00
Fernando Sahmkow	7fcd0fee6d	Buffer Cache: Use vAddr instead of physical memory.	2020-04-06 09:23:06 -04:00
ReinUsesLisp	76ca2a5f82	gl_rasterizer: Upload constant buffers with glNamedBufferSubData Nvidia's OpenGL driver maps gl(Named)BufferSubData with some requirements to a fast. This path has an extra memcpy but updates the buffer without orphaning or waiting for previous calls. It can be seen as a better model for "push constants" that can upload a whole UBO instead of 256 bytes. This path has some requirements established here: http://on-demand.gputechconf.com/gtc/2014/presentations/S4379-opengl-44-scene-rendering-techniques.pdf#page=24 Instead of using the stream buffer, this commits moves constant buffers uploads to calls of glNamedBufferSubData and from my testing it brings a performance improvement. This is disabled when the vendor is not Nvidia since it brings performance regressions.	2019-11-02 05:05:34 -03:00
ReinUsesLisp	878adee0a3	gl_buffer_cache: Add missing include RasterizerInterface was considered an incomplete object by clang.	2019-08-29 22:02:52 +00:00
Fernando Sahmkow	6ce2c85047	Buffer_Cache: Implement flushing.	2019-08-21 12:14:26 -04:00
Fernando Sahmkow	862bec001b	Video_Core: Implement a new Buffer Cache	2019-08-21 12:14:22 -04:00
ReinUsesLisp	1fa21fa192	gl_buffer_cache: Implement with generic buffer cache	2019-07-06 00:37:55 -03:00
ReinUsesLisp	2bcae41a73	gl_buffer_cache: Remove global system getters	2019-07-06 00:37:55 -03:00
ReinUsesLisp	d14fbfb9b5	gl_buffer_cache: Implement flushing	2019-07-06 00:37:55 -03:00
ReinUsesLisp	345f852bdb	gl_rasterizer: Drop gl_global_cache in favor of gl_buffer_cache	2019-07-06 00:37:55 -03:00
ReinUsesLisp	8155b12d3d	gl_buffer_cache: Rework to support internalized buffers	2019-07-06 00:37:55 -03:00
ReinUsesLisp	f8ba72d491	gl_buffer_cache: Store in CachedBufferEntry the used buffer handle	2019-07-06 00:37:55 -03:00
ReinUsesLisp	b54fb8fc4c	gl_buffer_cache: Return used buffer from Upload function	2019-07-06 00:37:55 -03:00
Fernando Sahmkow	4705d1b523	rasterizer_cache: Protect inherited caches from submission level	2019-07-01 04:32:01 -04:00
ReinUsesLisp	6ac4490751	gl_buffer_cache: Remove unused ReserveMemory method	2019-05-30 13:21:01 -03:00
Lioncash	fbf452ab0e	video_core/texures/texture: Remove unnecessary includes Nothing in this header relies on common_funcs or the memory manager. This gets rid of reliance on indirect inclusions in the OpenGL caches.	2019-04-06 00:03:35 -04:00
Lioncash	3fd5998d84	video_core/renderer_opengl: Remove unnecessary includes Quite a few unused includes have built up over time, particularly on core/memory.h. Removing these includes means the source files including those files will no longer need to be rebuilt if they're changed, making compilation slightly faster in this scenario.	2019-04-04 12:00:46 -04:00
Lioncash	a5fa4b311e	video_core: Amend constructor initializer list order where applicable Specifies the members in the same order that initialization would take place in. This also silences -Wreorder warnings.	2019-03-27 12:37:53 -04:00
bunnei	241563d15c	gpu: Move GPUVAddr definition to common_types.	2019-03-20 22:36:02 -04:00
bunnei	574e89d924	video_core: Refactor to use MemoryManager interface for all memory access. # Conflicts: # src/video_core/engines/kepler_memory.cpp # src/video_core/engines/maxwell_3d.cpp # src/video_core/morton.cpp # src/video_core/morton.h # src/video_core/renderer_opengl/gl_global_cache.cpp # src/video_core/renderer_opengl/gl_global_cache.h # src/video_core/renderer_opengl/gl_rasterizer_cache.cpp	2019-03-16 00:38:48 -04:00
bunnei	2eaf6c41a4	gpu: Use host address for caching instead of guest address.	2019-03-14 22:34:42 -04:00
ReinUsesLisp	2bdbb90af7	video_core: Assert on invalid GPU to CPU address queries	2019-02-03 04:58:40 -03:00
ReinUsesLisp	5933b3ea96	gl_stream_buffer: Use DSA for buffer management	2019-01-06 16:49:24 -03:00
Markus Wick	97f5c4ffd3	gl_rasterizer: Skip VB upload if the state is clean.	2018-11-17 14:28:54 +01:00
Lioncash	9046f764bf	rasterizer_cache: Remove reliance on the System singleton Rather than have a transparent dependency, we can make it explicit in the interface. This also gets rid of the need to put the core include in a header.	2018-11-08 06:16:38 -05:00
Frederic L	7a5eda5914	global: Use std::optional instead of boost::optional (#1578 ) * get rid of boost::optional * Remove optional references * Use std::reference_wrapper for optional references * Fix clang format * Fix clang format part 2 * Adressed feedback * Fix clang format and MacOS build	2018-10-30 00:03:25 -04:00
ReinUsesLisp	3e2380327a	gl_rasterizer: Implement quads topology	2018-10-04 00:03:44 -03:00
fearlessTobi	63c2e32e20	Port #4182 from Citra: "Prefix all size_t with std::"	2018-09-15 15:21:06 +02:00
Lioncash	8d685a29bc	gl_buffer_cache: Make GetHandle() a const member function GetHandle() internally calls GetHandle() on the stream_buffer instance, which is a const member function, so this can be made const as well.	2018-09-06 15:07:15 -04:00
Lioncash	14230fe2af	gl_buffer_cache: Remove unnecessary includes	2018-09-06 15:05:52 -04:00
Markus Wick	50a806ea67	renderer_opengl: Implement a buffer cache. The idea of this cache is to avoid redundant uploads. So we are going to cache the uploaded buffers within the stream_buffer and just reuse the old pointers. The next step is to implement a VBO cache on GPU memory, but for now, I want to check the overhead of the cache management. Fetching the buffer over PCI-E should be quite fast.	2018-09-05 08:03:50 +02:00

38 commits