<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>VR World &#187; x1k</title>
	<atom:link href="http://www.vrworld.com/tag/x1k/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.vrworld.com</link>
	<description></description>
	<lastBuildDate>Fri, 10 Apr 2015 07:54:22 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=4.1.1</generator>
	<item>
		<title>AMD&#8217;s Folding performance explained, future development revealed</title>
		<link>http://www.vrworld.com/2008/11/04/amd-folding-explained-future-reveale/</link>
		<comments>http://www.vrworld.com/2008/11/04/amd-folding-explained-future-reveale/#comments</comments>
		<pubDate>Tue, 04 Nov 2008 13:00:19 +0000</pubDate>
		<dc:creator><![CDATA[Theo Valich]]></dc:creator>
				<category><![CDATA[3D]]></category>
		<category><![CDATA[AMD]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[Graphics]]></category>
		<category><![CDATA[Hardware]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[3800]]></category>
		<category><![CDATA[4800]]></category>
		<category><![CDATA[ATI]]></category>
		<category><![CDATA[Folding]]></category>
		<category><![CDATA[GeForce]]></category>
		<category><![CDATA[GPGPU]]></category>
		<category><![CDATA[GTX280]]></category>
		<category><![CDATA[Nvidia]]></category>
		<category><![CDATA[Radeon]]></category>
		<category><![CDATA[x1900]]></category>
		<category><![CDATA[x1k]]></category>

		<guid isPermaLink="false">http://theovalich.wordpress.com/?p=304</guid>
		<description><![CDATA[<p>Following the article about Top graphics cards for Folding@Home, it seems that I managed to get some doors opened and receive answers  from the people ...</p>
<p>The post <a rel="nofollow" href="http://www.vrworld.com/2008/11/04/amd-folding-explained-future-reveale/">AMD&#8217;s Folding performance explained, future development revealed</a> appeared first on <a rel="nofollow" href="http://www.vrworld.com">VR World</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>Following the article about <a href="http://theovalich.wordpress.com/2008/10/24/why-nvidia-destroys-ati-in-folding-at-hom/" target="_blank">Top graphics cards for Folding@Home</a>, it seems that I managed to get some doors opened and receive answers  from the people closely involved with the project.<br />
I had that luck of being contacted by people who were or still are involved with the project, and thus their answers were quite interesting. Names will remain unrevealed, of course.;-) In order to keep the clarity of the article, I’ve dumbed down some items that came up in discussions  &#8211; I will try to keep it both technical and simple. Impossible task, I know.<br />
Onto the matter then &#8211; the reason for ATI’s problems lies in the fact that ATI had a client for several hardware generations. Going back to the beginning, Dr. Vijay Pande (head of the F@H project) and Mike Houston (GPGPU pioneer, now emplyoee of AMD) demonstrated Folding@Home client around two years ago, using ATi Radeon X1900 as a base for demonstration.<br />
<strong>The Problem<br />
</strong>And here lies the problem with current GPU client &#8211; ATI X1K hardware comes off with one big flaw &#8211; lack of local memory share between the shader units. As you probably know, Nvidia designed G80 and following GPUs with shaders in groups of 8 units, featuring cache in-between them. According to our sources, that cache issue that stop ATI from achieving greatness, because we heard claims that their VLIW shader arrangement works in &#8220;best in class&#8221; mode.</p>
<div id="attachment_305" style="width: 310px" class="wp-caption alignright"><a href="http://cdn.vrworld.com/wp-content/uploads/2008/11/gtx280-shaders.jpg" rel="lightbox-0"><img class="size-full wp-image-305" title="gtx280-shaders" src="http://cdn.vrworld.com/wp-content/uploads/2008/11/gtx280-shaders.jpg" alt="The reason for GeForce dominance lies in the purple bar - scratch cache" width="300" height="314" /></a><p class="wp-caption-text">The reason for GeForce dominance lies in the purple bar - scratch cache</p></div>
<p>Then again, problem in gaming with X1K and later R600 and RV670 was the relative lack of texture units (TMUs), and problem with GPGPU continued to be &#8211; local share. You now might be wondering what will happen if you don&#8217;t put that &#8220;scratch cache&#8221; in the GPU. What happens is that your CPU will be constantly polled, and this drags the performance down to the gutter.<br />
We heard a lot of technical details about that particular issue, and the difference in scaling between dual-core CPU and a quad-core one. All in all, quite interesting stuff. But there is one large point to be made: the reason why Nvidia is so successful with CUDA is the fact that Nvidia offered what companies needed (scratch cache, CUDA, math libraries), while ATI suffers from selecting Brook+ to be their bread and butter until OpenCL comes along.<br />
<strong>RV770 saves the day…or not?</strong><br />
The RV770 GPU, more known as Radeon 4800 series is a vast improvement over previous generations. GPGPU-wise, most important thing is introduction of local share, since every 10 shaders got their &#8220;slice of the pie&#8221;. But GPGPU is more complex field that just &#8220;here is the feature, we can all use it now&#8221;.<br />
Our sources repeatedly criticized Brook+, claiming it is not in sync with AMD&#8217;s own CTI and Stream SDK&#8217;s. Brook+ allegedly breaks &#8220;with new drivers, with old drivers&#8221;, &#8220;whatever can go wrong, it can&#8221; and so on.<br />
ATI&#8217;s hardware now has local share, but that support has to be hard-coded into Brooke+. AMD recently released 1.21 Beta Stream SDK featuring local share, but that same support has to come inside Brooke+ as well.<br />
<strong>The Solution: Q1&#8217;09</strong><br />
So, we have shown you the problem, and now the time is for the solution. ATI can&#8217;t fix the performance issue on previous-gen hardware, but it will solve multitude of issues on Radeon 4800 boards. The team at Stanford is taking some necessary steps to re-do the workflow and introduce local memory share. This could take months, so realistic goal is to have a new client coming in Q1&#8217;09.<br />
Once that Radeon 4870 gets fully utilized, those 800 shaders and 70% of theoretical value (700-800 GFLOPS instead of 1-1.2 TFLOPS) should be good enough for reaching the level of GTX280.</p>
<p>Next story update will bring some views and opinions from AMD folk.</p>
<p>The post <a rel="nofollow" href="http://www.vrworld.com/2008/11/04/amd-folding-explained-future-reveale/">AMD&#8217;s Folding performance explained, future development revealed</a> appeared first on <a rel="nofollow" href="http://www.vrworld.com">VR World</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vrworld.com/2008/11/04/amd-folding-explained-future-reveale/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Content Delivery Network via Amazon Web Services: CloudFront: cdn.vrworld.com

 Served from: www.vrworld.com @ 2015-04-10 16:36:09 by W3 Total Cache -->