<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Christian's corner on HPC</title>
	<atom:link href="http://terboven.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://terboven.wordpress.com</link>
	<description>A Blog on Parallel Programming - covering all OSes :-) - by Christian Terboven.</description>
	<lastBuildDate>Thu, 29 Dec 2011 20:40:32 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='terboven.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Christian's corner on HPC</title>
		<link>http://terboven.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://terboven.wordpress.com/osd.xml" title="Christian&#039;s corner on HPC" />
	<atom:link rel='hub' href='http://terboven.wordpress.com/?pushpress=hub'/>
		<item>
		<title>On the future of HPC on Windows</title>
		<link>http://terboven.wordpress.com/2011/12/29/on-the-future-of-hpc-on-windows/</link>
		<comments>http://terboven.wordpress.com/2011/12/29/on-the-future-of-hpc-on-windows/#comments</comments>
		<pubDate>Thu, 29 Dec 2011 20:40:25 +0000</pubDate>
		<dc:creator>terboven</dc:creator>
				<category><![CDATA[Future of HPC]]></category>
		<category><![CDATA[Windows-HPC]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[PLINQ]]></category>
		<category><![CDATA[SC11]]></category>
		<category><![CDATA[Supercomputing]]></category>
		<category><![CDATA[Windows HPC Server]]></category>
		<category><![CDATA[Windows-HPC UG]]></category>

		<guid isPermaLink="false">http://terboven.wordpress.com/?p=10268</guid>
		<description><![CDATA[Just a few weeks ago during SC11 Microsoft released two new or updated HPC products, namely Windows Azure HPC Scheduler and Windows HPC Server 2008 R2 SP3. However, what I saw and heard during the last few months as well as during SC11 did not give me the best feeling for the future of Microsoft&#8217;s [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terboven.wordpress.com&amp;blog=5383873&amp;post=10268&amp;subd=terboven&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Just a few weeks ago during <a href="http://sc11.supercomputing.org" target="_blank">SC11</a> Microsoft released two new or updated HPC products, namely <em><a href="http://go.microsoft.com/fwlink/?LinkID=230449&amp;clcid=0x409" target="_blank">Windows Azure HPC Scheduler</a></em> and <a href="http://go.microsoft.com/fwlink/?LinkID=231891" target="_blank"><em>Windows HPC Server 2008 R2 SP3</em></a>. However, what I saw and heard during the last few months as well as during SC11 did not give me the best feeling for the future of Microsoft&#8217;s HPC Server product. This post is on my impressions and thoughts not only on the product, but also on doing HPC on the Windows platform in general.</p>
<p>What disturbed me a little was the absence of any roadmap presentation. Well, over the last few years Windows HPC Server clearly has become mature enough to not lack any significant feature necessary for deployment and use on a medium-sized HPC installation. However, Microsoft publicly outlining a product roadmap with several key features always felt right, and its absence at SC11 has been noted by the community. Furthermore, they quietly killed their Dryad project (including LINQ to HPC), which was prominently displayed at SC10, now betting on a yet-to-be-released distribution of <a href="http://hadoop.apache.org/" target="_blank">Apache Hadoop</a> for Windows HPC Server and Azure. Finally, there have been several business restructuring activities inside Microsoft. For example, here in Germany Microsoft apparently shut down the HPC group and moved (some of) the people under the Azure umbrella. From what I heard, all these activities caused some confusion in the community on how Microsoft sees the future of the Windows HPC Server product and how much support and innovation may be expected from the company in this regard.</p>
<p>What Microsoft now talks a lot about is the Azure integration. If you followed the development of Windows HPC Server up to release R2 SP3, you could clearly see this coming. From a technology point of view, I am impressed. However, I am not convinced yet, for several reasons &#8211; the most important one being that the offer is much too expensive for our application needs. Of course we are following what is going on regarding Clouds and HPC, and in fact in one project we are extending one application to make use of both on-premise and off-premise compute power based on availability (and maybe even price). But for the time being, our local clusters, including the one running Windows, will clearly dominate (or, as we Germans say, set the tone).</p>
<p>Finally, I am missing a clear picture of HPC-related improvements in the Windows Server roadmap. Just recently we added a frontend system with 160 (logical) cores, that is, 8 sockets, and 512 GB of memory. Windows just works on such a machine &#8211; but it could do better. It could serve HPC applications better. And given that next-gen ordinary (HPC) systems will probably have a similar core count, Windows really has to serve applications better on such machines in order to stay competitive. Furthermore, smooth and stable integration of accelerators &#8211; be it GPGPUs, or something different but similar in spirit &#8211; will be at least as important.</p>
<div id="attachment_10275" class="wp-caption aligncenter" style="width: 310px"><a href="http://terboven.files.wordpress.com/2011/12/windows_8_socket_many_cores.jpg"><img class="size-medium wp-image-10275" title="Windows Task-Manager with 160 cores (8 sockets)" src="http://terboven.files.wordpress.com/2011/12/windows_8_socket_many_cores.jpg?w=300&#038;h=277" alt="Windows Task-Manager with 160 cores (8 sockets)" width="300" height="277" /></a><p class="wp-caption-text">Windows Task-Manager with 160 cores (8 sockets)</p></div>
<p>I will stop here. Our user base is clearly showing a demand for Windows HPC Server-based clusters, and in fact the demand is growing. Trying to combine my personal opinion with the feedback and opinions I got from the (German) community, Microsoft has to improve the communication regarding Windows HPC Server. It is time for a clear statement regarding the future of the product and the direction it will take.</p>
]]></content:encoded>
			<wfw:commentRss>http://terboven.wordpress.com/2011/12/29/on-the-future-of-hpc-on-windows/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">terboven</media:title>
		</media:content>

		<media:content url="http://terboven.files.wordpress.com/2011/12/windows_8_socket_many_cores.jpg?w=300" medium="image">
			<media:title type="html">Windows Task-Manager with 160 cores (8 sockets)</media:title>
		</media:content>
	</item>
		<item>
		<title>OpenMP and OpenACC</title>
		<link>http://terboven.wordpress.com/2011/11/16/openmp-and-openacc/</link>
		<comments>http://terboven.wordpress.com/2011/11/16/openmp-and-openacc/#comments</comments>
		<pubDate>Wed, 16 Nov 2011 21:43:56 +0000</pubDate>
		<dc:creator>terboven</dc:creator>
				<category><![CDATA[Future of HPC]]></category>
		<category><![CDATA[OpenMP]]></category>
		<category><![CDATA[Accelerator]]></category>
		<category><![CDATA[OpenACC]]></category>
		<category><![CDATA[SC11]]></category>
		<category><![CDATA[Supercomputing]]></category>

		<guid isPermaLink="false">http://terboven.wordpress.com/?p=10263</guid>
		<description><![CDATA[If you attended SC11, you might have noticed some buzz around OpenACC. Well, at least I did. For example, today&#8217;s OpenMP BOF had some information on this. I want to use this blog post to add some general comments and insights on the developments and direction of the OpenMP language committee as well as what [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terboven.wordpress.com&amp;blog=5383873&amp;post=10263&amp;subd=terboven&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>If you attended SC11, you might have noticed some buzz around <a href="http://www.openacc-standard.org/" target="_blank">OpenACC</a>. Well, at least I did. For example, today&#8217;s OpenMP BOF had some information on this. I want to use this blog post to add some general comments and insights on the developments and direction of the OpenMP language committee as well as what has led to OpenACC. As always, you have to understand that these statements are mine only; on this blog I do not speak in any official role.</p>
<p>For quite a while now, OpenMP has been moving into the accelerator space, with the work done by the <em>OpenMP for Accelerators</em> subcommittee of the OpenMP Language Committee. That subcommittee publicly presented the status of their work at the last IWOMP, where James Beyer et al. had a paper on that particular topic (<a href="http://www.ncsa.illinois.edu/Conferences/IWOMP11/program/presentations/beyer.pdf" target="_blank">PDF of their presentation</a>). They have invested a lot of effort and made good progress since then. In order to make support for accelerators happen in OpenMP, they have to achieve three goals: (i) provide support for Slicing and Shaping expressions, (ii) provide support for data management constructs and clauses, and finally (iii) provide support to denote kernels and constructs for execution on the accelerator. For all three items the subcommittee looked at existing other proposals, particularly from <a href="http://www.pgroup.com/resources/accel.htm" target="_blank">PGI</a>, <a href="http://pm.bsc.es/" target="_blank">BSC</a> and <a href="http://www.caps-entreprise.com/hmpp.html" target="_blank">CAPS</a>, but also from others. There are good proposals underway for (i) and (ii) which probably are backed by a majority in the language committee, since this functionality may turn out to be very handy to drive other features and proposals as well. Just as an example, we are aiming for improved support for Affinity of threads and data, which requires Slicing and Shaping of array expressions.</p>
<p>However, support for (iii) is really tough if one wants to integrate well with the rest of OpenMP and allow for future extensions. An important design goal is that OpenMP will support not just one particular type of accelerator, but rather be widely applicable to different kinds of devices from different vendors. These are the reasons why OpenMP is developing as slowly as it is. We are planning for a public draft of OpenMP 4.0 for SC12, one year from now.</p>
<p>In order to allow for faster development, ignoring the OpenMP integration just for a moment, the OpenACC standard initiative was formed; it basically is a spin-off of the OpenMP Language Committee. Personally, I see this as a <em>beta</em> of OpenMP for Accelerators, and I hope that this initiative will help to collect valuable feedback on what pragma-based accelerator programming has to look like. Cray, PGI and CAPS all have announced that they will implement the specification as it currently stands. When it comes to getting the resources for that, it is much easier to implement this spin-off spec than an incomplete proposal draft. This is what I like the OpenACC effort for. And by the way, it was prominently promoted during the NVIDIA keynote at SC11 on Tuesday morning.</p>
<p>However, what I do not like is how it was marketed. People did not get the relation to OpenMP. The way it was published, it was not clear that effort from other parties was involved in the development as well, not just the ones mentioned on the website. In fact, many people who visited the booth thought that OpenACC is about to become a competitor for OpenMP in the accelerator domain. This is not true; it is clearly the intent to feed the OpenACC development back into the next OpenMP specification. We clearly hope to release a draft in the SC12 time frame, but until then we have several technical problems to solve.</p>
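<p>For readers who have not seen pragma-based accelerator programming yet, here is a minimal sketch of what it can look like, assuming OpenACC 1.0 directive syntax. The function <em>saxpy</em> and its parameter names are hypothetical and chosen for illustration only; this is not taken from the OpenMP for Accelerators proposal or from any vendor material. The array sections <em>x[0:n]</em> and <em>y[0:n]</em> describe the shape of the data, the <em>copyin</em> and <em>copy</em> clauses handle the data management, and <em>parallel loop</em> marks the loop for execution on the accelerator, which roughly mirrors the three goals (i) to (iii) listed above.</p>

```cpp
// Illustrative sketch only: saxpy and its parameters are hypothetical names.
// Computes y = a * x + y for n elements.
void saxpy(int n, float a, const float *x, float *y) {
    // x[0:n] / y[0:n]: array sections (start : length) giving the data shape.
    // copyin: x is copied to the device before the loop runs.
    // copy:   y is copied to the device and back to the host afterwards.
    // parallel loop: the loop iterations are executed on the accelerator.
    #pragma acc parallel loop copyin(x[0:n]) copy(y[0:n])
    for (int i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}
```

<p>A compiler without OpenACC support typically ignores the directive (possibly with a warning) and runs the loop serially on the host, so the same source remains valid C/C++ either way.</p>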
]]></content:encoded>
			<wfw:commentRss>http://terboven.wordpress.com/2011/11/16/openmp-and-openacc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">terboven</media:title>
		</media:content>
	</item>
		<item>
		<title>Dan Reed on Technical (Cloud) Computing with Microsoft: Vision</title>
		<link>http://terboven.wordpress.com/2011/07/04/dan-reed-on-technical-cloud-computing-the-vision/</link>
		<comments>http://terboven.wordpress.com/2011/07/04/dan-reed-on-technical-cloud-computing-the-vision/#comments</comments>
		<pubDate>Mon, 04 Jul 2011 20:16:17 +0000</pubDate>
		<dc:creator>terboven</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Future of HPC]]></category>
		<category><![CDATA[Windows-HPC]]></category>
		<category><![CDATA[.NET]]></category>
		<category><![CDATA[Azure]]></category>
		<category><![CDATA[Exascale]]></category>
		<category><![CDATA[ISC2011]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[MPI]]></category>
		<category><![CDATA[Supercomputing]]></category>
		<category><![CDATA[Teaching]]></category>
		<category><![CDATA[Windows HPC Server]]></category>
		<category><![CDATA[Windows-HPC UG]]></category>

		<guid isPermaLink="false">http://terboven.wordpress.com/?p=10253</guid>
		<description><![CDATA[During ISC 2011 in Hamburg I got the opportunity to talk to Microsoft&#8217;s Dan Reed, Corporate Vice President, Technology Policy and Extreme Computing Group. It was a very nice discussion that soon turned towards HPC in the Cloud, touching the topics of Microsoft&#8217;s Vision, Standards, and Education. Karsten Reineck from the Fraunhofer SCAI was also present, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terboven.wordpress.com&amp;blog=5383873&amp;post=10253&amp;subd=terboven&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>During ISC 2011 in Hamburg I got the opportunity to talk to Microsoft&#8217;s Dan Reed, Corporate Vice President, Technology Policy and Extreme Computing Group. It was a very nice discussion that soon turned towards HPC in the Cloud, touching the topics of <em>Microsoft&#8217;s Vision</em>, <em>Standards</em>, and <em>Education</em>. Karsten Reineck from the Fraunhofer SCAI was also present; he already put an excerpt of the interview on his <a href="http://reineck.wordpress.com/2011/06/22/interview-mit-dan-reed-vice-president-microsoft-auf-der-isc%E2%80%9911-in-hamburg/" target="_blank">blog</a> (in German). The following is my recapitulation of the discussion, pointing out his most important statements &#8211; part 1 of 2.</p>
<p>Being the person I am, I started the talk with a nasty question on the pricing scheme of Azure (and similar commercial offerings), claiming that it is pretty expensive both per CPU hour as well as per byte of I/O. Just recently we did a full cost accounting to calculate our price per CPU hour for our HPC service, and we found ourselves to be cheaper by a notable factor.</p>
<p><span style="text-decoration:underline;">Dan Reed</span>: Academic sites of reasonable size, such as yours, can do HPC cheaper because they are utilizing the hardware on a 24&#215;7 basis. Traditionally, they do not offer service-level agreements on how fast any job starts, they just queue the jobs. Azure is different, and it has to be: one can get the resources available in a guaranteed time frame. As of today, HPC in the Cloud is interesting for burst scenarios where the on-premise resources are not sufficient, or for people for whom traditional HPC is too complex (regardless of Windows vs. Linux, just maintaining an on-premise cluster versus buying HPC time when it is needed).</p>
<p>I am completely in line with that. I expressed my belief that we will need (and have!) academic HPC centers for the foreseeable future. Basically, we are just a (local) HPC cloud service provider for our users &#8211; which of course we call customers, internally. To conclude this topic, he said something very interesting:</p>
<p><span style="text-decoration:underline;">Dan Reed</span>: In industry, the cost is not the main constraint, the skill is.</p>
<p>Ok, since we are offering HPC services on Linux and Windows, and since there was quite some buzz around the future of the Windows HPC Server product during ISC, I asked where the Windows HPC Server product is heading in the future.</p>
<p><span style="text-decoration:underline;">Dan Reed</span>: The foremost goal is to better integrate and support cloud issues. For example, currently, there are two schedulers, the Azure scheduler and the traditional Windows HPC Server scheduler. Basically, that is one scheduler too many. Regarding improvements in Azure, we will see support for high-speed interconnects soon.</p>
<p>Azure support for MPI programs has just been introduced with Windows HPC Server 2008 R2 SP2 (a long product name, hm?). By the way, he assumes that future x GigaBit Ethernet will be favoured over InfiniBand.</p>
<p>For us it is clearly interesting to see where Azure, and other similar offerings, are heading, and we can learn something from that for our own HPC service. For example, we already offer service-level agreements for some customers under some circumstances. However, on-premise resources will play the dominating role for academic HPC in the foreseeable future. Thus I am interested in the future of the product and asked specifically about the future of the Windows HPC Server.</p>
<p><span style="text-decoration:underline;">Dan Reed</span>: Microsoft, as a company, is strongly committed to a service-based business model. This has to be understood in order to realize what is driving some of the shifts we are seeing right now, both in the products and the organization itself. The focus on Cloud Computing elevated the HPC Server team; the Technical Computing division is now part of the Azure organization. The emphasis of the future product development thus is clearly shifting towards cloud computing, that is true, although the product will continue to be improved and features will be added for a few releases (already in planning).</p>
<p>Well, as an MVP for Windows HPC Server, and a member of the Customer Advisory Board, I know something about the planning of upcoming product releases, so I believe Microsoft is still committed to the product (as opposed to some statements made by other people during ISC). However, I do not see the Windows Server itself moving in the right direction for HPC. Obviously HPC is just a niche market for Microsoft, but better support for multi- and many-core processors and hierarchical memory architectures (NUMA!) would be desirable. Asking (again) about that, I got the following answer:</p>
<p><span style="text-decoration:underline;">Dan Reed</span>: Windows HPC Server is derived from Windows Server, which itself is derived from Windows. So, if you want to know where Windows HPC Server is going with regard to its base technologies, you have to see (and understand) where Windows itself is going.</p>
<p>Uhm, ok, so we better take a close look at Windows 8 <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> . Regarding Microsoft&#8217;s way towards Cloud Computing, I will write a second blog post later to cover more of our discussion on the topics of <em>Standards</em> and <em>Education</em>. Since this blog post is on the <em>Vision</em>, I just want to share a brief discussion we had when heading back to the ISC show floor. I asked him for his personal (!) opinion on the race towards Exascale. Will we get an Exascale system by (the end of) 2019?</p>
<p><span style="text-decoration:underline;">Dan Reed</span>: Given the political will and money, we will overcome the technical issues we are facing today.</p>
<p>Ok. Given that someone has that will and the money, would such a system be usable? Do you see any single application for such a system?</p>
<p><span style="text-decoration:underline;">Dan Reed</span>: Big question mark. I would rather see money being invested in solving the software issues. If we get such powerful systems, we have to be able to make use of them for more than just a single project.</p>
<p>Again, I am pretty much in line with that. By no means am I claiming to fully understand all challenges and opportunities of Exascale systems, but what I do see are the challenges to make use of today&#8217;s Petaflop systems with applications other than LINPACK, especially from the domain of Computational Engineering. Taking the opportunity, my last question was: Who do you guess would have the political will and the money to build an Exascale system first, the US, or Europe, or rather Asia?</p>
<p><span style="text-decoration:underline;">Dan Reed</span>: Uhm. If I had to bet, I would bet on Asia. And if such a system comes from Asia, all critical system components will be designed and manufactured in Asia.</p>
<p>Interesting. And clearly a challenge.</p>
]]></content:encoded>
			<wfw:commentRss>http://terboven.wordpress.com/2011/07/04/dan-reed-on-technical-cloud-computing-the-vision/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">terboven</media:title>
		</media:content>
	</item>
		<item>
		<title>An Update on Building and Using BOOST.MPI on Windows HPC Server 2008 R2</title>
		<link>http://terboven.wordpress.com/2011/05/13/an-update-on-building-and-using-boost-mpi-on-windows-hpc-server-2008-r2/</link>
		<comments>http://terboven.wordpress.com/2011/05/13/an-update-on-building-and-using-boost-mpi-on-windows-hpc-server-2008-r2/#comments</comments>
		<pubDate>Fri, 13 May 2011 08:09:31 +0000</pubDate>
		<dc:creator>terboven</dc:creator>
				<category><![CDATA[Windows-HPC]]></category>
		<category><![CDATA[BOOST.MPI]]></category>
		<category><![CDATA[MPI]]></category>
		<category><![CDATA[Visual Studio]]></category>
		<category><![CDATA[Windows HPC Server]]></category>

		<guid isPermaLink="false">http://terboven.wordpress.com/?p=10229</guid>
		<description><![CDATA[My 2008 blog post on Building and Using BOOST.MPI on Windows HPC Server 2008 still generates quite some traffic. Since some things have changed since then, I thought it could help those visitors to provide an updated howto. Again, this post puts the focus on building boost.mpi with various versions of MS-MPI, and does not [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terboven.wordpress.com&amp;blog=5383873&amp;post=10229&amp;subd=terboven&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>My 2008 blog post on <a href="http://terboven.wordpress.com/2008/09/09/building-and-using-boostmpi-on-windows-hpc-server-2008/">Building and Using BOOST.MPI on Windows HPC Server 2008</a> still generates quite some traffic. Since some things have changed since then, I thought it could help those visitors to provide an updated howto. Again, this post puts the focus on building boost.mpi with various versions of MS-MPI, and does not cover all aspects of building boost on Windows (go to <a href="http://www.boost.org/doc/libs/1_46_1/more/getting_started/windows.html" target="_blank">Getting Started on Windows</a> for that).</p>
<p>The problem that still remains is that the MPI auto-configuration only looks for MS-MPI v1, which came with the Compute Cluster Pack and was typically installed to the directory <em>C:\Program Files\Microsoft Compute Cluster Pack</em>. MS-MPI v2, which comes with the Microsoft HPC Pack 2008 [R2], is typically installed to the directory <em>C:\Program Files\Microsoft HPC Pack 2008 [R2] SDK</em>, but the auto-configuration does not examine these directories. In the old post I explained where to change the path the auto-configurator is looking at. Of course, this is not what one expects from an &#8220;auto&#8221;-configuration tool. Extending the <em>mpi.jam</em> file to search all possible standard directories where MS-MPI might be installed turned out to be pretty simple. You can download my <a href="http://www.terboven.com/public/mpi_jam.zip" target="_blank">modified mpi.jam for boost 1.46.1 supporting MS-MPI v1 and v2</a> and replace the <em>mpi.jam</em> file that comes with the boost package. As a summary, below are the basic steps to build boost with boost.mpi on Windows (HPC) Server 2008 using Visual Studio and MS-MPI.</p>
<ol>
<li>Download <a href="http://www.boost.org/users/download/" target="_blank">boost 1.46.1</a> (82 MB), which is the most current version at the time of this writing (May 13th, 2011).</li>
<li>Extract the archive. For the rest of the instructions I will assume <em>X:\src.boost_1_46_1</em> as the directory the archive has been extracted into.</li>
<li>Open a Visual Studio command prompt from the <em>Visual Studio Tools</em> submenu. Depending on what you intend to build, you have to use the 32-bit or 64-bit compiler environment. Execute all commands listed in the rest of the instructions from within this command prompt.</li>
<li>Run <em>bootstrap.bat</em>. This will build <em>bjam.exe</em>.</li>
<li>Modify the <em>mpi.jam</em> file located in the <em>tools\build\v2\tools</em> subdirectory to search for MS-MPI in the right place, or use my <a href="http://www.terboven.com/public/mpi_jam.zip" target="_blank">modified mpi.jam for boost 1.46.1 supporting MS-MPI v1 and v2</a> instead.</li>
<li>Edit the <em>user-config.jam</em> file located in the <em>tools\build\v2</em> subdirectory to contain the following line: <em>using mpi ;</em>.</li>
<li>Execute the following command to start the build and installation process: <em>bjam.exe --build-dir=x:\src.boost_1_46_1\build\vs90-64 --prefix=x:\boost_1_46_1\vs90-64 install</em>. Please note that I use different directories in the <em>--build-dir</em> and <em>--prefix</em> options, since I intend to remove the <em>X:\src.boost_1_46_1</em> directory once boost is installed. Especially a debug build may use a significant amount of disk storage.</li>
<li>Wait&#8230;</li>
<li>There are several other options that you might want to explore, but in many cases the default does just fine. Using the command line from above, on Windows you will get static multi-threaded libraries in debug and release mode using the shared runtime. On Windows, the default toolset is msvc, which is the Visual Studio compiler. You can change that via the <em>toolset=xxx</em> option, for example insert <em>toolset=intel</em> into the command line above just before <em>install</em> if you want to build using the Intel compilers.</li>
</ol>
<p>Since it is inconvenient to change <em>mpi.jam</em> whenever you are going to build a new version of boost, I <a href="https://svn.boost.org/trac/boost/ticket/5531" target="_blank">filed a bug report on this</a> and proposed to extend the search path to include MS-MPI v2 locations as well.</p>
<p>In order to use this build of boost, in your projects you have to add <em>X:\boost_1_46_1\vs90-32\include\boost-1_46_1</em> to the list of include directories, and <em>X:\boost_1_46_1\vs90-32\lib</em> to the list of library directories (all according to the directory scheme I used above). In your code you do <em>#include &lt;boost/mpi.hpp&gt;</em>. The boost header files contain directives to link the correct boost libraries automatically, but of course you have to link with the MS-MPI library you used to build boost with.</p>
]]></content:encoded>
			<wfw:commentRss>http://terboven.wordpress.com/2011/05/13/an-update-on-building-and-using-boost-mpi-on-windows-hpc-server-2008-r2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">terboven</media:title>
		</media:content>
	</item>
		<item>
		<title>HPC Server 2008 R2 Failover Cluster deployment guide</title>
		<link>http://terboven.wordpress.com/2011/05/12/hpc-server-2008-r2-failover-deployment-guide/</link>
		<comments>http://terboven.wordpress.com/2011/05/12/hpc-server-2008-r2-failover-deployment-guide/#comments</comments>
		<pubDate>Thu, 12 May 2011 09:40:06 +0000</pubDate>
		<dc:creator>terboven</dc:creator>
				<category><![CDATA[Windows-HPC]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Windows HPC Server]]></category>
		<category><![CDATA[Windows-HPC UG]]></category>

		<guid isPermaLink="false">http://terboven.wordpress.com/?p=10232</guid>
		<description><![CDATA[I just learned about a Deployment of HPC Server 2008 R2 failover cluster, including a detailed step-by-step guide on how to configure head node failover and remote database installation to employ a SQL server failover cluster. This document has been provided by Microsoft, enjoy.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terboven.wordpress.com&amp;blog=5383873&amp;post=10232&amp;subd=terboven&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I just learned about a <a href="http://terboven.files.wordpress.com/2011/05/hpc_failover_guide.pdf">Deployment of HPC Server 2008 R2 failover cluster</a>, including a detailed step-by-step guide on how to configure head node failover and remote database installation to employ a SQL server failover cluster. This document has been provided by Microsoft, enjoy.</p>
]]></content:encoded>
			<wfw:commentRss>http://terboven.wordpress.com/2011/05/12/hpc-server-2008-r2-failover-deployment-guide/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">terboven</media:title>
		</media:content>
	</item>
		<item>
		<title>Recap of the 4th Meeting of the German Windows-HPC User Group</title>
		<link>http://terboven.wordpress.com/2011/04/19/recap-of-the-4th-meeting-of-the-german-windows-hpc-user-group/</link>
		<comments>http://terboven.wordpress.com/2011/04/19/recap-of-the-4th-meeting-of-the-german-windows-hpc-user-group/#comments</comments>
		<pubDate>Tue, 19 Apr 2011 07:59:18 +0000</pubDate>
		<dc:creator>terboven</dc:creator>
				<category><![CDATA[Private]]></category>
		<category><![CDATA[Windows-HPC]]></category>
		<category><![CDATA[Windows HPC Server]]></category>
		<category><![CDATA[Windows-HPC UG]]></category>

		<guid isPermaLink="false">http://terboven.wordpress.com/?p=10225</guid>
		<description><![CDATA[The 4th Meeting of the German Windows-HPC User Group took place on March 31st and April 1st in Karlsruhe, hosted by the Karlsruhe Institute of Technology (KIT). The event was attended by over 70 participants from Industry and Academia. This event was sponsored by Bull, COMSOL, EMCL @ KIT, Intel, Microsoft and NVIDIA. After [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terboven.wordpress.com&amp;blog=5383873&amp;post=10225&amp;subd=terboven&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://www.rz.rwth-aachen.de/go/id/teb/?lang=de" target="_blank">4th Meeting of the German Windows-HPC User Group</a> took place on March 31st and April 1st in Karlsruhe, hosted by the <a href="http://www.kit.edu/" target="_blank">Karlsruhe Institute of Technology (KIT)</a>. The event was attended by over 70 participants from Industry and Academia. This event was sponsored by <a href="http://www.bull.de/" target="_blank">Bull</a>, <a href="http://www.comsol.de/" target="_blank">COMSOL</a>, <a href="http://www.emcl.kit.edu/index.php" target="_blank">EMCL @ KIT</a>, <a href="http://www.intel.com/index.htm?de_DE_03" target="_blank">Intel</a>, <a href="http://www.microsoft.com/germany/hpc/" target="_blank">Microsoft</a> and <a href="http://www.nvidia.de/page/home.html" target="_blank">NVIDIA</a>.</p>
<p>After a brief welcome address by the organizers (Wolfgang Dreyer from Microsoft and myself), Rudolf Lohner (KIT) gave an overview of the Steinbuch Centre for Computing (SCC) at the KIT. He was followed by the keynote speech from Microsoft, given by Xavier Pillons (Microsoft Corporation) on <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvon" target="_blank">Windows HPC Server 2008 R2 and Azure</a> as well as <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvok" target="_blank">Dryad/DryadLinq</a>. We specifically asked for these two topics, and it turned out that Cloud Computing as well as Data-intensive Computing were the subject of many discussions during this event. After that, Axel Köhler (now NVIDIA) gave a glimpse into the <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvwh" target="_blank">current HPC developments at NVIDIA</a>, including what a pure accelerator-driven supercomputer might look like. He was followed by Dagmar Kremer (BCC), who presented their solution for <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvom" target="_blank">real-time super-computing on the desktop using Excel</a>. This topic was also on the agenda by popular demand, and apparently the combination of the two keywords &#8220;Excel&#8221; and &#8220;HPC&#8221; attracts a lot of interest. The first day was closed by Achim Streit (KIT), who gave his vision on <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvqs" target="_blank">HPC and the Cloud</a>, outlining current projects around HPC as a Service (HPCaaS) for technical computing.</p>
<p>The evening event took place in the <a href="http://www.zetkaem.de/" target="_blank">ZetKaeM restaurant</a>, after touring the Media Museum, the world&#8217;s first and only museum for interactive art. We all experienced some funny exhibits <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> . Such an evening event serves the purpose of a user group well &#8211; leading to discussions and an exchange of thoughts over a good glass of wine.</p>
<p>The second day started with a keynote address from Vincent Heuveline (KIT) on <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdwgw" target="_blank">HPC and hardware-aware computing at the EMCL @ KIT</a>. He was followed by Joachim Redmber (Bull), presenting the <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvoo" target="_blank">Bull way of Supercomputing</a>. Representing a Windows-HPC user, Shiqing Fan (HLRS) outlined their work on <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvwg" target="_blank">implementing and integrating OpenMPI with Windows HPC environments</a>, and apparently they can outperform Microsoft MPI in some benchmarks. Horst Schwichtenberg (Fraunhofer SCAI) gave an example of Excel HPC integration via WCF. As another user contribution, Stefan Truthähn and Martin Steinert (both hhpberlin Ingenieure für Brandschutz GmbH) gave a vivid talk on how they came to use Windows-HPC and HPC in general (more by accident than by master plan <img src='http://s1.wp.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />  ) and how they see the future of their <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvol" target="_blank">CFD computations on on-premise as well as Cloud HPC offerings</a>. They were followed by Michael Klemm (Intel), giving an overview of <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvoi" target="_blank">Intel Technology for HPC on Windows</a>. Henrik Nordborg (University of Applied Sciences in Rapperswil) from the Microsoft Technical Computing Innovation Center (MICTC) outlined where he sees an <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvoj" target="_blank">increasing demand for expertise in technical computing</a> (and why), and gave a report on the first activities of the MICTC.
The second and final day of the meeting was closed by a talk given by Michael Wirtz and myself on our experience and setting for <a href="http://www.rz.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaaacdvop" target="_blank">Windows-HPC for 1000+ users</a>.</p>
<p>All in all, I think this meeting was successful, and so far we have received positive feedback from the attendees. We plan to have the next meeting in March or April 2012 at a yet-to-be-decided location.</p>
]]></content:encoded>
			<wfw:commentRss>http://terboven.wordpress.com/2011/04/19/recap-of-the-4th-meeting-of-the-german-windows-hpc-user-group/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">terboven</media:title>
		</media:content>
	</item>
		<item>
		<title>Upcoming Events in March 2011</title>
		<link>http://terboven.wordpress.com/2011/03/09/upcoming-events-in-march-2011/</link>
		<comments>http://terboven.wordpress.com/2011/03/09/upcoming-events-in-march-2011/#comments</comments>
		<pubDate>Wed, 09 Mar 2011 11:00:21 +0000</pubDate>
		<dc:creator>terboven</dc:creator>
				<category><![CDATA[OpenMP]]></category>
		<category><![CDATA[Private]]></category>
		<category><![CDATA[University]]></category>
		<category><![CDATA[Windows-HPC]]></category>
		<category><![CDATA[MPI]]></category>
		<category><![CDATA[Teaching]]></category>
		<category><![CDATA[Windows-HPC UG]]></category>

		<guid isPermaLink="false">http://terboven.wordpress.com/?p=10221</guid>
		<description><![CDATA[Let me point you to some HPC events in March 2011. 3rd Parallel Programming in Computational Engineering and Science (PPCES) Workshop. This event will continue the tradition of previous annual week-long events taking place in Aachen every spring since 2001, this year from March 21st to March 25th. This year, the agenda is &#8211; as [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terboven.wordpress.com&amp;blog=5383873&amp;post=10221&amp;subd=terboven&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Let me point you to some HPC events in March 2011.</p>
<p><strong>3rd Parallel Programming in Computational Engineering and Science (PPCES) Workshop.</strong> This event will continue the tradition of previous annual week-long events taking place in Aachen every spring since 2001, this year from March 21st to March 25th. This year, the agenda is &#8211; as always &#8211; a little different from the previous one. Beginning with a series of overview presentations on Monday afternoon, we are very happy to announce the upcoming RWTH Compute Cluster to be delivered by Bull. Throughout the week, we will cover serial and parallel programming using OpenMP and MPI in Fortran and C / C++ as well as performance tuning, addressing both Linux and Windows platforms. Due to the positive experience of last year, we are happy to present a renowned speaker to give an introduction to GPGPU architectures and programming on Friday: Michael Wolfe from PGI. All further information can be found at the event website: <a href="http://www.rz.rwth-aachen.de/ppces" target="_blank">http://www.rz.rwth-aachen.de/ppces</a>.</p>
<p><strong>4th Meeting of the German Windows-HPC User Group.</strong> The <a href="http://www.rz.rwth-aachen.de/go/id/teb/?lang=de" target="_blank">fourth meeting of the German Windows HPC User Group</a> will take place in Karlsruhe on March 31st and April 1st, kindly hosted by the KIT. As in the previous years, we will learn about and discuss Microsoft&#8217;s current and future products, and users will present their (good and not so good) experiences in doing HPC on Windows. This year, we will have an Expert Discussion Panel for which the audience is invited to ask (tough) questions to fire up the discussion.</p>
]]></content:encoded>
			<wfw:commentRss>http://terboven.wordpress.com/2011/03/09/upcoming-events-in-march-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">terboven</media:title>
		</media:content>
	</item>
		<item>
		<title>RWTH Aachen gets a new 300 Teraflops HPC system from Bull</title>
		<link>http://terboven.wordpress.com/2011/02/15/rwth-aachen-gets-300-tflops-from-bull/</link>
		<comments>http://terboven.wordpress.com/2011/02/15/rwth-aachen-gets-300-tflops-from-bull/#comments</comments>
		<pubDate>Tue, 15 Feb 2011 08:21:35 +0000</pubDate>
		<dc:creator>terboven</dc:creator>
				<category><![CDATA[University]]></category>
		<category><![CDATA[MPI]]></category>
		<category><![CDATA[OpenMP]]></category>
		<category><![CDATA[Supercomputing]]></category>

		<guid isPermaLink="false">http://terboven.wordpress.com/?p=10218</guid>
		<description><![CDATA[While I usually do not repeat press releases in my blog, this one I do since we all are a little proud of the achievement: RWTH Aachen University orders Bull supercomputer to support its scientific, industrial and environmental research. Getting this system was a lot of work, and preparing for it still is. The compute [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terboven.wordpress.com&amp;blog=5383873&amp;post=10218&amp;subd=terboven&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>While I usually do not repeat press releases in my blog, this one I do since we all are a little proud of the achievement: <a href="http://www.wcm.bull.com/internet/pr/new_rend.jsp?DocId=634453&amp;lang=en" target="_blank">RWTH Aachen University orders Bull supercomputer to support its scientific, industrial and environmental research</a>. Getting this system was a lot of work, and preparing for it still is. The compute power of that machine totals 300 Teraflops. The focus of our center is not just to run this machine, but to provide HPC-specific support and to ensure efficient operation. We are confident that in Bull we found a competent partner to investigate these and other topics in close collaboration.</p>
]]></content:encoded>
			<wfw:commentRss>http://terboven.wordpress.com/2011/02/15/rwth-aachen-gets-300-tflops-from-bull/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">terboven</media:title>
		</media:content>
	</item>
		<item>
		<title>OpenMP 3.1 spec published as Draft for Public Comment</title>
		<link>http://terboven.wordpress.com/2011/02/09/openmp-3-1-released-as-draft-for-public-comment/</link>
		<comments>http://terboven.wordpress.com/2011/02/09/openmp-3-1-released-as-draft-for-public-comment/#comments</comments>
		<pubDate>Wed, 09 Feb 2011 16:22:20 +0000</pubDate>
		<dc:creator>terboven</dc:creator>
				<category><![CDATA[OpenMP]]></category>
		<category><![CDATA[Binding]]></category>
		<category><![CDATA[C++]]></category>
		<category><![CDATA[C++0X]]></category>
		<category><![CDATA[cc-NUMA]]></category>
		<category><![CDATA[IWOMP]]></category>

		<guid isPermaLink="false">http://terboven.wordpress.com/?p=10214</guid>
		<description><![CDATA[You might have heard it already: The next incarnation of the OpenMP specification, which is targeted to be released as version 3.1 around June in time for IWOMP 2011 in Chicago, has been published as a Draft for Public Comment. You may think of it as beta . Back in October 2009, I already commented [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=terboven.wordpress.com&amp;blog=5383873&amp;post=10214&amp;subd=terboven&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>You might have heard it already: The next incarnation of the OpenMP specification, which is targeted to be released as version 3.1 around June in time for <a href="http://www.ncsa.illinois.edu/Conferences/IWOMP11/">IWOMP 2011</a> in Chicago, has been published as a <a href="http://openmp.org/wp/2011/02/31-draft-specs-ready-for-public-comment/" target="_blank">Draft for Public Comment</a>. You may think of it as beta <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> .</p>
<p>Back in October 2009, I already commented on some of the <a href="http://terboven.wordpress.com/2009/10/04/how-openmp-is-moving-towards-version-3-1-4-0/#comments">goals for versions 3.1 and 4.0</a>. OpenMP 3.1 addresses some issues found in the 3.0 specification and brings only minor functional improvements; still, it will be released almost one year behind our initially planned schedule. However, work on version 4.0 has already made some significant progress, including support for accelerators (GPUs), further enhancements to the tasking model, and support for error handling. Taking the outline of my <a href="http://terboven.wordpress.com/2009/10/04/how-openmp-is-moving-towards-version-3-1-4-0/#comments">previous post on the development of OpenMP</a>, this is the list of updates to be found in OpenMP 3.1 and the status of the development towards OpenMP 4.0 (expressed in my own words and stating my own beliefs and opinions):</p>
<p><strong>1: Development of an OpenMP Error Model.</strong> There is nothing new on this topic in OpenMP 3.1. However, with respect to OpenMP 4.0, the so-called <em>done </em>directive has been discussed for quite some time already. It can be used to terminate the execution of a <em>Parallel Region</em>, or a <em>Worksharing </em>construct, or a <em>Task </em>construct, and it is a prominent candidate for the next OpenMP spec. It would provide necessary functionality towards full-featured error handling capabilities, for which there is no good proposal that could be agreed upon yet.</p>
<p><strong>2: Interoperability and Composability.</strong> There is nothing new on this topic in OpenMP 3.1. We made several experiments, gained some insights, and the goal is to come up with a set of reliable expectations and assertions in the OpenMP 4.0 timeframe.</p>
<p><strong>3: Incorporating Tools Support into the OpenMP Specification.</strong> There is currently no activity on this topic in the OpenMP Language Committee in general.</p>
<p><strong>4: Associating Computation or Memory across Workshares.</strong> There is little progress in this direction to be found in OpenMP 3.1. The environment variable <em>OMP_PROC_BIND</em> has been added to control the binding of threads to processors; it accepts a boolean value. If enabled, the OpenMP runtime is instructed to not move OpenMP threads between processors. The mapping of threads to processors is unspecified and thus depends on the implementation. In general, introducing this variable that controls program-wide behavior was intended to standardize behavior found in almost all current OpenMP implementations.</p>
<p><strong>5: Accelerators, GPUs and More.</strong> While there is nothing new on this topic in OpenMP 3.1, the Accelerator subcommittee put a lot of effort into coming up with a first (preliminary!) proposal. This is clearly interesting. From my personal point of view, OpenMP 4.0 might provide basic support for programming accelerators such as GPUs, thus delivering a vendor-neutral standard. Do not expect anything full-featured similar to CUDA; the current proposal is rather similar in spirit to the <a href="http://www.pgroup.com/resources/accel.htm" target="_blank">PGI Accelerator</a> approach (which I do like). However, this is still far from being done, and may (or may not) change directions completely. The crucial aspects are to integrate well with the rest of OpenMP, and to provide an easy-to-use but still powerful approach to allow for bringing certain important code patterns to accelerator devices.</p>
<p><strong>6: Transactional Memory and Thread Level Speculation.</strong> There is in general no activity on this topic in the OpenMP Language Committee and apparently it dropped from the set of important topics. Personally, (now) I do not think TM should be a target for OpenMP in the foreseeable future.</p>
<p><strong>7: Refinements to the OpenMP Tasking Model.</strong> There have been some improvements to the Tasking model, with some more on the roadmap for OpenMP 4.0.</p>
<ul>
<li>The <em>taskyield </em>directive has been added to allow for user-defined task scheduling points (tsp). A tsp is a point in the execution of a task at which it can be suspended to be resumed later; or the event of task completion, after which the executing thread may switch to a different task.</li>
<li>The <em>mergeable </em>clause has been added to the list of possible task clauses, indicating that the task may have the same data environment as the generating task region.</li>
<li>The <em>final </em>clause has been added to the list of possible task clauses, denoting the execution of all descendant tasks sequentially in the same region. This implies immediate execution of final tasks, and ignoring any untied task clauses. An optional scalar expression allows for conditioning the application of the final clause.</li>
</ul>
<p><strong>8: Extending OpenMP to C++0x and FORTRAN 2003.</strong> There is nothing new on this topic in OpenMP 3.1. We closely follow the development of the base language and it remains to be seen what can (or has to) be done for OpenMP 4.0. Anyhow, the fact that base languages are introducing threading and a thread-aware memory model leads to some simplifications on the one hand, but also could lead to potential conflicts on the other hand. We are not aware of any such conflict, but digging through the details and implications of a base language such as C++ as well as OpenMP is a pretty complex task.</p>
<p><strong>9: Extending OpenMP to Additional Languages.</strong> There is nothing new on this topic in OpenMP 3.1, and currently there is no intention of doing so inside the OpenMP Language Committee. Personally, I would like to see an OpenMP binding for Java, since it would really help teaching parallel programming, but I do not see this happen.</p>
<p><strong>10: Clarifications to the Existing Specifications.</strong> There have been plenty of clarifications, corrections, and micro-updates. Most notably the examples and description in the appendix have been corrected, clarified, and expanded.</p>
<p><strong>11: Miscellaneous Extensions.</strong> A couple of miscellaneous extensions made it into OpenMP 3.1:</p>
<ul>
<li>The <em>atomic </em>construct has been extended to accept the following new clauses: <em>read</em>, <em>write</em>, <em>update</em> and <em>capture</em>. If none is given, it defaults to update. Specifying an atomic region allows you to atomically read / write / update the value of the variable affected by the construct. Note that not everything inside an atomic region is performed atomically, i.e. the evaluation of &#8220;other&#8221; variables is not. For example in an atomic write construct, only the left-hand variable (the one that is written to) is written atomically.</li>
<li>The <em>firstprivate </em>clause now accepts const-qualified types in C/C++ as well as intent(in) in Fortran. This is just a reaction to annoyances reported by some users.</li>
<li>The reduction clause has been extended to allow for <em>min </em>and <em>max </em>reductions for built-in datatypes in C/C++. This still excludes aggregate types (including arrays) as well as pointer and reference types from being used in an OpenMP reduction. We had a proposal for powerful user-defined reductions (UDRs) on the table for a long time; it was discussed heavily but did not make it into OpenMP 3.1. That would have made this release of the spec much stronger. Adding UDRs is a high priority for OpenMP 4.0 for many OpenMP Language Committee members, though.</li>
<li><em>omp_in_final()</em> is a new API routine to determine whether it is called from within a final (aka included) task region.</li>
</ul>
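<p>Two of these extensions can be sketched in a few lines of C. This is an illustrative example of my own, not code from the specification, and the function names <em>collect_positive</em> and <em>array_max</em> are hypothetical. The first function uses <em>atomic capture</em> to fetch and increment a shared counter in one atomic step; note that the subsequent store to <em>out</em> is outside the atomic region and is safe only because every thread obtained a distinct slot. The second function uses the new <em>max</em> reduction on a built-in type.</p>

```c
#include <assert.h>

/* Copies the positive entries of in[0..n) to out and returns how many were
   copied. Their order in out is unspecified when run in parallel. */
int collect_positive(const double *in, int n, double *out) {
    int used = 0;
    int i;
    assert(n >= 0);
    #pragma omp parallel for
    for (i = 0; i < n; i++) {
        if (in[i] > 0.0) {
            int slot;
            /* capture: read the old value of used and increment it atomically */
            #pragma omp atomic capture
            slot = used++;
            /* not part of the atomic region; each thread owns its slot */
            out[slot] = in[i];
        }
    }
    return used;
}

/* Largest entry of v[0..n), n >= 1, using the max reduction from 3.1. */
double array_max(const double *v, int n) {
    double m;
    int i;
    assert(n >= 1);
    m = v[0];
    #pragma omp parallel for reduction(max:m)
    for (i = 1; i < n; i++) {
        if (v[i] > m) m = v[i];
    }
    return m;
}
```

<p>Both functions also compile and give the same results without OpenMP, since the pragmas are then simply ignored.</p>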
<p><strong>12: Additional Task / Threads Synchronization Mechanisms.</strong> There is nothing new on this topic in OpenMP 3.1, and not much interest in the OpenMP Language Committee that I have noticed. However, we are thinking of task dependencies and task reductions for OpenMP 4.0, and both features would probably fall into this category (and then there would be a high interest).</p>
]]></content:encoded>
			<wfw:commentRss>http://terboven.wordpress.com/2011/02/09/openmp-3-1-released-as-draft-for-public-comment/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">terboven</media:title>
		</media:content>
	</item>
		<item>
		<title>Examining the NUMA architecture of a 8-socket Nehalem-EX</title>
		<link>http://terboven.wordpress.com/2010/10/18/examining-the-numa-architecture-of-a-8-socket-nehalem-ex/</link>
		<comments>http://terboven.wordpress.com/2010/10/18/examining-the-numa-architecture-of-a-8-socket-nehalem-ex/#comments</comments>
		<pubDate>Mon, 18 Oct 2010 12:57:20 +0000</pubDate>
		<dc:creator>terboven</dc:creator>
				<category><![CDATA[NUMA]]></category>
		<category><![CDATA[OpenMP]]></category>
		<category><![CDATA[Binding]]></category>
		<category><![CDATA[cc-NUMA]]></category>

		<guid isPermaLink="false">http://terboven.wordpress.com/?p=10206</guid>
		<description><![CDATA[I have been rather quiet on this blog for some while now, which is opposite to my intent – I plan to write more regularly again! And I will just continue with one of the topics I like most: NUMA architectures. Some while ago I talked about how different two systems equipped with exactly the [...]]]></description>
			<content:encoded><![CDATA[<p>I have been rather quiet on this blog for quite a while now, which is the opposite of my intent; I plan to write more regularly again! And I will just continue with one of the topics I like most: NUMA architectures. A while ago I talked about how different two systems equipped with exactly the same processors may look and how this can influence application performance. This blog post is about exploring the NUMA architecture of a very recent system in more detail.</p>
<p>Some days ago we got remote access to a very recent eight-way (meaning 8-socket) system equipped with Nehalem-EX processors. This makes 64 physical or 128 logical (hyper-threaded) cores per system! The system was kindly provided by Fujitsu. Since we will soon get plenty of those (not necessarily from Fujitsu, we really do not know yet), we took a close look at how it behaves. My colleague Dirk Schmidl in particular ran many of the benchmarks with the help of some student workers.</p>
<p>In the aforementioned previous blog post I pointed to the so-called System Locality Information Table (SLIT) provided by the BIOS. Does it help to understand how the eight sockets found in this server are related to each other? Taking a look at it, the answer is simple: No. It only knows about two levels: same socket (the diagonal: 10) and other socket (the rest: 12).</p>
<p style="text-align:center;">&nbsp;</p>
<p>&nbsp;</p>
<div id="attachment_10205" class="wp-caption aligncenter" style="width: 310px"><a href="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_slit.jpg"><img class="size-medium wp-image-10205 " title="8-socket Nehalem-EX: SLIT value matrix" src="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_slit.jpg?w=300&#038;h=97" alt="8-socket Nehalem-EX: SLIT value matrix" width="300" height="97" /></a><p class="wp-caption-text">8-socket Nehalem-EX: SLIT value matrix</p></div>
<p>&nbsp;</p>
<p>Our goal was to examine how the eight sockets are related to each other and how  “deep” the NUMA architecture of that machine really is. Of course you can get  that information from the system specification documentation, but in order to  get a <em>feeling</em> of the performance characteristics of a machine it is  good practice to examine it first on your own and then check whether your  conclusions match what is described.</p>
<p>We used a simple benchmark: We placed eight threads (each processor has eight physical cores) on one selected socket and made all of them access memory at another socket (well, one thread accesses the local socket). We then measured the achieved memory bandwidth [MB/s]. This resulted in the following performance matrix:</p>
<p>&nbsp;</p>
<div id="attachment_10202" class="wp-caption aligncenter" style="width: 310px"><a href="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_8threads_membw.jpg"><img class="size-medium wp-image-10202" title="8-socket Nehalem-EX: memory bandwidth matrix" src="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_8threads_membw.jpg?w=300&#038;h=98" alt="8-socket Nehalem-EX: memory bandwidth matrix" width="300" height="98" /></a><p class="wp-caption-text">8-socket Nehalem-EX: memory bandwidth matrix</p></div>
<p>&nbsp;</p>
<p>By measuring the memory bandwidth in this particular way we do not get the optimal aggregated memory bandwidth the system could deliver, since all sockets are busy and there is also some cache coherency traffic. Instead, our benchmark results are closer to what the system delivers when it is fully loaded with a rather bad memory access pattern.</p>
<p>Our measurements revealed three significantly different performance levels, of which one can further be split into two separate ones. The different levels are colored accordingly in the figure below. Depending on which socket you label as “0”, you can come up with the following architectural plot (my colleague Dieter an Mey did this particular one):</p>
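<p>For readers who want to try something similar, the measurement kernel of such a benchmark could look roughly like the following C sketch. This is not the code we used; the function name <em>read_bandwidth_mbs</em> is hypothetical, and the sketch assumes that the placement of threads and memory on particular sockets is done outside of the program by the operating system's affinity and NUMA tools. It only times repeated reads of a buffer and converts the result to MB/s (with 1 MB taken as 10^6 bytes).</p>

```c
#include <assert.h>
#ifdef _OPENMP
#include <omp.h>
static double now(void) { return omp_get_wtime(); }
#else
#include <time.h>
/* Serial fallback: CPU time is good enough for a single thread. */
static double now(void) { return (double)clock() / CLOCKS_PER_SEC; }
#endif

/* Reads buf[0..n) 'repeats' times and returns the achieved bandwidth in MB/s.
   The sum is handed back via checksum so the reads cannot be optimized away.
   Returns 0.0 if the timer was too coarse to measure the run. */
double read_bandwidth_mbs(const double *buf, long n, int repeats,
                          double *checksum) {
    double sum = 0.0, start, elapsed;
    long i;
    int r;
    assert(n > 0 && repeats > 0);
    start = now();
    for (r = 0; r < repeats; r++) {
        #pragma omp parallel for reduction(+:sum)
        for (i = 0; i < n; i++) {
            sum += buf[i];
        }
    }
    elapsed = now() - start;
    *checksum = sum;
    if (elapsed <= 0.0) return 0.0;
    return (double)n * sizeof(double) * repeats / elapsed / 1.0e6;
}
```

<p>For meaningful numbers the buffer has to be much larger than the caches, and it has to be allocated and initialized on the socket whose memory is to be measured.</p>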
<p>&nbsp;</p>
<div id="attachment_10203" class="wp-caption aligncenter" style="width: 310px"><a href="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_architecture.jpg"><img class="size-medium wp-image-10203" title="8-socket Nehalem-EX: architecture" src="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_architecture.jpg?w=300&#038;h=140" alt="8-socket Nehalem-EX: architecture" width="300" height="140" /></a><p class="wp-caption-text">8-socket Nehalem-EX: architecture</p></div>
<p>&nbsp;</p>
<p>One can see that we have two groups of four sockets each that are connected by apparently slightly slower links. I do not yet know what is causing this. Looking at the number of hops you get this matrix:</p>
<p>&nbsp;</p>
<div id="attachment_10204" class="wp-caption aligncenter" style="width: 310px"><a href="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_numhops.jpg"><img class="size-medium wp-image-10204" title="8-socket Nehalem-EX: number of hops matrix" src="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_numhops.jpg?w=300&#038;h=98" alt="8-socket Nehalem-EX: number of hops matrix" width="300" height="98" /></a><p class="wp-caption-text">8-socket Nehalem-EX: number of hops matrix</p></div>
<p>&nbsp;</p>
<p>The maximum number of hops to get from one socket to another socket is two. Since the Intel QuickPath interconnect allows using (up to) three connectors to build a multi-socket system, each socket has three neighbors that can be reached with just one hop.</p>
<p>Well, an aggregated memory bandwidth of nearly 90 GB/s with this bad memory access pattern is pretty ok. But it is not a factor of two over a system of four sockets. It is well-suited for shared memory parallel programs that can make use of that many cores (and a large total memory), but of course it does not offer a price-performance sweet spot (the price grows clearly faster than linearly with the number of sockets). And last but not least, although the memory bandwidth is really important for most HPC applications, there are also other factors that play an important role in an application’s performance on a given architecture. We did many more benchmarks to evaluate this system, of which I do not want to speak here and now, but by doing some memory bandwidth benchmarks we figured out what the system architecture looks like and how the eight sockets are related to each other.</p>
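<p>That eight sockets with three links each can indeed be wired so that no socket is more than two hops away is easy to check with a few lines of C. The sketch below is only an illustration: the link list <em>example_links</em> (a ring plus the four diagonals) is a hypothetical topology chosen for this example and not necessarily the wiring of the measured system, and <em>hop_matrix</em> is a made-up name. It computes all hop counts with the Floyd-Warshall algorithm and returns the largest one.</p>

```c
#include <assert.h>

#define SOCKETS 8
#define UNREACHABLE 99  /* larger than any possible hop count here */

/* Hypothetical link list: ring 0-1-...-7-0 plus the four diagonals.
   An assumption for illustration, not the topology of the measured system. */
const int example_links[12][2] = {
    {0, 1}, {1, 2}, {2, 3}, {3, 4}, {4, 5}, {5, 6}, {6, 7}, {7, 0},
    {0, 4}, {1, 5}, {2, 6}, {3, 7}
};

/* Fills hops[a][b] with the minimum number of links between sockets a and b
   and returns the largest entry, i.e. the maximum number of hops. */
int hop_matrix(const int (*links)[2], int nlinks, int hops[SOCKETS][SOCKETS]) {
    int a, b, k, max = 0;
    assert(nlinks >= 0);
    for (a = 0; a < SOCKETS; a++)
        for (b = 0; b < SOCKETS; b++)
            hops[a][b] = (a == b) ? 0 : UNREACHABLE;
    /* Every link is bidirectional and costs one hop. */
    for (k = 0; k < nlinks; k++) {
        int from = links[k][0], to = links[k][1];
        assert(from >= 0 && from < SOCKETS && to >= 0 && to < SOCKETS);
        assert(from != to);
        hops[from][to] = 1;
        hops[to][from] = 1;
    }
    /* Floyd-Warshall: allow socket k as an intermediate stop. */
    for (k = 0; k < SOCKETS; k++)
        for (a = 0; a < SOCKETS; a++)
            for (b = 0; b < SOCKETS; b++) {
                int via = hops[a][k] + hops[k][b];
                if (via < hops[a][b]) hops[a][b] = via;
            }
    for (a = 0; a < SOCKETS; a++)
        for (b = 0; b < SOCKETS; b++)
            if (hops[a][b] > max) max = hops[a][b];
    return max;
}
```

<p>For the example link list every socket has exactly three neighbors and the function returns 2. Wiring the same eight sockets as a cube, by contrast, would also use three links per socket but need three hops between opposite corners.</p>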
]]></content:encoded>
			<wfw:commentRss>http://terboven.wordpress.com/2010/10/18/examining-the-numa-architecture-of-a-8-socket-nehalem-ex/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">terboven</media:title>
		</media:content>

		<media:content url="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_slit.jpg?w=300" medium="image">
			<media:title type="html">8-socket Nehalem-EX: SLIT value matrix</media:title>
		</media:content>

		<media:content url="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_8threads_membw.jpg?w=300" medium="image">
			<media:title type="html">8-socket Nehalem-EX: memory bandwidth matrix</media:title>
		</media:content>

		<media:content url="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_architecture.jpg?w=300" medium="image">
			<media:title type="html">8-socket Nehalem-EX: architecture</media:title>
		</media:content>

		<media:content url="http://terboven.files.wordpress.com/2010/10/nehalemex_8socket_numhops.jpg?w=300" medium="image">
			<media:title type="html">8-socket Nehalem-EX: number of hops matrix</media:title>
		</media:content>
	</item>
	</channel>
</rss>
