<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>University Research Computing</title>
	<atom:link href="http://www.urc.uncc.edu/urc/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.urc.uncc.edu/urc</link>
	<description>High Performance Computing (HPC) &#124; UNC Charlotte &#124; Charlotte, North Carolina</description>
	<lastBuildDate>Tue, 10 Nov 2009 17:20:41 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.x</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Power Outage to impact ChargerNet Cluster</title>
		<link>http://www.urc.uncc.edu/urc/announcements/power-outage-to-impact-chargernet-cluster/</link>
		<comments>http://www.urc.uncc.edu/urc/announcements/power-outage-to-impact-chargernet-cluster/#comments</comments>
		<pubDate>Tue, 10 Nov 2009 17:18:30 +0000</pubDate>
		<dc:creator>Chuck Price</dc:creator>
				<category><![CDATA[Announcements]]></category>

		<guid isPermaLink="false">http://www.urc.uncc.edu/urc/?p=222</guid>
		<description><![CDATA[ChargerNet Users:
 Due to the failure of a system board in the UPS (Uninterruptible Power Supply), there will be a planned power outage for the campus server room on the weekend of November 15th.  As a result, we will have to shut down the ChargerNet Cluster and any jobs that are running at the time [...]]]></description>
			<content:encoded><![CDATA[<p><!--StartFragment--><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span style="font-size: 11pt;">ChargerNet Users:</p>
<p> Due to the failure of a system board in the UPS (Uninterruptible Power Supply), there will be a planned power outage for the campus server room on the weekend of November 15th.  As a result, we will have to shut down the ChargerNet Cluster and any jobs that are running at the time of the shutdown will be lost.   We plan to begin the shut down at 5pm on Friday, Nov. 13th, and return the cluster to production no later than 8am on Monday the 16th.   Please try to plan your work so that your jobs will complete prior to the shutdown (or can be restarted following the outage). </p>
<p> We apologize for the inconvenience, </p>
<p> Chuck</span></span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.urc.uncc.edu/urc/announcements/power-outage-to-impact-chargernet-cluster/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ChargerNet File System Access</title>
		<link>http://www.urc.uncc.edu/urc/cluster-utilities/chargernet-file-system-access/</link>
		<comments>http://www.urc.uncc.edu/urc/cluster-utilities/chargernet-file-system-access/#comments</comments>
		<pubDate>Thu, 24 Sep 2009 12:23:03 +0000</pubDate>
		<dc:creator>Mike Mosley</dc:creator>
				<category><![CDATA[Cluster Utilities]]></category>
		<category><![CDATA[FAQs]]></category>
		<category><![CDATA[cluster]]></category>
		<category><![CDATA[storage]]></category>

		<guid isPermaLink="false">http://www.urc.uncc.edu/urc/?p=174</guid>
		<description><![CDATA[
Access to the ChargerNet File System is available for those users who need to access their home directory or project area from a remote client (on campus or via the campus VPN).
URC maintains a Samba server which allows SMB capable clients (i.e. MS-Windows, MacOS X, and Linux) to mount remote shares as a local drive [...]]]></description>
			<content:encoded><![CDATA[<p><br class="spacer_" /></p>
<p><span style="font-size: medium;">Access to the ChargerNet File System is available for those users who need to access their home directory or project area from a remote client (on campus or via the campus VPN).</span></p>
<p><span style="font-size: medium;">URC maintains a Samba server which allows SMB capable clients (i.e. MS-Windows, MacOS X, and Linux) to mount remote shares as a local drive or mount point.  To obtain permission to use this server, send email to URCSupport@uncc.edu.</span></p>
<p><span style="font-size: medium;"><span id="more-174"></span>Below are instructions for mounting a ChargerNet Samba Share from  various client machine types.</span><em><span style="font-size: medium;"> Note that<strong> </strong>the Samba server  authenticates using Campus (NinerNet) passwords.</span></em></p>
<p><br class="spacer_" /></p>
<p><span style="font-size: medium;"> </span></p>
<p><span style="font-size: medium;">The examples below assume the following for ChargerNet user<em> &#8220;joeuser&#8221;</em>:</span></p>
<p style="padding-left: 30px;"><span style="font-size: medium;">ChargerNet Samba Server:                        <strong>samba.urc.uncc.edu</strong><br />
 </span></p>
<p style="padding-left: 30px;"><span style="font-size: medium;">ChargerNet login:                                         <strong>joeuser   (its.uncc.edu/joeuser)*<br />
 </strong></span></p>
<p style="padding-left: 30px;"><span style="font-size: medium;">ChargerNet home directory:                      <strong>/users/joeuser</strong></span></p>
<p style="padding-left: 30px;"><span style="font-size: medium;">ChargerNet share name:                            <strong>joeuser <br />
 </strong></span></p>
<p><strong><br />
 </strong></p>
<p><span style="text-decoration: underline;"><strong>[ MS-Windows XP ]</strong></span><em><strong> </strong></em></p>
<p><strong>1.</strong> Select &#8216;<em>My Computer&#8217;</em></p>
<p><strong>2.</strong> Select &#8216;Tools &#8211; &gt; Map Network Drive&#8217;</p>
<p><strong>3.</strong> Fill in path to folder (share)  i.e.  <strong>\\<em>samba.urc.uncc.edu</em>\<em>joeuser</em></strong></p>
<p><strong>4. </strong>Select &#8216;<em>Connect using a different user name&#8217;</em></p>
<p><strong>5.</strong> Fill in username<strong>* (</strong><em>see footnote below</em><strong>)</strong> i.e    <em><strong>its.uncc.edu</strong></em><strong>\</strong><em><strong>joeuser </strong></em><strong> </strong></p>
<p><strong>6.</strong> Fill in <strong>NinerNet</strong> password<em>.<br />
 </em></p>
<p><br class="spacer_" /></p>
<p><span style="text-decoration: underline;"><strong>[ MacOS X ]<br />
 </strong></span></p>
<p><strong>1.</strong> In &#8216;Finder&#8217; application select Go -&gt; Connect to Sever</p>
<p><strong>2.</strong> Fill in Server address (and share) indicating smb protocol i.e. <strong>smb://samba.urc.uncc.edu/joeuser</strong></p>
<p><strong>3.</strong> Press &#8216;Connect&#8217;</p>
<p><strong>4.</strong> Fill in Name (ChargerNet login)  i.e.<strong> joeuser </strong> (domain not required)<strong><br />
 </strong></p>
<p><strong>5.</strong> Fill in <strong>NinerNet</strong> password</p>
<p><br class="spacer_" /></p>
<p><span style="text-decoration: underline;"><strong>[ Red Hat Linux - File Browser ]<br />
 </strong></span></p>
<p><strong>1</strong>. In &#8216;<em>File Broswer</em>&#8216; application select <em>File -&gt; Connect to Server</em></p>
<p><strong>2.</strong> Fill in the following:</p>
<p style="padding-left: 30px;"><em>Service Type: </em><strong>Windows share</strong></p>
<p style="padding-left: 30px;"><em>Server:</em> <strong>samba.urc.uncc.edu</strong></p>
<p style="padding-left: 30px;"><em> Share:</em> <strong>joeuser</strong></p>
<p style="padding-left: 30px;"><em>Domain Name:</em> <strong>its.uncc.edu</strong></p>
<p style="padding-left: 30px;"><em>Name to use for connection:</em> <strong>joeuser </strong></p>
<p><strong>3.</strong> Press &#8216;<em>Connect&#8217; </em> button</p>
<p><strong>4, </strong>Fill in <strong>NinerNet </strong>password.<br class="spacer_" /></p>
<p><br class="spacer_" /></p>
<p><span style="text-decoration: underline;"><strong>[ Red hat Linux - Command Line ]</strong></span></p>
<p>You can use the mount.cifs(8) command which is part of the samba(7) suite.  See the man page for mount.cifs for more details</p>
<p>To mount a share (to /home/joeuser/samba on the client) use the following command:</p>
<p style="padding-left: 30px;"><em><strong>$ /sbin/mount.cifs //samba.urc.uncc.edu/joeuser  /home/joeuser/samba -o user=joeuser</strong></em></p>
<p>To unmount the share:</p>
<p style="padding-left: 30px;"><em><strong> $/sbin/umount.cifs /home/joeuser/samba</strong></em></p>
<p><br class="spacer_" /></p>
<p><em><strong>*</strong> Objects on the network including servers and even users are distinguished using &#8220;domain&#8221; naming conventions.  For instance, the samba server referenced above belongs to URC and is therefore part of the &#8216;urc&#8217; domain.  Hence the full name of that object is: samba.<strong>urc</strong>.uncc.edu.</em></p>
<p><em>However, since the samba server uses the campus NinetNet system to authenticate user logins, the full user name (where required as in the case of a MS-Windows client)  uses the &#8216;its&#8217; domain not the &#8216;urc&#8217; domain.  Hence, a fully qualified user name is specified as:  <strong>its</strong>.uncc.edu</em>\<em>joeuser.</em></p>
<p><strong> </strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.urc.uncc.edu/urc/cluster-utilities/chargernet-file-system-access/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>URC Webserver Upgrade</title>
		<link>http://www.urc.uncc.edu/urc/old-announcements/urc-webserver-upgrade/</link>
		<comments>http://www.urc.uncc.edu/urc/old-announcements/urc-webserver-upgrade/#comments</comments>
		<pubDate>Fri, 04 Sep 2009 20:48:07 +0000</pubDate>
		<dc:creator>Jonathan Halter</dc:creator>
				<category><![CDATA[Old Announcements]]></category>

		<guid isPermaLink="false">http://www.urc.uncc.edu/urc/?p=166</guid>
		<description><![CDATA[ChargerNet Users:
ChargerNet Users:
We are planning a  URC Webserver/Portal outage for Tuesday, September 8, 2009 from 7am until 8am.   We will use this outage to upgrade the webserver to Redhat EL 5.3.   The following websites will be unavailable for the duration of the upgrade:
The URC Website:
http://www.urc.uncc.edu/urc/
The ChargerNet Portal:
https://chargernet.uncc.edu/portal/
The outage will impact access to the [...]]]></description>
			<content:encoded><![CDATA[<p>ChargerNet Users:</p>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">ChargerNet Users:</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">We are planning a  URC Webserver/Portal outage for Tuesday, September 8, 2009 from 7am until 8am.   We will use this outage to upgrade the webserver to Redhat EL 5.3.   The following websites will be unavailable for the duration of the upgrade:</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">The URC Website:</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">http://www.urc.uncc.edu/urc/</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">The ChargerNet Portal:</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">https://chargernet.uncc.edu/portal/</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">The outage will impact access to the cluster through the ChargerNet Portal, but it will NOT impact cluster usage through the submit hosChargerNet Users:</div>
<p>We are planning a  <strong>URC Webserver/Portal outage</strong> for <strong>Tuesday, September 8, 2009</strong> from <strong>7am until 8am</strong>.   We will use this outage to upgrade the webserver to Redhat EL 5.3.<span id="more-166"></span>The following websites will be unavailable for the duration of the upgrade:</p>
<p>The URC Website:<br />
 <a title="URC Website" href="http://www.urc.uncc.edu/urc/"> http://www.urc.uncc.edu/urc/</a></p>
<p>The ChargerNet Portal:<br />
 <a title="ChargerNet Portal" href="https://chargernet.uncc.edu/portal/"> https://chargernet.uncc.edu/portal/</a></p>
<p>The outage will impact access to the cluster through the ChargerNet Portal, but it will NOT impact cluster usage through the submit hosts.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.urc.uncc.edu/urc/old-announcements/urc-webserver-upgrade/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Update on Transition to Torque/Maui</title>
		<link>http://www.urc.uncc.edu/urc/old-announcements/update-on-transition-to-torquemaui/</link>
		<comments>http://www.urc.uncc.edu/urc/old-announcements/update-on-transition-to-torquemaui/#comments</comments>
		<pubDate>Tue, 01 Sep 2009 13:18:47 +0000</pubDate>
		<dc:creator>Chuck Price</dc:creator>
				<category><![CDATA[Old Announcements]]></category>
		<category><![CDATA[cluster]]></category>
		<category><![CDATA[maintenance]]></category>
		<category><![CDATA[torque]]></category>

		<guid isPermaLink="false">http://www.urc.uncc.edu/urc/?p=159</guid>
		<description><![CDATA[ChargerNet Cluster Users:
This is a reminder that we are transitioning away from the Condor Job Scheduler to the Torque/Maui Scheduling system.  We have been running both systems for most of the summer in order to allow sufficient time for you to learn the new commands and update your submission scripts and procedures.  We have now [...]]]></description>
			<content:encoded><![CDATA[<p>ChargerNet Cluster Users:</p>
<p>This is a reminder that we are transitioning away from the Condor Job Scheduler to the Torque/Maui Scheduling system.  We have been running both systems for most of the summer in order to allow sufficient time for you to learn the new commands and update your submission scripts and procedures.  We have now moved most of our compute nodes into Torque/Maui and we plan to complete the transition and retire Condor on September 15th.</p>
<p><span id="more-159"></span>If you have not already done so, please begin submitting your cluster jobs using the new system as described on our web pages at <a title="Job scheduling with Torque" href="http://www.urc.uncc.edu/urc/faqs/job-scheduling-with-torque/">http://www.urc.uncc.edu/urc/faqs/job-scheduling-with-torque/</a>.  We have provided example scripts for our most popular applications (see folder /apps/torque/examples on the cluster).  The URC staff is also available to assist you with the transition if needed – simply send an email request to <a title="email URC Support" href="mailto:URCSupport@uncc.edu">URCSupport@uncc.edu</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.urc.uncc.edu/urc/old-announcements/update-on-transition-to-torquemaui/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Application Updates for Fall 2009</title>
		<link>http://www.urc.uncc.edu/urc/apps/application-updates/</link>
		<comments>http://www.urc.uncc.edu/urc/apps/application-updates/#comments</comments>
		<pubDate>Tue, 14 Jul 2009 19:30:46 +0000</pubDate>
		<dc:creator>Jonathan Halter</dc:creator>
				<category><![CDATA[Applications]]></category>
		<category><![CDATA[Old Announcements]]></category>
		<category><![CDATA[abaqus]]></category>
		<category><![CDATA[app-testing]]></category>
		<category><![CDATA[applications]]></category>
		<category><![CDATA[cluster]]></category>
		<category><![CDATA[compilers]]></category>
		<category><![CDATA[matlab]]></category>

		<guid isPermaLink="false">http://www.urc.uncc.edu/urc/?p=145</guid>
		<description><![CDATA[There are several application changes that we will make prior to the start of the Fall 2009 Semester. In conjunction with the updates, we will be retiring old versions of the software. I’ve compiled a list of “new” application versions, as well as a list of the versions that will be retired:

Application versions to be [...]]]></description>
			<content:encoded><![CDATA[<p><!--StartFragment--><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span style="font-size: 11pt;">There are several application changes that we will make prior to the start of the Fall 2009 Semester. In conjunction with the updates, we will be retiring old versions of the software. I’ve compiled a list of “new” application versions, as well as a list of the versions that will be retired:<span id="more-145"></span><br />
</span></span></p>
<p><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span style="font-size: 11pt;"><strong>Application versions to be updated on the URC Cluster:</strong></span></span></p>
<p style="padding-left: 30px;"><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span style="font-size: 11pt;"> <strong>*ABAQUS 6.8-4</strong><br />
 <em><span style="font-size: small;"> path = /apps/pkg/abaqus/6.8-4<br />
 env = /apps/usr/env/abaqus-6.8-4</span></em><span style="font-size: small;"><br />
 </span><strong> *CFDACE 2009.0.14</strong><br />
 <em><span style="font-size: small;"> path = /apps/pkg/cfdace-2009.0.14<br />
 env = /apps/usr/env/cfdace-2009.0.14.sh</span></em><br />
 <strong> Intel compilers 11.0.083</strong><br />
 <em><span style="font-size: small;"> path = /apps/pkg/intel-11.0.083<br />
 env = /apps/usr/env/intel-11.0.sh</span></em><br />
 <strong> *Matlab R2009a</strong><br />
 <em><span style="font-size: small;"> path = /apps/pkg/apps/pkg/matlab-R2009a<br />
 env = (n/a; will be in default path)</span></em><br />
 <strong> PGI compilers 9.0-1</strong><br />
 <em><span style="font-size: small;"> path = /apps/pkg/pgi/linux86-64/9.0-1<br />
 env = /apps/usr/env/pgi-9.0.sh</span></em><br />
 <strong> SAS 9.2</strong><br />
 <em><span style="font-size: small;"> path = /apps/pkg/sas-9.2</span></em><br />
 <strong> *STARCCM+ 4.02</strong><br />
 <em><span style="font-size: small;"> path = /apps/pkg/sc+4.02<br />
 env = /apps/usr/env/starccm+4.02.007.sh</span></em></span></span></p>
<p style="padding-left: 30px;"><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span style="font-size: 11pt;"><em>* there are example torque scripts for these apps in /apps/torque/examples</em></span></span></p>
<p><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span style="font-size: 11pt;"><em> </em><strong>Application versions to be retired:</strong></span></span></p>
<p style="padding-left: 30px;"><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span style="font-size: 11pt;"><strong> </strong>ABAQUS 6.5-1<br />
 ABAQUS 6.6-1<br />
 CFDACE 2009<br />
 Intel compilers 9.1.047<br />
 Matlab R2007b<br />
 PGI compilers 7.04<br />
 STARCCM+ 3.06</span></span></p>
<p><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span style="font-size: 11pt;">The semester begins on August 17th; our plan is to retire the old application versions on August 15th. Please make sure that any scripts that reference the old versions are updated with the new versions. The new applications are on the system and available for use now, so you can start using them as soon as you like. I provide the path and environment file for the new apps as a reference for updating your scripts if need be.</span></span></p>
<p><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span style="font-size: 11pt;">If you have any questions or concerns regarding any of the applications mentioned above, please let us know by emailing <span style="color: #0000ff;"><span style="text-decoration: underline;"><a href="URCSupport@uncc.edu">URCSupport@uncc.edu</a></span></span>.</span></span></p>
<p><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span style="font-size: 11pt;"><br />
 </span></span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.urc.uncc.edu/urc/apps/application-updates/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Transition to Torque/Maui</title>
		<link>http://www.urc.uncc.edu/urc/old-announcements/transition-to-torquemaui/</link>
		<comments>http://www.urc.uncc.edu/urc/old-announcements/transition-to-torquemaui/#comments</comments>
		<pubDate>Mon, 22 Jun 2009 14:04:41 +0000</pubDate>
		<dc:creator>Chuck Price</dc:creator>
				<category><![CDATA[Old Announcements]]></category>
		<category><![CDATA[cluster]]></category>
		<category><![CDATA[torque]]></category>

		<guid isPermaLink="false">http://www.urc.uncc.edu/urc/?p=139</guid>
		<description><![CDATA[ChargerNet Users:
We are beginning to transition from the Condor Job Scheduler that we have used in the cluster since its inception in 2005 to the Torque/Maui Scheduling system which has become the predominate scheduler for academic HPC environments.  This change will allow us to take better advantage of tools and techniques developed at other universities, [...]]]></description>
			<content:encoded><![CDATA[<p>ChargerNet Users:</p>
<p>We are beginning to transition from the Condor Job Scheduler that we have used in the cluster since its inception in 2005 to the Torque/Maui Scheduling system which has become the predominate scheduler for academic HPC environments.  This change will allow us to take better advantage of tools and techniques developed at other universities, provide better integration with commercial products which support Torque/Maui for parallel processing (like MatLab), and improve our queue management by implementing resource limits or “fair share” algorithms. <span id="more-139"></span></p>
<p>This change to the scheduler will require all users to replace their condor submit scripts with similar torque submit scripts and switch from using the condor commands (condor_submit, condor_q, etc.) to manage cluster jobs to using the equivalent torque commands (qsub, qstat, etc.)   This change is conceptually very simple, but does require learning a few new techniques. The URC staff has prepared a web page with <a title="torque instructions" href="http://www.urc.uncc.edu/urc/faqs/job-scheduling-with-torque/">instructions for getting started with torque</a>, as well as, sample torque scripts for many of our most common applications (see folder /apps/torque/examples on the cluster). The staff is also available to assist users with the transition as needed – simply send an email request to <a href="mailto:URCSupport@uncc.edu">URCSupport@uncc.edu</a>.</p>
<p>In order to allow this transition to occur with the minimum possible disruption, we will run both schedulers for several months and slowly move the compute nodes from the condor pool into the torque pool as our users make the necessary adjustments.  Currently, we have 32 computing cores available in the URC torque pool and 120 computing cores in the corresponding condor pool with additional nodes available in pools that are dedicated to specific research groups or activities (e.g. the Mechanical Engineering nodes).  You can see a summary of the activity in the condor and torque pools on the <a title="URC Website" href="http://www.urc.uncc.edu/urc/">URC home page</a>.  Once all users and compute nodes have been moved over to torque, we will discontinue our use of condor.</p>
<p>Email questions or comments to <a href="mailto:URCSupport@uncc.edu">URCSupport@uncc.edu</a> .<a href="mailto:ceprice@uncc.edu"></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.urc.uncc.edu/urc/old-announcements/transition-to-torquemaui/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Job Scheduling with Torque</title>
		<link>http://www.urc.uncc.edu/urc/faqs/job-scheduling-with-torque/</link>
		<comments>http://www.urc.uncc.edu/urc/faqs/job-scheduling-with-torque/#comments</comments>
		<pubDate>Mon, 18 May 2009 15:37:25 +0000</pubDate>
		<dc:creator>Mike Mosley</dc:creator>
				<category><![CDATA[FAQs]]></category>
		<category><![CDATA[cluster]]></category>
		<category><![CDATA[torque]]></category>

		<guid isPermaLink="false">http://www.urc.uncc.edu/urc/?p=73</guid>
		<description><![CDATA[Introduction
As part of URC&#8217;s efforts to provide users with a more user-friendly and efficient environment, we are in the process of transitioning our job scheduler from Condor to Torque/Maui.  Torque is an Open Source scheduler based on the old PBS scheduler code. 
The following is a set of directions to assist a user in learning [...]]]></description>
			<content:encoded><![CDATA[<h3>Introduction</h3>
<p>As part of URC&#8217;s efforts to provide users with a more user-friendly and efficient environment, we are in the process of transitioning our job scheduler from Condor to Torque/Maui.  Torque is an Open Source scheduler based on the old PBS scheduler code. <span id="more-73"></span></p>
<p>The following is a set of directions to assist a user in learning to use Torque to submit jobs to the URC cluster(s).  It is tailored specifically to the URC environment and is by no means comprehensive.  Details not found in here can be found online at:</p>
<p><a href="http://www.clusterresources.com/resources/documentation.php/">http://www.clusterresources.com/resources/documentation.php/</a></p>
<p>as well as <a title="man pages" href="http://www.urc.uncc.edu/urc/man-pages/">man pages</a> for the various command.</p>
<p><em>Note:<br />
 Some of the sample scripts displayed in the text are not complete so that the reader can focus specifically on the item being discussed.  Full, working examples of scripts and commands are provided in the Examples section at the end of this document.</em></p>
<h3>Configuration</h3>
<p>Before submitting jobs, it is important to understand how the compute clusters are laid out in terms of Torque scheduling.</p>
<p>Like the Condor configuration, it is replacing, Torque at URC will accept jobs submitted from three hosts:</p>
<p>submit.urc.uncc.edu    (General URC users)<br />
 mees.urc.uncc.edu    (MEES users)<br />
 mees10.urc.uncc.edu    (MEES users &#8211; Dr Uddin)</p>
<p>As with Condor, users will use SSH to connect to one of the hosts above and from there submit the various Torque specific commands as outlined below. Also, submission of jobs to Torque will also be supported by the URC portal.</p>
<p>All jobs submitted from the submit host are funneled to the URC Torque server (m03) which is running the Torque and Maui server processes.</p>
<p>Compute nodes in the cluster(s) are logically grouped and accessed via Torque &#8220;queues.&#8221;  Users and groups of users are granted the rights to submit jobs to specific queues and hence run on specific nodes.  The currently defined queues are:</p>
<p style="padding-left: 30px;">batch    -    Default queue.  (Disabled)<br />
 urc    -    queue for the general URC users.<br />
 mees    -    queue for the MEES users.<br />
 mees10-    queue for MEES users &#8211; Dr. Uddin)<br />
 wrf    -    queue for the WRF project group</p>
<p>To determine if a specific user has been granted rights to submit to a particular queue, use the Torque command qmgr:</p>
<pre style="padding-left: 30px;">$ qmgr -c "list queue QNAME acl_users" m03</pre>
<p>where QNAME is one of the queues defined above.  <em>Note that all users have access to submit to the &#8220;urc&#8221; queue.  The access control list is NOT enabled for that queue.</em></p>
<h3>Submitting a Job</h3>
<p>Scheduling a job in Torque is similar to the method used in URC&#8217;s previous scheduler (Condor).  It requires creating a file that describes the job (in this case a shell script) and then that file is given as an argument to the Torque command &#8220;qsub&#8221; to execute the job.</p>
<p>First of all, here is a sample shell script (myjob.sh) describing a simple job to be submitted:</p>
<pre style="padding-left: 30px;">#! /bin/bash</pre>
<pre style="padding-left: 30px;"># ==== Main ======
/bin/date</pre>
<p>This script simply runs the &#8216;date&#8217; command.  To submit it to the scheduler for execution, we use the Torque qsub command:</p>
<pre style="padding-left: 30px;">$ qsub  myjob.sh</pre>
<p>This will cause the script (and hence the date command) to be executed on the default queue.</p>
<p>The simple example above can be expanded to demonstrate additional options:</p>
<pre style="padding-left: 30px;">$ qsub -N "MyJob"  -q "urc"  my_script.sh</pre>
<p>In this example, the &#8220;-n&#8221; switch gives the job a name while the &#8220;-q&#8221; switch is used to route the job to the &#8220;urc&#8221; queue.</p>
<p>Many of the command line options to qsub can also be specified in the shell script itself using Torque (PBS) directives. Using the previous example, our script (my_script.sh) could look like the following:</p>
<pre style="padding-left: 30px;">#!/bin/sh</pre>
<pre style="padding-left: 30px;"># ===== PBS OPTIONS =====
### Set the job name
#PBS -N "MyJob"

### Run in the queue named "urc"
#PBS -q "urc"</pre>
<pre style="padding-left: 30px;"># ==== Main ======
/bin/date</pre>
<p>Running the command is now simply:</p>
<pre style="padding-left: 30px;">$ qsub my_script.sh</pre>
<p>For the entire list of options, see the man page qsub i.e.</p>
<pre style="padding-left: 30px;">$ man qsub</pre>
<p>Standard Output and Standard Error<br />
 In  Torque, any output that would normally print to stdout or stderr is collected into two files. By default these files are placed in the initial working directory where you submitted the job from and are named:</p>
<p style="padding-left: 30px;"><strong>scriptname.{o}jobid</strong> for stdout<br />
 <strong>scriptname.{e}jobid</strong> for stderr</p>
<p>In our previous example (if we did not specify a job name with -n) that would translate to:</p>
<p style="padding-left: 30px;">My_script.sh.oNNN<br />
 My_script.sh.oNNN</p>
<p>Where NNN is the job ID number returned by qsub.  If I named the job with -n (as above) and it was assigned job id 801, the files would be:</p>
<p style="padding-left: 30px;">MyJob.o801<br />
 MyJob.e801</p>
<p><em>Note:<br />
 Torque accomplishes this by buffering the output on the execution host until the job completes and then copies it back to the working directory.   An unfortunate circumstance of this is that if your job does write information to stdout or stderr, you would be unable to view it until the job completes.</em></p>
<p>The qsub command does have an option (-k) to &#8220;keep&#8221; the files from being buffered and spooled in this manner.  Unfortunately, it will only allow the files to be created in the user&#8217;s home directory ($HOME) rather than in the working directory.  This is not very convenient in most cases.</p>
<p>To avoid this type of problem, URC suggests that (rather than using the -k option) that in your shell script (My_script.sh), you should explicity direct stdout and stderr to files.  There are many ways to do this in a shell script.  Some common ways are:</p>
<pre style="padding-left: 30px;">#! /bin/bash
exec 1&gt;$PBS_O_WORKDIR/out  2&gt;$PBS_O_WORKDIR/err</pre>
<p>which will direct stdout and stderr to the current working directory of the qsub command.</p>
<p>Another method is to enclose curly braces {} around the body of text in the script and recirect it:</p>
<pre style="padding-left: 30px;">#! /bin/bash</pre>
<pre style="padding-left: 30px;">{
/bin/date
} &gt; $PBS_O_WORKDIR/out 2&gt;$PBS_O_WORKDIR/err</pre>
<p>See the examples below for a more detailed example.</p>
<h3>Monitoring a Job</h3>
<p>Monitoring a Torque job is done primarily using the Torque command &#8220;qstat.&#8221; For instance, to see a list of available queues:</p>
<pre style="padding-left: 30px;">$ qstat -q</pre>
<p>To see the status of a specific queue:</p>
<pre style="padding-left: 30px;">$ qstat "queuename"</pre>
<p>To see the full status of a specific job:</p>
<pre style="padding-left: 30px;">$ qstat -f  jobid</pre>
<p>where jobid is the unique identifier for the job returned by the qsub command.</p>
<h3>Deleting a Job</h3>
<p>To delete a Torque job after it has been submitted,  use the qdel command:</p>
<pre style="padding-left: 30px;">$ qdel jobid</pre>
<p>where jobid is the unique identifier for the job returned by the qsub command.</p>
<h3>Monitoring Compute Nodes</h3>
<p>To see the status of the nodes associated with a specific queue, use the torque command pbs_nodes(1) (qlso refered to as qnodes):</p>
<pre style="padding-left: 30px;">$ pbsnodes :queue_name</pre>
<p>where  queue_name is the name of the queue  prefixed by a colon (:).  For example:</p>
<pre style="padding-left: 30px;">$ pbsnodes :urc</pre>
<p>would display information about all of the nodes associated with the &#8220;urc&#8221; queue.  The output includes (for each node) the number of cores available (np= ).  If there are jobs running on the node, each one is listed in the (jobs= ) field.  This shows how many of the available cores are actually in use.</p>
<h3>Parallel (MPI) Jobs</h3>
<p>Parallel jobs are submitted to Torque in the manner described above except that you must first ask Torque to reserve the number of  processors (cores) you are requesting in your job.  This is accomplished using the -l switch to the qsub command:</p>
<p>For example:</p>
<pre style="padding-left: 30px;">$ qsub  -q urc -l nodes=16 my_script.sh</pre>
<p>would submit my script requesting 16 processors (cores)  from the &#8220;urc&#8221; queue.  The script (my_script.sh) would look something like the following:</p>
<pre style="padding-left: 30px;">#! /bin/bash
mpirun -hostfile $PBS_NODEFILE  my_mpi_prgram</pre>
<p>If you need to specify a specify number of processors (cores) per compute host, you can append a colon (:) to the number of specified nodes and then append the number of processors per host.  For example, to request 16 total processors (cores) with only 4 per compute host, the syntax would be:</p>
<pre style="padding-left: 30px;">$ qsub  -q urc -l nodes=4:ppn=4 my_script.sh</pre>
<p>In this example, mpirun is using the environment variable $PBS_NODEFILE as the path to the hosts file that contains the list of nodes that MPI job can run on.  This variable is automatically set by Torque and the file it points to is automatically populated as a result of the &#8220;-l nodes=16&#8243; argument given to the qsub command.</p>
<p>Note that the syntax of the mpirun command varies across MPI implementations.  The $PBS_NODEFILE is set by Torque and so can be used by any implementation of mpirun provided that the proper syntax is used.  The example above (&#8211;hostfile) is for OpenMPI.</p>
<p>As described previously, options to qsub can be  specified directly in the script file.  For the example above, my_script.sh would look similar to the following:</p>
<pre style="padding-left: 30px;">#! /bin/bash
# ===== PBS OPTIONS =====
### Set the job name
#PBS -N MyJob

### Run in the queue named "urc"
#PBS -q urc
### Specify the number of cpus for your job.
#PBS -l nodes=4:ppn=4</pre>
<pre style="padding-left: 30px;">mpirun -hostfile $PBS_NODEFILE  my_mpi_prgram</pre>
<p><br class="spacer_" /></p>
<h3>Examples of Torque Submit Scripts</h3>
<p><em>NOTE: Additional sample scripts can be found online in <strong>/apps/torque/examples.</strong></em></p>
<p><strong>[1] Simple Job</strong></p>
<pre style="padding-left: 30px;">#! /bin/bash</pre>
<pre style="padding-left: 30px;"># ===== PBS OPTIONS =====
### Set the job name
#PBS -N MyJob
### Run in the queue named "urc"
#PBS -q urc

# ===== END PBS OPTIONS =====</pre>
<pre style="padding-left: 30px;"># Redirect standard out and standard error.
exec 1&gt;$PBS_O_WORKDIR/$PBS_JOBID.out 2&gt;$PBS_O_WORKDIR/$PBS_JOBID.err</pre>
<pre style="padding-left: 30px;"># Main Program
/bin/date</pre>
<p><strong>[2] Parallel Job &#8211; 16 Processors (Using OpenMPI)</strong></p>
<pre style="padding-left: 30px;">#! /bin/bash</pre>
<pre style="padding-left: 30px;"># ===== PBS OPTIONS =====
### Set the job name
#PBS -N MyJob
### Run in the queue named "urc"
#PBS -q urc
### Specify the number of cpus for your job.
#PBS -l nodes=16
# ===== END PBS OPTIONS =====</pre>
<pre style="padding-left: 30px;"># Redirect standard out and standard error.
exec 1&gt;$PBS_O_WORKDIR/$PBS_JOBID.out 2&gt;$PBS_O_WORKDIR/$PBS_JOBID.err</pre>
<pre style="padding-left: 30px;"># =========== Main Program ===========
# Setup to use OpenMPI
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/apps/sys/openmpi-1.2.4/rhel5_u1-64/gnu/lib
MPI_RUN=/apps/sys/openmpi-1.2.4/rhel5_u1-64/gnu/bin/mpirun</pre>
<pre style="padding-left: 30px;"># Run the program "simplempi" with an argument of "30"
$MPI_RUN --hostfile $PBS_NODEFILE /users/joe/simplempi 30</pre>
]]></content:encoded>
			<wfw:commentRss>http://www.urc.uncc.edu/urc/faqs/job-scheduling-with-torque/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Power Outage</title>
		<link>http://www.urc.uncc.edu/urc/old-announcements/power-outage/</link>
		<comments>http://www.urc.uncc.edu/urc/old-announcements/power-outage/#comments</comments>
		<pubDate>Thu, 19 Mar 2009 13:30:07 +0000</pubDate>
		<dc:creator>Chuck Price</dc:creator>
				<category><![CDATA[Old Announcements]]></category>

		<guid isPermaLink="false">http://www.urc.uncc.edu/urc/?p=71</guid>
		<description><![CDATA[ChargerNet Users:
 Last night at about 6pm, there was a fire in the room that provides power to the campus server room, as a result all primary campus systems including the ChargerNet cluster lost power until about 1:30am. Obviously, all running jobs were lost.
 URC staff is currently working to bring the cluster back on [...]]]></description>
			<content:encoded><![CDATA[<p><!--StartFragment--><span style="font-family: Calibri, Verdana, Helvetica, Arial;"><span>ChargerNet Users:</p>
<p> Last night at about 6pm, there was a fire in the room that provides power to the campus server room, as a result all primary campus systems including the ChargerNet cluster lost power until about 1:30am. Obviously, all running jobs were lost.</p>
<p> URC staff is currently working to bring the cluster back on line.  Hopefully, we will be back in production this afternoon, and I will provide additional details at that time.</p>
<p> We apologize for the inconvenience.</p>
<p> Chuck</span></span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.urc.uncc.edu/urc/old-announcements/power-outage/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Spring Break Network Outage</title>
		<link>http://www.urc.uncc.edu/urc/old-announcements/spring-break-network-outage/</link>
		<comments>http://www.urc.uncc.edu/urc/old-announcements/spring-break-network-outage/#comments</comments>
		<pubDate>Fri, 06 Mar 2009 21:38:40 +0000</pubDate>
		<dc:creator>Chuck Price</dc:creator>
				<category><![CDATA[Old Announcements]]></category>

		<guid isPermaLink="false">http://www.urc.uncc.edu/urc/?p=66</guid>
		<description><![CDATA[ChargerNet Users:
ITS is planning a network outage for Saturday March 14th from 6am until 9am.  As a result you will be unable to reach the ChargerNet computing cluster during this time.  The good news is that we expect the cluster to remain operational so your running jobs should not be affected.  The one exception is [...]]]></description>
			<content:encoded><![CDATA[<p>ChargerNet Users:</p>
<p>ITS is planning a network outage for <strong>Saturday March 14th from 6am until 9am</strong>.  As a result you will be unable to reach the ChargerNet computing cluster during this time.  The good news is that we expect the cluster to remain operational so your running jobs should not be affected.  The one exception is jobs that use the Abaqus software package – they may fail during the outage because they will be unable to contact the license server which is provided by the College of Engineering.</p>
<p>So, if you are using Abaqus, you may want to plan your work so that your jobs finish prior to March 14th.  If not, then the only impact you will see is loss of access to the cluster for 3 hours on the morning of March 14th.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.urc.uncc.edu/urc/old-announcements/spring-break-network-outage/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ChargerNet Maintenance Outage</title>
		<link>http://www.urc.uncc.edu/urc/old-announcements/chargernet-maintenance-outage/</link>
		<comments>http://www.urc.uncc.edu/urc/old-announcements/chargernet-maintenance-outage/#comments</comments>
		<pubDate>Tue, 30 Sep 2008 18:53:51 +0000</pubDate>
		<dc:creator>Chuck Price</dc:creator>
				<category><![CDATA[Old Announcements]]></category>
		<category><![CDATA[cluster]]></category>
		<category><![CDATA[storage]]></category>

		<guid isPermaLink="false">http://www.urc.uncc.edu/urc/?p=60</guid>
		<description><![CDATA[We are planning a cluster outage for Monday, October 13, 2008 from 10am until noon (this is the first day of fall break).   We will use this outage to perform a few minor maintenance tasks as well as to complete some upgrades to our disk storage system that will improve both manageability and scalability.   This [...]]]></description>
			<content:encoded><![CDATA[<p>We are planning a <strong>cluster outage</strong> for <strong>Monday, October 13, 2008</strong> from <strong>10am until noon</strong> (this is the first day of fall break).   We will use this outage to perform a few minor maintenance tasks as well as to complete some upgrades to our disk storage system that will improve both manageability and scalability.   This outage will impact the entire cluster and all running jobs.  Please plan your work so that your jobs will complete prior to the start of this outage or so that they can be restarted after the cluster is back online without losing the prior work.</p>
<p>We apologize for any inconvenience that this may cause, but the need to perform certain routine maintenance tasks requires a complete outage once or twice per year.  We do not anticipate the need to schedule any additional outages until spring break.</p>
<p>Please send a response if you have any questions or comments.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.urc.uncc.edu/urc/old-announcements/chargernet-maintenance-outage/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
