There was no stdout, and the stderr (attached and copied below) looks normal.<div><br></div><div><div>I started the controller and engines separately this time (with qsub), using the two attached scripts. This procedure worked fine before the upgrade.</div>
<div><br></div><div>I tried recreating the sge profile using the instructions from this thread &lt;<a href="http://python.6.n6.nabble.com/Getting-setup-on-a-remote-cluster-w-Sun-Grid-Engine-td1663090.html">http://python.6.n6.nabble.com/Getting-setup-on-a-remote-cluster-w-Sun-Grid-Engine-td1663090.html</a>&gt;, and this had no effect.</div>
</div><div><br></div><div>stderr for controller</div><div><br></div><div><div>    2012-07-11 14:59:49,674.674 [IPControllerApp] Using existing profile dir: u&#39;/home/robert/.ipython/profile_sge&#39;</div><div>    2012-07-11 14:59:49.797 [IPControllerApp] Hub listening on tcp://<a href="http://0.0.0.0:52512">0.0.0.0:52512</a> for registration.</div>
<div>    2012-07-11 14:59:49.799 [IPControllerApp] Hub using DB backend: &#39;NoDB&#39;</div><div>    2012-07-11 14:59:50.079 [IPControllerApp] hub::created hub</div><div>    2012-07-11 14:59:50.085 [IPControllerApp] writing connection info to /home/robert/.ipython/profile_sge/security/ipcontroller-client.json</div>
<div>    2012-07-11 14:59:50.103 [IPControllerApp] writing connection info to /home/robert/.ipython/profile_sge/security/ipcontroller-engine.json</div><div>    2012-07-11 14:59:50.129 [IPControllerApp] task::using Python leastload Task scheduler</div>
<div>    2012-07-11 14:59:50.135 [IPControllerApp] Heartmonitor started</div><div>    2012-07-11 14:59:50.166 [scheduler] Scheduler started [leastload]</div><div>    2012-07-11 14:59:50.195 [IPControllerApp] Creating pid file: /home/robert/.ipython/profile_sge/pid/ipcontroller.pid</div>
<div><br></div><div>stderr for engines</div><div><br></div><div><div>    2012-07-11 14:59:58,438.438 [IPClusterEngines] Using existing profile dir: u&#39;/home/robert/.ipython/profile_sge&#39;</div><div>    2012-07-11 14:59:58.470 [IPClusterEngines] IPython cluster: started</div>
<div>    2012-07-11 14:59:58.471 [IPClusterEngines] Starting engines with [daemon=False]</div><div>    2012-07-11 14:59:58.471 [IPClusterEngines] Starting 5 Engines with SGEEngineSetLauncher</div><div>    2012-07-11 14:59:58.559 [IPClusterEngines] Job submitted with job id: &#39;2092&#39;</div>
<div>    2012-07-11 15:00:28.559 [IPClusterEngines] Engines appear to have started successfully</div></div><div><br></div><div>-Robert</div><br><div class="gmail_quote">On Wed, Jul 11, 2012 at 10:42 AM, MinRK <span dir="ltr">&lt;<a href="mailto:benjaminrk@gmail.com" target="_blank">benjaminrk@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">What is the stdout/err of the controller and engine jobs?<br>
<div><div class="h5"><br>
On Wed, Jul 11, 2012 at 6:03 PM, Robert Nishihara<br>
&lt;<a href="mailto:robertnishihara@gmail.com">robertnishihara@gmail.com</a>&gt; wrote:<br>
&gt; My cluster recently upgraded to IPython 0.13. Now, when I run<br>
&gt;<br>
&gt;     ipcluster start -n 3 --profile=sge<br>
&gt;<br>
&gt; the controller and engines get submitted to the queue, but the terminate<br>
&gt; immediately after starting. However, the output looks normal<br>
&gt;<br>
&gt;     2012-07-11 11:56:27,531.531 [IPClusterStart] Using existing profile dir:<br>
&gt; u&#39;/home/robert/.ipython/profile_sge&#39;<br>
&gt;     2012-07-11 11:56:27.566 [IPClusterStart] Starting ipcluster with<br>
&gt; [daemon=False]<br>
&gt;     2012-07-11 11:56:27.570 [IPClusterStart] Creating pid file:<br>
&gt; /home/robert/.ipython/profile_sge/pid/ipcluster.pid<br>
&gt;     2012-07-11 11:56:27.573 [IPClusterStart] Starting Controller with<br>
&gt; SGEControllerLauncher<br>
&gt;     2012-07-11 11:56:27.723 [IPClusterStart] Job submitted with job id:<br>
&gt; &#39;2088&#39;<br>
&gt;     2012-07-11 11:56:28.568 [IPClusterStart] Starting 3 Engines with<br>
&gt; SGEEngineSetLauncher<br>
&gt;     2012-07-11 11:56:28.645 [IPClusterStart] Job submitted with job id:<br>
&gt; &#39;2089&#39;<br>
&gt;     2012-07-11 11:56:58.647 [IPClusterStart] Engines appear to have started<br>
&gt; successfully<br>
&gt;<br>
&gt; Is there a good way to troubleshoot this? The --debug flag doesn&#39;t seem to<br>
&gt; give me any useful information.<br>
&gt;<br>
&gt; -Robert<br>
&gt;<br>
</div></div>&gt; _______________________________________________<br>
&gt; IPython-User mailing list<br>
&gt; <a href="mailto:IPython-User@scipy.org">IPython-User@scipy.org</a><br>
&gt; <a href="http://mail.scipy.org/mailman/listinfo/ipython-user" target="_blank">http://mail.scipy.org/mailman/listinfo/ipython-user</a><br>
&gt;<br>
_______________________________________________<br>
IPython-User mailing list<br>
<a href="mailto:IPython-User@scipy.org">IPython-User@scipy.org</a><br>
<a href="http://mail.scipy.org/mailman/listinfo/ipython-user" target="_blank">http://mail.scipy.org/mailman/listinfo/ipython-user</a><br>
</blockquote></div><br></div>