Nodes

  • pnfs-2.t2.ucsd.edu used for gfactory
  • gftp-6b.t2.ucsd.edu used for job submission
cd ~fkw/CMS/CMSSW_1_5_4
source source_me
cd ../submitDir

Email about restarting gfactory

If things still do not work, you need to login to
gfactory@pnfs-2

kill the glidein factory
('killall python' will work)

and create a new factory.

You can go to 
glideinWMS/creation
and modify
glideinWMS.xml

Look for "Expose_Grid_environment" (or similar) and put all values to True...
I thought I set it right, but there must be a bug somewhere :(

The change the name (the one there is v3.1, make it v3.2 or v4)
and create the new factory
./create_glidein glideinWMS.xml

at this point, go to $HOME
cd $HOME

modify start_factory.sh to point to the correct new one
(replace v3.1 with whatever you called it)
and you can now restart it
./start_factory.sh

Restarting the submission machine

I managed to crash condor on gftp-6b by running out of disk space. After cleaning up all the disk space, I logged in as root and did a /etc/init.d/condor restart Checked what this actually does. It appears to start up the 9 schedd's.

The gfactory web monitoring

http://pnfs-2.t2.ucsd.edu:8319/glidefactory/monitor/glidein_v3.4/

The most useful part is in the total directory.

condor_history -name schedd_jobs7@ -completedsince "2007-11-12 06:14"

Setting priority factors in condor

condor_userprio -all
condor_userprio -setprio group_cms.uscms1586@osg-gw-4.t2.ucsd.edu 0.5
condor_userprio -setfactor group_cms.uscms1586@osg-gw-4.t2.ucsd.edu 1
condor_userprio -all

Refreshing the proxy for gfactory

All you need to do is:
On gftp-6b:
----------
voms-proxy-init -voms cms -valid 500:0.0
 This creates /tmp/x509up_u502
the copy that over to pnfs-2.

On pnfs-2:
----------
copy the file into ~gfactory/.globus/x509_service_proxy

Then be patient for about 30min or an hour or so. Eventually, the system starts back up.
If you kill and restart, it might go a little faster. However, Igor claims you need not kill and restart the gfactory.
-- FkW - 08 Nov 2007
Topic revision: r5 - 2007/11/21 - 00:27:36 - FkW
 
This site is powered by the TWiki collaboration platformCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback