Difference: BoscoSDSC (19 vs. 20)

Revision 20 - 2016/04/14 - Main.JeffreyDost

Line: 1 to 1
 
META TOPICPARENT name="WebHome"

HOWTO Install and Configure BOSCO for Glidein-Based Submission

Line: 47 to 47
 
  1. Add the BOSCO_HOST by running the bosco_cluster script with the following parameters. This forwards the passwordless BOSCO ssh key and installs BOSCO on the remote side:
     [FRONTEND_USER@FRONTEND_HOST ~]$ bosco_cluster --add BOSCO_USER@BOSCO_HOST BATCH_TYPE 
    where BATCH_TYPE = pbs, condor, etc.
  2. Run a BOSCO test job to check the connection between the FRONTEND_HOST and the BOSCO_HOST and its worker nodes.
     [FRONTEND_USER@FRONTEND_HOST ~]$ bosco_cluster --test BOSCO_USER@BOSCO_HOST 
  3. If successful, run bosco_stop on the FRONTEND_HOST.
     [FRONTEND_USER@FRONTEND_HOST ~]$ bosco_stop 
Changed:
<
<
  1. Finally, add the following elements to your frontend configuration file, frontend.xml. Note, you may add them to either the group or global credential definition. Note: All paths should be absolute, not relative.
     <credentials> <credential absfname="/path/to/grid_proxy" security_class="frontend" trust_domain="grid" type="grid_proxy"/> <credential absfname="/home/frontend/.ssh/bosco_key.rsa.pub" keyabsfname="/home/frontend/.ssh/bosco_key.rsa" pilotabsfname="/path/to/grid_proxy" security_class="frontend" trust_domain="bosco" type="key_pair"/> </credentials> 
  2. Please stop, reconfig, and restart your frontend. If successful, the FRONTEND_HOST is now properly configured.
    [root@FRONTEND_HOST ~]$ service gwms-frontend stop [root@FRONTEND_HOST ~]$ service gwms-frontend reconfig [root@FRONTEND_HOST ~]$ service gwms-frontend start 
>
>
  1. Finally, add the following elements to your frontend configuration file, frontend.xml. You may add them to either the group or the global credential definition. All paths should be absolute, not relative.
     <credentials>
       <credential absfname="/path/to/grid_proxy" security_class="frontend" trust_domain="grid" type="grid_proxy"/>
       <credential absfname="/home/frontend/.ssh/bosco_key.rsa.pub" keyabsfname="/home/frontend/.ssh/bosco_key.rsa" pilotabsfname="/path/to/grid_proxy" security_class="frontend" trust_domain="bosco" type="key_pair"/>
    </credentials>
    
  2. Stop, reconfigure, and restart your frontend. If successful, the FRONTEND_HOST is now properly configured.
    [root@FRONTEND_HOST ~]$ service gwms-frontend stop
    [root@FRONTEND_HOST ~]$ service gwms-frontend reconfig
    [root@FRONTEND_HOST ~]$ service gwms-frontend start
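
The credential definitions above require absolute paths that point at real files. As a minimal sketch, the shell function below checks a single path before you reconfigure the frontend. The function name check_credential_path is hypothetical and is not part of GlideinWMS or BOSCO; the paths in the usage comment are the example paths from the credential definitions.

```shell
# Hypothetical helper (not part of GlideinWMS or BOSCO): succeed only if
# the given path is absolute and names an existing regular file.
check_credential_path() {
    case "$1" in
        /*) ;;  # absolute path: fall through to the existence check
        *)  echo "not absolute: $1" >&2
            return 1 ;;
    esac
    if [ ! -f "$1" ]; then
        echo "missing file: $1" >&2
        return 1
    fi
    return 0
}

# Example usage with the example paths from the credential definitions:
#   check_credential_path /home/frontend/.ssh/bosco_key.rsa
#   check_credential_path /home/frontend/.ssh/bosco_key.rsa.pub
```

The function prints nothing on success and returns a non-zero status with a short message on stderr otherwise, so it can be chained with && in front of the reconfig command.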
    
 
  1. Next, log in to the FACTORY_HOST via ssh as root.
     [user@client ~]$ ssh root@FACTORY_HOST
  2. Install condor-bosco on the FACTORY_HOST as root.
     [root@FACTORY_HOST ~]$ yum install condor-bosco 
Changed:
<
<
  1. Remove and retouch the 60-campus_factory.config file.
     [root@FACTORY_HOST ~]$ rm /etc/condor/config.d/60-campus_factory.config [root@FACTORY_HOST ~]$ touch /etc/condor/config.d/60-campus_factory.config
  2. Now, add the entry for the BOSCO_HOST to factory configuration file, glideinWMS.xml.
     <entry name="CMS_TX_US_XXXXX_BOSCO" auth_method="key_pair" enabled="True" gatekeeper="BOSCO_USER@BOSCO_HOST" gridtype="batch BATCH_TYPE" rsl="" trust_domain="bosco" verbosity="std" work_dir="~/"> <config> <max_jobs> <default_per_frontend glideins="256" held="50" idle="50"/> <per_entry glideins="256" held="50" idle="50"/> <per_frontends> </per_frontends> </max_jobs> <release max_per_cycle="20" sleep="0.2"/> <remove max_per_cycle="5" sleep="0.2"/> <restrictions require_glidein_glexec_use="False" require_voms_proxy="False"/> <submit cluster_size="10" max_per_cycle="100" sleep="0.2" slots_layout="fixed"> <submit_attrs> </submit_attrs> </submit> </config> <allow_frontends></allow_frontends> <attrs> <attr name="CONDOR_VERSION" const="False" glidein_publish="False" job_publish="False" parameter="True" publish="True" type="string" value="default"/> <attr name="GLEXEC_JOB" const="True" glidein_publish="False" job_publish="False" parameter="True" publish="False" type="string" value="False"/> <attr name="GLIDEIN_CMSSite" const="True" glidein_publish="True" job_publish="True" parameter="True" publish="True" type="string" value="TX_US_XXXXX"/> <attr name="GLIDEIN_CPUS" const="True" glidein_publish="False" job_publish="True" parameter="True" publish="True" type="string" value="8"/> <attr name="GLIDEIN_Country" const="True" glidein_publish="True" job_publish="True" parameter="True" publish="True" type="string" value="US"/> <attr name="GLIDEIN_Glexec_Use" comment="This has been REQUIRED for historical reasons, OPTIONAL/NONE alt values" const="False" glidein_publish="True" job_publish="False" parameter="True" publish="True" type="string" value="NONE"/> <attr name="GLIDEIN_MaxMemMBs" const="True" glidein_publish="True" job_publish="False" parameter="True" publish="True" type="int" value="49152"/> <attr name="GLIDEIN_Max_Walltime" const="True" glidein_publish="False" job_publish="False" parameter="True" publish="True" type="int" value="171000"/> <attr name="GLIDEIN_ResourceName" 
const="True" glidein_publish="True" job_publish="True" parameter="True" publish="True" type="string" value="TX_US_XXXXX"/> <attr name="GLIDEIN_Site" const="True" glidein_publish="True" job_publish="True" parameter="True" publish="True" type="string" value="TX_US_XXXXX"/> <attr name="GLIDEIN_Supported_VOs" const="True" glidein_publish="False" job_publish="False" parameter="True" publish="True" type="string" value="CMS,MIS"/> <attr name="USE_CCB" const="True" glidein_publish="True" job_publish="False" parameter="True" publish="True" type="string" value="True"/> <attr name="X509_CERT_DIR" const="True" glidein_publish="False" job_publish="True" parameter="True" publish="True" type="string" value="/cvmfs/oasis.opensciencegrid.org/mis/certificates"/> </attrs> <files> </files> <infosys_refs> </infosys_refs> <monitorgroups> </monitorgroups> </entry> 
  3. Finally, build up a global ssh fingerprint list so that the FACTORY_HOST trusts the keys of both the BOSCO_HOST and the FRONTEND_HOST.
    [root@FACTORY_HOST ~]$ ssh-keyscan -t rsa,dsa BOSCO_HOST >> /etc/ssh/ssh_known_hosts [root@FACTORY_HOST ~]$ ssh-keyscan -t rsa,dsa FRONTENT_HOST >> /etc/ssh/ssh_known_hosts 
  4. Stop, reconfigure and restart your factory. If successful, the FACTORY_HOST is now properly configured. You may now submit user jobs to the BOSCO_HOST via the FRONTEND_HOST.
    [root@FRONTEND_HOST ~]$ service gwms-factory stop [root@FRONTEND_HOST ~]$ service gwms-factory reconfig [root@FRONTEND_HOST ~]$ service gwms-factory start 
>
>
  1. Remove and retouch the 60-campus_factory.config file.
    [root@FACTORY_HOST ~]$ rm /etc/condor/config.d/60-campus_factory.config
    [root@FACTORY_HOST ~]$ touch /etc/condor/config.d/60-campus_factory.config
  2. Now, add the entry for the BOSCO_HOST to the factory configuration file, glideinWMS.xml.
    <entry name="CMS_TX_US_XXXXX_BOSCO" auth_method="key_pair" enabled="True" gatekeeper="BOSCO_USER@BOSCO_HOST" gridtype="batch BATCH_TYPE" rsl="" trust_domain="bosco" verbosity="std" work_dir="~/">
       <config>
          <max_jobs>
             <default_per_frontend glideins="256" held="50" idle="50"/>
             <per_entry glideins="256" held="50" idle="50"/>
             <per_frontends>
             </per_frontends>
          </max_jobs>
          <release max_per_cycle="20" sleep="0.2"/>
          <remove max_per_cycle="5" sleep="0.2"/>
          <restrictions require_glidein_glexec_use="False" require_voms_proxy="False"/>
          <submit cluster_size="10" max_per_cycle="100" sleep="0.2" slots_layout="fixed">
             <submit_attrs>
             </submit_attrs>
          </submit>
       </config>
       <allow_frontends>
       </allow_frontends>
       <attrs>
          <attr name="CONDOR_VERSION" const="False" glidein_publish="False" job_publish="False" parameter="True" publish="True" type="string" value="default"/>
          <attr name="GLEXEC_JOB" const="True" glidein_publish="False" job_publish="False" parameter="True" publish="False" type="string" value="False"/>
          <attr name="GLIDEIN_CMSSite" const="True" glidein_publish="True" job_publish="True" parameter="True" publish="True" type="string" value="TX_US_XXXXX"/>
          <attr name="GLIDEIN_CPUS" const="True" glidein_publish="False" job_publish="True" parameter="True" publish="True" type="string" value="8"/>
          <attr name="GLIDEIN_Country" const="True" glidein_publish="True" job_publish="True" parameter="True" publish="True" type="string" value="US"/>
          <attr name="GLIDEIN_Glexec_Use" comment="This has been REQUIRED for historical reasons, OPTIONAL/NONE alt values" const="False" glidein_publish="True" job_publish="False" parameter="True" publish="True" type="string" value="NONE"/>
          <attr name="GLIDEIN_MaxMemMBs" const="True" glidein_publish="True" job_publish="False" parameter="True" publish="True" type="int" value="49152"/>
          <attr name="GLIDEIN_Max_Walltime" const="True" glidein_publish="False" job_publish="False" parameter="True" publish="True" type="int" value="171000"/>
          <attr name="GLIDEIN_ResourceName" const="True" glidein_publish="True" job_publish="True" parameter="True" publish="True" type="string" value="TX_US_XXXXX"/>
          <attr name="GLIDEIN_Site" const="True" glidein_publish="True" job_publish="True" parameter="True" publish="True" type="string" value="TX_US_XXXXX"/>
          <attr name="GLIDEIN_Supported_VOs" const="True" glidein_publish="False" job_publish="False" parameter="True" publish="True" type="string" value="CMS,MIS"/>
          <attr name="USE_CCB" const="True" glidein_publish="True" job_publish="False" parameter="True" publish="True" type="string" value="True"/>
          <attr name="X509_CERT_DIR" const="True" glidein_publish="False" job_publish="True" parameter="True" publish="True" type="string" value="/cvmfs/oasis.opensciencegrid.org/mis/certificates"/>
       </attrs>
       <files>
       </files>
       <infosys_refs>
       </infosys_refs>
       <monitorgroups>
       </monitorgroups>
    </entry>
    
  3. Finally, build up a global ssh fingerprint list so that the FACTORY_HOST trusts the keys of both the BOSCO_HOST and the FRONTEND_HOST.
    [root@FACTORY_HOST ~]$ ssh-keyscan -t rsa,dsa BOSCO_HOST >> /etc/ssh/ssh_known_hosts
    [root@FACTORY_HOST ~]$ ssh-keyscan -t rsa,dsa FRONTEND_HOST >> /etc/ssh/ssh_known_hosts
    
  4. Stop, reconfigure and restart your factory. If successful, the FACTORY_HOST is now properly configured. You may now submit user jobs to the BOSCO_HOST via the FRONTEND_HOST.
    [root@FACTORY_HOST ~]$ service gwms-factory stop
    [root@FACTORY_HOST ~]$ service gwms-factory reconfig
    [root@FACTORY_HOST ~]$ service gwms-factory start
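
The ssh-keyscan commands in step 3 append with >>, so running them again adds duplicate entries to /etc/ssh/ssh_known_hosts. As a rough sketch, the function below appends only lines that are not already in the target file. The name append_unique_lines is hypothetical, and the host names in the usage comment are the same placeholders used throughout this page.

```shell
# Hypothetical helper: read lines from stdin and append each one to the
# file named by $1 unless an identical line is already there.
append_unique_lines() {
    known_hosts_file="$1"
    touch "$known_hosts_file"
    while IFS= read -r line; do
        # Skip empty lines and comment lines
        case "$line" in
            ''|'#'*) continue ;;
        esac
        # -x matches whole lines, -F treats the key line as a fixed string
        if ! grep -qxF -- "$line" "$known_hosts_file"; then
            printf '%s\n' "$line" >> "$known_hosts_file"
        fi
    done
}

# Example usage (as root on the FACTORY_HOST; host names are placeholders):
#   ssh-keyscan -t rsa,dsa BOSCO_HOST FRONTEND_HOST | append_unique_lines /etc/ssh/ssh_known_hosts
```

Comparing whole lines keeps the check simple, but a host whose key has changed gets a second entry rather than a replacement, so stale keys still need to be removed by hand.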
    
 

Troubleshooting

 