LQCD Homepage

LQCD Home

QDCOC Computing

QBATCH Information


Introduction

Current Status

Quick Start

File Management

Job Description

Interactive Jobs

Queues on ACC Mbds

Power Cycling

qhdwcheck wrapper
qhdwcheck errors database web front

Deleting jobs

Basic PBS commands

PBS accounting

Machine Status


Web Display (Under Construction)
(Allocation status of all available partitions)

QCDOC Status (USDOE only)
(Partitions, Jobs DB, etc.)

Batch System: Current Status
(Available Queues, Running Jobs, etc.)

Errors Database
(DB of ASIC and Wire errors.)

Accounting and Usage Statistics


QCDOC Usage (USDOE only)(Under Construction)
USDOE QCDOC Machine Usage

New Users


Computer Accounts

Accessing QCDOC

CTS accounts

CyberSecurity Training

RBRC Users Mailing List

USDOE Users Mailing List


Internal Links
(Available to QCDOC Admins Only)

QBATCH: Introduction

Introduction

    The batch system for QCDOC is basically a customized version of the Torque Resource Manager. Using the batch system to submit jobs on QCDOC machine partitions provides certain advantages over running jobs manually, such as: job queueing, job monitoring and notification, recovery from machine hungs, event logging etc.
    Some of the QBATCH features are:
  • Every QCDOC machine partition has a Batch queue associated with it. For most of the partitions, the queue name is the name of the machine partition (for a list of available queues, do a: qstat -q on the Front-End host).
  • We have defined Limited Walltime (OneHr, FourHr etc.) batch queues on some of the general usage MotherBoards on Air Cooled Crates (ACC). For RBRC users, the Limitied Walltime queues are on ACC8; for USDOE users they are on ACC7.
  • Interactive queues (I1, I2) have also been defined on ACC motherboards. All interactive queues have a 1 Hour walltime limit.
  • Every queue can only have one running job (max_running=1). All other submitted jobs will be queued and will be scheduled on a FIFO basis.
  • The latest version of the QBATCH scripts is in: $QBATCH_HOME/. $QBATCH_HOME is a user environment variable defined at login.
  • Partitions do not need to be allocated in advance to use the batch system.
Prerequisites
In order to be able to submit QCDOC Batch jobs:
  • You must be using QOS v2.6.0 or later that supports the -d (or --detach) qdaemon option. Before submitting a job you should source the appropriate setup script. Since the current QOS production version is located at /qcdoc/sfw/qos/v2.6.0/pro, the setup script you need to run is (bash):
       source /qcdoc/sfw/qos/v2.6.0/pro/scripts/setup.sh
       

One of ten national laboratories overseen and primarily funded by the Office of Science of the U.S. Department of Energy (DOE), Brookhaven National Laboratory conducts research in the physical, biomedical, and environmental sciences, as well as in energy technologies and national security. Brookhaven Lab also builds and operates major scientific facilities available to university, industry and government researchers. Brookhaven is operated and managed for DOE's Office of Science by Brookhaven Science Associates, a limited-liability company founded by Stony Brook University, the largest academic user of Laboratory facilities, and Battelle, a nonprofit, applied science and technology organization.
Privacy and Security Notice