Scientific Computing and Data Facility

Recent Announcements

06/09/2026

IT Services

Group Responsible: IT Services

Affected Area: OpenShift Virtual Environment

Maintenance Type: transparent

The OpenShift cluster will undergo planned maintenance tomorrow to upgrade the cluster version. This will begin on Tuesday 6/9/26 at 8 AM. This should be "transparent" to all. If you see any issues or problems, please notify us.


Submitted by: Joe Frith <jfrith@bnl.gov>

06/04/2026

IT Services

Group Responsible: IT Services

Affected Area: GPFS02 File System

Maintenance Type: scheduled downtime

Due to a hardware upgrade on the Phenix GPFS cluster (GPFS02), we are scheduling downtime for the cluster on Thursday 6/04 between 10 AM and 12 PM. This will take all storage for the GPFS02 cluster offline.


Submitted by: Joe Frith <jfrith@bnl.gov>

06/03/2026

Service & Tools

Group Responsible: Service & Tools

Affected Area: Roundcube webmail

Maintenance Type: service interruption

Service has been restored to Roundcube webmail.


Submitted by: Christian Lepore <clepore@bnl.gov>

06/03/2026

Service & Tools

Group Responsible: Service & Tools

Affected Area: SCDF Webmail

Maintenance Type: service interruption

We are currently experiencing an interruption in connection to https://webmail.rhic.bnl.gov/ . We are looking into the issue and hope to have it resolved shortly.


Submitted by: Chris Lepore <clepore@bnl.gov>

06/03/2026

Service & Tools

Group Responsible: Service & Tools

Affected Area: Roundcube webmail

Maintenance Type: service interruption

Roundcube webmail is currently experiencing issues due to a previously planned transition to new proxy servers. We are working on the problem and will post updates as needed. We apologize for the delay.


Submitted by: Christian Lepore <clepore@bnl.gov>

05/28/2026

IT Services

Group Responsible: IT Services

Affected Area: OpenShift Virtual Environment

Maintenance Type: transparent

The OpenShift cluster will undergo a planned upgrade/maintenance tomorrow. This will begin on 5/28 at 8 AM. This should be "transparent" to all. If you see any issues or problems, please notify us.


Submitted by: Joe Frith <jfrith@bnl.gov>

05/21/2026

IT Services

Group Responsible: IT Services

Affected Area: Phenix GPFS (GPFS02)

Maintenance Type: scheduled downtime

Due to a hardware issue on the Phenix GPFS cluster (GPFS02), we are scheduling downtime for the cluster on Thursday 5/21 from 10 AM to 12 PM. This will take all storage for the GPFS02 cluster offline.


Submitted by: Joe Frith <jfrith@bnl.gov>

05/14/2026

Service & Tools

Group Responsible: Service & Tools

Affected Area: SCDF Mattermost

Maintenance Type: service interruption

SCDF Mattermost will be down for a short reboot to apply patches at approximately 23:00 EST. Users may experience a brief interruption while the server reboots but should be able to resume normal usage following this update.


Submitted by: Louis Pelosi <lpelosi@bnl.gov>

05/14/2026

IT Services

Group Responsible: IT Services

Affected Area: OpenShift Virtual Environment

Maintenance Type: transparent

The OpenShift cluster will undergo planned maintenance today to upgrade the cluster version from 4.18.26 to 4.18.34. The upgrade began on 5/12 and ran into some issues; these have been resolved, and the upgrade will now be completed. This should be "transparent" to all. If you see any issues or problems, please notify us.


Submitted by: Joe Frith <jfrith@bnl.gov>

05/13/2026

IT Fabric

Group Responsible: IT Fabric

Affected Area: OpenShift cluster - all VM networking and container workloads.

Maintenance Type: transparent

The OpenShift cluster will undergo planned maintenance to upgrade the cluster version from 4.18.26 to 4.18.34. This **should** be transparent according to Red Hat, who say they have fixed the issue that caused all VM secondary networks to drop during the master node portion of the last upgrade. If all goes well, we will upgrade to 4.19.30 early Thursday morning, but another notification will be sent on Wednesday.


Submitted by: Robert Hancock <hancock@bnl.gov>

05/04/2026

Service & Tools

Group Responsible: Service & Tools

Affected Area: The NX service will not be available during this time

Maintenance Type: service interruption

The NX servers will be rebooted to remediate a significant Linux vulnerability. All NX sessions will be terminated, so please save your work.


Submitted by: Saroj Kandasamy <saroj@bnl.gov>

05/01/2026

IT Fabric

Group Responsible: IT Fabric

Affected Area: sphnx and spool condor pools

Maintenance Type: service interruption

The sphnx and spool condor pool worker nodes are being drained and rebooted to remediate a critical Linux vulnerability.

sphnx worker nodes will be fully drained en masse. Running jobs will be allowed to run to completion (unless killed by their owner), but no new jobs will be able to run until some nodes fully drain, reboot, and start accepting new jobs again.

spool worker nodes will get a rolling partial drain + reboot to best balance remediation speed against disruption. Some jobs will be killed*, but unless they include their own hold expression specifying otherwise, they will be automatically restarted on another node.

* A full drain with no jobs killed would take far too long for a rolling remediation. When a node is remediated, jobs that have been running for <2hrs will be killed, as will any remaining jobs once the node reaches a 90% drain level (90% of its CPUs unallocated), but as stated, killed jobs should be restarted automatically.

The above plans were developed quickly with the largest stakeholders. We recognize this may be problematic for some spool experiments/users, and we apologize. Thank you for your patience and understanding.
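For users trying to estimate whether a given spool job is likely to be killed, the kill rule described above can be sketched roughly as follows. This is only an illustration of the rule as worded in this announcement, not the actual HTCondor configuration or tooling in use. The `Job` class, the function names, and the reading that a node at or above the 90% drain level has all of its remaining jobs killed are assumptions made for the example.

```python
from dataclasses import dataclass

# Thresholds quoted in the announcement.
MIN_RUNTIME_HOURS = 2.0   # jobs running for less than this are killed
FINAL_DRAIN_LEVEL = 0.90  # fraction of CPUs unallocated that triggers the final kill


@dataclass
class Job:
    # Hypothetical job record, for illustration only.
    job_id: str
    cpus: int
    runtime_hours: float


def drain_level(total_cpus: int, jobs: list[Job]) -> float:
    """Fraction of the node's CPUs that are currently unallocated."""
    allocated = sum(job.cpus for job in jobs)
    return (total_cpus - allocated) / total_cpus


def jobs_to_kill(total_cpus: int, jobs: list[Job]) -> list[Job]:
    """Jobs that would be killed right now on a node being remediated."""
    # Once the node is at least 90% drained, every remaining job is killed.
    if drain_level(total_cpus, jobs) >= FINAL_DRAIN_LEVEL:
        return list(jobs)
    # Otherwise only jobs that have run for less than two hours are killed.
    return [job for job in jobs if job.runtime_hours < MIN_RUNTIME_HOURS]


if __name__ == "__main__":
    node_jobs = [Job("a", 8, 0.5), Job("b", 40, 5.0)]
    for job in jobs_to_kill(100, node_jobs):
        print(f"job {job.job_id} would be killed and restarted elsewhere")
```

Under this reading, a long-running job survives until the rest of the node has mostly emptied, so jobs that can checkpoint or that finish within a few hours lose the least work.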


Submitted by: Matt Cowan <cowan@bnl.gov>

05/01/2026

IT Services

Group Responsible: IT Services

Affected Area: SSH logins for SCDF Staff will be interrupted.

Maintenance Type: service interruption

Due to the Linux kernel issue explained in CVE-2026-31431, we will be scheduling a reboot of admingw01&02 and staffgw01&02. This will cause connections to drop during the reboot process. Interruption time should be minimal.


Submitted by: Joe Frith <jfrith@bnl.gov>

04/30/2026

IT Fabric

Group Responsible: IT Fabric

Affected Area: SCDF managed interactive/submit nodes

Maintenance Type: service interruption

To remediate a significant Linux vulnerability, all SCDF managed interactive/submit nodes will be rebooted shortly with a ~10min warning message. Submitted condor jobs will continue running and will reconnect with the submit node after the reboot. For worker nodes, we are working on a solution that balances a prompt response with minimizing disruption of running jobs as much as feasible. Interactive logins and processes will be terminated when the interactive/submit nodes reboot. Once the nodes are back up, you should be able to continue your work. Thank you for your patience and understanding.


Submitted by: Matt Cowan <cowan@bnl.gov>

04/02/2026

Service & Tools

Group Responsible: Service & Tools

Affected Area: BNLBox

Maintenance Type: scheduled downtime

BNLBox service will be down for up to 1 hour for maintenance.


Submitted by: Hironori Ito <hito@bnl.gov>

04/01/2026

IT Services

Group Responsible: IT Services

Affected Area: RT

Maintenance Type: service interruption

The RT VM will be migrated off the RHEV platform. The RT web site will be unavailable during the migration but emails will be queued and delivered once the migration is complete. This will begin at 8:30am on Wednesday April 1st and is expected to take 3-4 hours. An update will be sent when the migration is complete.


Submitted by: Mark Berry <mberry@bnl.gov>

03/25/2026

Service & Tools

Group Responsible: Service & Tools

Affected Area: BNLBox

Maintenance Type: transparent

The scheduled BNLBox upgrade for today has been canceled. A new announcement will be issued once downtime has been rescheduled.


Submitted by: Louis Pelosi <lpelosi@bnl.gov>

03/25/2026

Service & Tools

Group Responsible: Service & Tools

Affected Area: BNLBox

Maintenance Type: scheduled downtime

BNLBox will be unavailable on 03/25/2026 starting at 08:15 AM EST for approximately one hour to perform a scheduled upgrade. During this time, users will experience a temporary disruption in service. We apologize for the inconvenience and appreciate your patience.


Submitted by: Louis Pelosi <lpelosi@bnl.gov>

03/02/2026

IT Services

Group Responsible: IT Services

Affected Area: Atlas GPFS

Maintenance Type: transparent

We will be upgrading the Atlas GPFS servers (atlasgpfs01) on Monday 3/2/2026 beginning at 10:00 AM. This should be a transparent upgrade to the RHEL and GPFS software. If there are any issues, please open an RT ticket so they can be addressed.


Submitted by: Joe Frith <jfrith@bnl.gov>

02/19/2026

IT Services

Group Responsible: IT Services

Affected Area: HPC GPFS & RHEL 8

Maintenance Type: transparent

We will be upgrading the HPC/IC2 GPFS servers (hpcgpfs01) beginning at 9:30 AM. This should be a transparent upgrade to the RHEL and GPFS software. If there are any issues, please open an RT ticket so they can be addressed.


Submitted by: Joe Frith <jfrith@bnl.gov>