System and Infrastructure Status News

ACES Partial Maintenance, October 9-10

Published

Infrastructure News Type: Outage Partial

Affected Infrastructure: aces.tamu.access-ci.org

Start Date: October 9, 2024, 2:00 p.m.

End Date: October 10, 2024, 11:00 p.m.

There will be upcoming maintenance on some GPU components for the ACES cluster. The maintenance period is 9am on Wednesday October 9 to 6pm on Thursday October 10. Other parts of ACES will remain available. For specific details, please visit the HPRC website at https://hprc.tamu.edu/.

Posted: March 20, 2026

Bridges-2 Maintenance October 8-10, 2024

Published

Infrastructure News Type: Outage Full

Affected Infrastructure: bridges2-em.psc.access-ci.org, bridges2-gpu.psc.access-ci.org, bridges2-rm.psc.access-ci.org, bridges2-ocean.psc.access-ci.org

Start Date: October 8, 2024, 1:00 p.m.

End Date: October 9, 2024, 1:00 p.m.

Thank you for your patience. Bridges-2 has returned early from its scheduled maintenance!

Bridges-2 Team

==========

Dear Bridges-2 users,

Bridges-2, including all VMs and filesystems, will be unavailable due to scheduled maintenance starting on Tuesday October 8, 2024 at 8am Eastern Time. We anticipate that the system will return by Thursday October 10 at 8am Eastern Time. During this time, you will be unable to access the system. The Slurm queue will be preserved and queued jobs will begin running once the machine has returned to service.

This maintenance is for an operating system upgrade to all compute nodes, login nodes and data transfer nodes (DTNs).

Please direct any questions to help@psc.edu (mailto:help@psc.edu) and our team will be happy to assist you.

Thank you,
Bridges-2 Team

Posted: March 20, 2026

ACCESS central/online services outage/news email signup now available

Published

Infrastructure News Type: Reconfiguration

Affected Infrastructure: system-news.access-ci.org

Start Date: October 7, 2024, 11:00 a.m.

End Date: October 31, 2024, 11:00 p.m.

ACCESS introduces the ability for members of the community to sign up to receive emails about ACCESS central/online services outages and other news. To sign up for these emails, please JOIN the ACCESS System Status News affinity group (https://support.access-ci.org/affinity-groups/access-system-status-news).

Posted: March 20, 2026

Testing Broadcast of Infrastructure News

Published

Infrastructure News Type: Outage Full

Affected Infrastructure: support.access-ci.org

Start Date: October 1, 2024, 6:00 a.m.

End Date: July 13, 2024, 6:00 a.m.

This is a test.

Posted: March 20, 2026

ACCESS XDMoD Downtime

Published

Infrastructure News Type: Outage Partial

Affected Infrastructure: xdmod.access-ci.org

Start Date: September 30, 2024, 12:00 p.m.

End Date: October 3, 2024, 5:00 p.m.

Update 10/01/2024: The main web service has been updated successfully; however, an unrelated power outage is causing a hardware issue for one of the databases. The following services are only partially available at this time:

- Efficiency tab
- Job Viewer tab for the SUPReMM realm

Another update will be posted when the service is fully restored in the coming days. We apologize for any inconvenience.

End of Update

There is a scheduled downtime for ACCESS XDMoD on Monday, September 30th from approximately 07:00 EDT until 07:00 EDT the following day. The service will be updated from XDMoD 10.5 to 11.0 during this time. The web service and API access through the Data Analytics Framework will be unavailable during the outage.

If you wish to use the xdmod-data Python package with ACCESS XDMoD after the upgrade, you will need to upgrade the package to the latest 1.0.1 version using pip install --upgrade xdmod-data.

An update will be posted when the service is fully restored. We apologize for any inconvenience this may cause.
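For users unsure whether their environment already meets the xdmod-data 1.0.1 requirement mentioned in this notice, the following is a minimal sketch of a version check. The helper names (parse_version, needs_upgrade, installed_version) are hypothetical and not part of xdmod-data, and the version parsing is deliberately simplified: it only looks at the first three numeric components.

```python
import re
from importlib import metadata

# Minimum version named in the notice above.
REQUIRED = (1, 0, 1)


def parse_version(text):
    """Simplified parse: '1.0.1' -> (1, 0, 1). Pre-release tags are ignored."""
    return tuple(int(n) for n in re.findall(r"\d+", text)[:3])


def needs_upgrade(installed):
    """Return True if the given version string is older than REQUIRED."""
    return parse_version(installed) < REQUIRED


def installed_version(package="xdmod-data"):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    if found is None:
        print("xdmod-data is not installed")
    elif needs_upgrade(found):
        print(f"xdmod-data {found} found; run: pip install --upgrade xdmod-data")
    else:
        print(f"xdmod-data {found} is new enough")
```

If the script reports an older version, the pip command from the notice applies as written.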

Posted: March 20, 2026

Reconfiguration of registry.access-ci.org to use DynamoDB instead of LDAP

Published

Infrastructure News Type: Reconfiguration

Affected Infrastructure: registry.access-ci.org

Start Date: September 10, 2024, 1:00 p.m.

End Date: September 10, 2024, 1:05 p.m.

On September 10, 2024 at 8:00am CDT, the ACCESS User Registry at https://registry.access-ci.org will be reconfigured to use DynamoDB for user attribute storage instead of LDAP. The switch to DynamoDB affects only OAuth2/OIDC clients using the ACCESS CI "Named Configuration" which causes ACCESSID@access-ci.org to be returned as the sub claim in the id_token. DynamoDB (https://aws.amazon.com/dynamodb/) is an AWS-managed serverless NoSQL service with high performance and a 99.999% SLA. Switching to DynamoDB is a one-line configuration change which can easily be reverted if any issues arise. No downtime is expected. For any questions or concerns, please contact help@cilogon.org (mailto:help@cilogon.org) or open an ACCESS Help Ticket (https://access-ci.atlassian.net/servicedesk/customer/portal/2/create/30).
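Operators of affected OAuth2/OIDC clients may want to confirm that the sub claim still has the expected ACCESSID@access-ci.org form after the change. The sketch below decodes the payload of an id_token to inspect its claims. It is an illustration only: the sample token is built locally and unsigned, the ACCESS ID "jdoe" is made up, and the code does not verify signatures, which a real client must do with its OIDC library.

```python
import base64
import json


def decode_jwt_payload(token):
    """Return the claims of a JWT as a dict. Does NOT verify the signature."""
    payload_b64 = token.split(".")[1]
    # base64url segments in JWTs omit padding; restore it before decoding.
    padding = "=" * (-len(payload_b64) % 4)
    return json.loads(base64.urlsafe_b64decode(payload_b64 + padding))


def _b64url(data):
    """Encode a dict as an unpadded base64url JSON segment (for the sample only)."""
    raw = json.dumps(data).encode()
    return base64.urlsafe_b64encode(raw).rstrip(b"=").decode()


# Hypothetical, unsigned token for illustration; "jdoe" is not a real ACCESS ID.
sample_token = ".".join(
    [_b64url({"alg": "none"}), _b64url({"sub": "jdoe@access-ci.org"}), ""]
)

claims = decode_jwt_payload(sample_token)
print(claims["sub"])
```

With a real id_token in place of sample_token, the same function shows whether sub carries the @access-ci.org suffix described above.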

Posted: March 20, 2026

DUO Authentication Maintenance Saturday September 7 2024 05:00am EDT

Published

Infrastructure News Type: Outage Partial

Affected Infrastructure: allocations.access-ci.org, identity.access-ci.org, registry.access-ci.org, access-ci.org

Start Date: September 7, 2024, 9:00 a.m.

End Date: September 7, 2024, 3:00 p.m.

DUO Security has announced a maintenance period for Saturday September 7 starting at 05:00am EDT, for an estimated period of 6 hours, which may cause disruption to ACCESS DUO authentication attempts:

Begin forwarded message:

From: support-noreply@status.duosecurity.com
Subject: Duo Maintenance - Multiple Deployments: Scheduled Maintenance - 7 September 2024
Date: August 30, 2024 at 4:45:20 PM EDT

Multiple Deployments: Scheduled Maintenance

Upcoming scheduled maintenance notice

The Duo Site Reliability Engineering (SRE) team is scheduling regular maintenance on the following deployments, beginning with DUO70: DUO70 DUO1 DUO55 DUO62 DUO63 DUO73 DUO65 DUO79 DUO77 DUO80 DUO78

The goal of this maintenance is to better balance authentication traffic across all Duo deployments in our continued efforts to provide a high performance and resilient service. We expect minimum user impact since each specific migration window will occur during non-peak times for your organization.

Do I need to take action?

If your organization has strict IP filtering/firewall rules in place, you should ensure that all Duo IP ranges listed here https://help.duo.com/s/article/1337 and DNS names exist in any filtering rules if firewalls, SSL inspection, or other proxy rules whitelist communication to specific Duo IP ranges. In the event of an outage or failure, Duo’s service could automatically failover to any IP in those ranges. Otherwise, no action is required.

How does this migration affect you?

During the migration, we expect minimum intermittent authentication failures when users attempt to log in to Duo protected applications. Users may need to retry the authentication or potentially wait a few minutes before attempting another login.

After the migration

Duo Administrators with the Owner and Administrator role will have their notification settings for the Duo Status Page https://status.duo.com/ adjusted to reflect the new deployment within 24 hours. Duo Administrators with the other roles will need to update their Duo Status Page preferences manually using the steps in https://help.duo.com/s/article/2060.

Resources

- Documentation: How do I find my StatusPage deployment ID in the Duo Admin Panel and sign up for updates? - https://help.duo.com/s/article/2060
- Article: What are Duo's IP ranges and data residency areas by deployment? - https://help.duo.com/s/article/1337
- Duo Status Page - https://status.duo.com/

For any questions or concerns please email support@duo.com (mailto:support@duo.com).

Thank you for being a Cisco Duo customer!

Duo Site Reliability Engineering

Start time: Sep 7, 05:00 EDT
Estimated duration: 6 hours
Components affected: DUO1 - Core Authentication Service, DUO1 - Admin Panel, DUO1 - Push Delivery, DUO1 - Phone Call Delivery, DUO1 - SMS Message Delivery ...and 72 more components.

View full scheduled maintenance details (https://stspg.io/dx573gm8pvdm)

Posted: March 20, 2026

idp.access-ci.org Upgrade

Published

Infrastructure News Type: Reconfiguration

Affected Infrastructure: identity.access-ci.org

Start Date: August 19, 2024, 2:00 p.m.

End Date: August 19, 2024, 2:30 p.m.

On Monday August 19, 2024 at 9:00am CDT (14:00 UTC), the Shibboleth IdP (https://shibboleth.atlassian.net/wiki/spaces/IDP5) software used by the ACCESS CI Identity Provider (https://idp.access-ci.org/idp/) will be upgraded from v4.3.3 to v5.1.2. No downtime is expected.

Posted: March 20, 2026

Bridges-2 Ocean Filesystem Issues Persist

Published

Infrastructure News Type: Degraded

Affected Infrastructure: bridges2-ocean.psc.access-ci.org

Start Date: August 9, 2024, 3:30 p.m.

End Date: August 11, 2024, 2:30 p.m.

We are aware of the ongoing Bridges-2 Filesystem issues and are working with the vendor to address them. We will let you know when all of the issues have been resolved. We thank you for your continued patience.

Posted: March 20, 2026

Expanse maintenance - 5AM (PT) 07/24/2024 to 5AM (PT) 07/25/2024

Published

Infrastructure News Type: Outage Full

Affected Infrastructure: expanse.sdsc.access-ci.org, expanse-gpu.sdsc.access-ci.org, expanse-ps.sdsc.access-ci.org

Start Date: July 24, 2024, 12:00 p.m.

End Date: July 25, 2024, 12:00 p.m.

SDSC Expanse will be under maintenance from 5AM (PT) 07/24/2024 to 5AM (PT) 07/25/2024. During this time we will be working on the direct liquid cooling (DLC) infrastructure. A reservation has been put in place to prevent jobs from running during the maintenance period. The "squeue" command output will show "ReqNodeNotAvail, Reserved for maintenance" for jobs that do not fit in the time period before the maintenance begins. These jobs will run after we release the maintenance reservation.
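To see which of your pending jobs are being held back by the maintenance reservation, the squeue output can be filtered on the reason string quoted above. The sketch below assumes the text was produced by a command such as squeue -u $USER -h -o "%i|%r" (job ID and reason separated by a pipe). The function name and sample job IDs are hypothetical, and the exact reason formatting may vary between Slurm versions, so the match is on the ReqNodeNotAvail prefix only.

```python
# Reason prefix quoted in the notice above.
MAINTENANCE_REASON = "ReqNodeNotAvail"


def jobs_held_for_maintenance(squeue_text):
    """Return job IDs whose reason field mentions ReqNodeNotAvail.

    Expects one job per line in the form '<jobid>|<reason>'.
    Lines without a '|' separator are skipped.
    """
    held = []
    for line in squeue_text.splitlines():
        job_id, sep, reason = line.strip().partition("|")
        if sep and MAINTENANCE_REASON in reason:
            held.append(job_id)
    return held


# Made-up sample output for illustration.
sample = "\n".join(
    [
        "1001|ReqNodeNotAvail, Reserved for maintenance",
        "1002|Priority",
        "1003|Resources",
    ]
)

print(jobs_held_for_maintenance(sample))
```

Jobs reported this way need no action; as the notice says, they run once the maintenance reservation is released.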

Posted: March 20, 2026

Bridges-2 Ocean Filesystem Issues

Published

Infrastructure News Type: Degraded

Affected Infrastructure: bridges2-ocean.psc.access-ci.org

Start Date: July 19, 2024, 11:15 p.m.

End Date: July 22, 2024, 3:00 p.m.

We are aware of some intermittent issues with the Bridges-2 Ocean Filesystem. You might notice that your jobs pause and take slightly longer to run than usual, but most should continue to complete correctly. Our staff is working to resolve the issue.

Posted: March 20, 2026

Kerberos Replica Potential Outage

Published

Infrastructure News Type: Degraded

Affected Infrastructure: kerberos.access-ci.org

Start Date: July 18, 2024, 9:12 p.m.

End Date: July 20, 2024, 10:00 a.m.

Due to a cooling issue at NCSA, it is possible that the replica KDC hosted on-site could become unresponsive. Kerberos services should still operate in the event the replica goes down, but there may be a delay as things fail over to another replica server.

Posted: March 20, 2026

Upgrade registry.access-ci.org

Published

Infrastructure News Type: Degraded

Affected Infrastructure: registry.access-ci.org

Start Date: July 18, 2024, 11:00 a.m.

End Date: July 18, 2024, 11:30 a.m.

The COmanage Registry software which powers https://registry.access-ci.org/ will be upgraded from v4.1.x to v4.3.x. This process will take about 15 minutes and will invalidate existing web sessions. In particular, new user enrollments may be affected. Users should contact support@access-ci.atlassian.net (mailto:support@access-ci.atlassian.net) to report any issues that occur during user registration.

Posted: March 20, 2026

CILogon logins are failing for some users

Published

Infrastructure News Type: Outage Partial

Affected Infrastructure: identity.access-ci.org

Start Date: July 15, 2024, 3:07 p.m.

End Date: July 15, 2024, 5:28 p.m.

Web login to ACCESS portals and other services is failing for some users due to a problem in the CILogon service. Technical details: The AWS EFS mount points used by the LDAP servers became unavailable. We are in the process of removing/replacing EFS mount points and restarting the LDAP servers. The process will take several minutes.

Posted: March 20, 2026

Bridges-2 Jet Maintenance Monday July 15

Published

Infrastructure News Type: Outage Full

Affected Infrastructure: bridges2-em.psc.access-ci.org, bridges2-gpu.psc.access-ci.org, bridges2-rm.psc.access-ci.org, bridges2-ocean.psc.access-ci.org

Start Date: July 15, 2024, 1:30 p.m.

End Date: July 15, 2024, 11:00 p.m.

Bridges-2 will be unavailable on Monday, July 15 in order to upgrade the Jet filesystem. The vendor believes that this patch will fix an issue that has been causing Jet filesystem hangs and crashes over the past few weeks. Thank you for your patience. If you have any questions or problems, please submit a ticket to help@psc.edu.

Posted: March 20, 2026

Jira Service Management Incident - Some products are hard down - 3 July 2024

Published

Infrastructure News Type: Degraded

Affected Infrastructure: tickets.access-ci.org

Start Date: July 3, 2024, 11:42 p.m.

End Date: July 4, 2024, 1:43 a.m.

Between 03-07-2024 20:08 UTC and 03-07-2024 20:31 UTC, we experienced problems with issue creation and projects in Jira Service Management. The issue has been resolved and the service is operating normally. https://jira-service-management.status.atlassian.com/incidents/ctpbgx33bd7r

Posted: March 20, 2026

Bridges-2 Continued Maintenance

Published

Infrastructure News Type: Degraded

Affected Infrastructure: bridges2-em.psc.access-ci.org, bridges2-gpu.psc.access-ci.org, bridges2-rm.psc.access-ci.org, bridges2-ocean.psc.access-ci.org

Start Date: July 2, 2024, 3:08 a.m.

End Date: July 2, 2024, 10:08 p.m.

Update: Bridges-2 has returned to normal service!

The Bridges-2 Filesystem continues to experience some issues. The vendor has been notified and the PSC team continues to work to restore complete service.

Posted: March 20, 2026

Bridges-2 Maintenance Monday, July 1 - Tuesday, July 2

Published

Infrastructure News Type: Outage Full

Affected Infrastructure: bridges2-em.psc.access-ci.org, bridges2-gpu.psc.access-ci.org, bridges2-rm.psc.access-ci.org, bridges2-ocean.psc.access-ci.org

Start Date: July 1, 2024, 1:00 p.m.

End Date: July 1, 2024, 9:00 p.m.

Bridges-2 has returned early from scheduled maintenance.

Over the past few weeks, the Bridges-2 Ocean filesystem has encountered multiple disruptions to service. The vendor has a fix in place, but we will need to take an emergency maintenance period to implement it. As such, Bridges-2, including all VMs and filesystems, will be unavailable from 8:00AM Eastern time on Monday, July 1 through Noon Eastern time on Tuesday, July 2. We apologize for the short notice, but this is a fix that must be implemented as quickly as possible to prevent ongoing disruptions. Please contact help@psc.edu (mailto:help@psc.edu) if you have any questions or problems.

Posted: March 20, 2026

Important Update: Changes to Ticket Automation and Status Updates

Published

Infrastructure News Type: Reconfiguration

Affected Infrastructure: tickets.access-ci.org

Start Date: June 29, 2024, 6:19 p.m.

End Date: July 1, 2024, 5:00 a.m.

Due to recent licensing changes implemented by Atlassian, we have had to re-configure how automated ticket actions are performed. As a result, the status changes based on the comments added to tickets won’t be happening anymore. Additionally, some minor automatic corrections will no longer happen instantaneously but will happen eventually. These changes are being implemented over the next few days, so please bear with us if you encounter any discrepancies. We expect these changes to be completed by July 1, 2024.

Posted: March 20, 2026

Bridges-2 Degradation

Published

Infrastructure News Type: Outage Full

Affected Infrastructure: bridges2-em.psc.access-ci.org, bridges2-gpu.psc.access-ci.org, bridges2-rm.psc.access-ci.org, bridges2-ocean.psc.access-ci.org

Start Date: June 29, 2024, 4:16 p.m.

End Date: July 1, 2024, 1:00 p.m.

It appears that the Bridges-2 Ocean filesystem is trying to get a head start on its scheduled maintenance for Monday. We are experiencing some odd failures. While some of the system remains available, you may see some degraded service this weekend. The maintenance beginning at 8AM Eastern Time on Monday should address these issues.

Posted: March 20, 2026