BCDR Policy

Business Continuity and Disaster Recovery (BCDR) Policy

AES ETL Control Panel for Microsoft Fabric

Publisher: Assurance eServices
Workload Name: AssuranceEServices.AESETLPanel
Document Version: 1.1
Effective Date: April 28, 2026
Last Updated: April 28, 2026
Next Review Date: October 28, 2026


Table of Contents

  1. Executive Summary
  2. Policy Overview
  3. Scope and Objectives
  4. Recovery Objectives
  5. Architecture and Components
  6. Business Continuity Strategy
  7. Disaster Recovery Strategy
  8. Backup and Data Protection
  9. Incident Response Procedures
  10. Testing and Maintenance
  11. Roles and Responsibilities
  12. Communication Plan
  13. Compliance and Audit

Executive Summary

This Business Continuity and Disaster Recovery (BCDR) policy outlines Assurance eServices’ commitment to maintaining the availability, integrity, and security of the AES ETL Control Panel workload for Microsoft Fabric. This policy ensures minimal disruption to customer operations during unexpected events, natural disasters, or system failures.

Key Highlights:

  • RTO (Recovery Time Objective): 4 hours
  • RPO (Recovery Point Objective): 15 minutes
  • Service Availability SLA: 99.9%
  • Global Distribution: Automatic failover across Azure regions
  • Customer Data Protection: All pipeline configurations and business data stored in customer-controlled Microsoft Fabric OneLake
  • Minimal AES Data: Only 7 operational identifiers stored in AES secure databases (AES-256 encrypted)

Data Collection and Storage Transparency

Customer-Controlled Data (Your OneLake):

  • ✅ All pipeline configurations and job definitions
  • ✅ All business data processed by pipelines
  • ✅ Execution logs (if enabled by customer)
  • ✅ User preferences and settings
  • ✅ Complete control over retention, backup, and recovery

AES Operational Data (AES Secure Databases):

AES collects and stores ONLY 7 operational identifiers for basic service operation:

Data PointPurposeStorage
tenantIdIdentify your Fabric tenantAES Database (AES-256)
workspaceIdIdentify workspaceAES Database (AES-256)
graphqlitemIdIdentify pipeline itemAES Database (AES-256)
JobCountTrack job countAES Database (AES-256)
JobStepCountTrack job step countAES Database (AES-256)
LastJobAuditIDTrack last audit IDAES Database (AES-256)
LastJobStepAuditIDTrack last step audit IDAES Database (AES-256)

What AES Does NOT Store:

  • ❌ User identities, names, emails, or personal information
  • ❌ Execution logs or detailed diagnostic data
  • ❌ Performance metrics or telemetry
  • ❌ Pipeline configurations or business logic
  • ❌ Authentication credentials or tokens

See our complete Privacy Policy for details.


Policy Overview

Purpose

The purpose of this BCDR policy is to:

  1. Ensure continuous availability of the AES ETL Control Panel workload
  2. Minimize service disruption during unforeseen events
  3. Protect customer data and maintain data integrity
  4. Define recovery procedures and responsibilities
  5. Establish testing and maintenance schedules
  6. Comply with industry standards and regulatory requirements

Commitment

Assurance eServices is committed to:

  • Maintaining a robust and tested disaster recovery plan
  • Regular review and updates of BCDR procedures
  • Transparent communication with customers during incidents
  • Continuous improvement of resilience and recovery capabilities
  • Meeting or exceeding committed SLAs

Scope and Objectives

Scope

This BCDR policy covers:

  • Frontend Application: Azure Static Web Apps hosting infrastructure
  • Authentication Services: Microsoft Entra ID integration
  • Customer Data: Pipeline configurations, business data, and execution logs (if enabled) stored in customer’s OneLake
  • AES Operational Data: 7 operational identifiers stored in AES secure databases
    • tenantId, workspaceId, graphqlitemId, JobCount, JobStepCount, LastJobAuditID, LastJobStepAuditID
  • Execution Services: Fabric Spark compute within customer capacity
  • Support Infrastructure: Documentation, support portals, and communication channels

Out of Scope:

  • Customer’s Microsoft Fabric capacity (managed by Microsoft)
  • Customer’s OneLake storage (managed by Microsoft)
  • Customer’s network infrastructure
  • Third-party services not controlled by Assurance eServices

Objectives

Primary Objectives:

  1. Availability: Maintain 99.9% uptime for the workload frontend
  2. Data Integrity: Ensure zero data loss for customer configurations
  3. Recovery Speed: Restore full service within 4 hours of incident
  4. Customer Communication: Notify customers within 30 minutes of service disruption
  5. Continuous Operation: Enable customers to continue operations during regional outages

Recovery Objectives

Service Level Commitments

MetricTargetMeasurement
RTO (Recovery Time Objective)4 hoursTime to restore full service functionality
RPO (Recovery Point Objective)15 minutesMaximum acceptable data loss window
Service Availability99.9%Monthly uptime percentage
Incident Response Time30 minutesTime to acknowledge critical incidents
Customer Notification30 minutesTime to notify customers of service disruption
Critical Bug Fix48 hoursTime to deploy critical security patches

Priority Classification

Priority 1 (Critical):

  • Complete service outage affecting all customers
  • Security breach or data exposure
  • Authentication system failure
  • RTO: 4 hours | RPO: 15 minutes

Priority 2 (High):

  • Partial service degradation affecting multiple customers
  • Performance issues impacting user experience
  • Non-critical feature failures
  • RTO: 8 hours | RPO: 1 hour

Priority 3 (Medium):

  • Single customer impact
  • Minor feature issues
  • Documentation or UI inconsistencies
  • RTO: 24 hours | RPO: 4 hours

Priority 4 (Low):

  • Cosmetic issues
  • Enhancement requests
  • Non-urgent documentation updates
  • RTO: 5 business days | RPO: N/A

Architecture and Components

System Architecture

┌─────────────────────────────────────────────────────────────┐
│                    Customer's Fabric Tenant                  │
│  ┌────────────────────────────────────────────────────────┐ │
│  │              Microsoft Fabric Platform                  │ │
│  │  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐ │ │
│  │  │   OneLake    │  │ Spark Compute│  │  Monitoring  │ │ │
│  │  │   Storage    │  │    Engine    │  │     Hub      │ │ │
│  │  └──────────────┘  └──────────────┘  └──────────────┘ │ │
│  └────────────────────────────────────────────────────────┘ │
│                            ▲                                 │
│                            │                                 │
└────────────────────────────┼─────────────────────────────────┘
                             │
                   ┌─────────┴──────────┐
                   │   HTTPS / API      │
                   └─────────┬──────────┘
                             │
┌────────────────────────────▼─────────────────────────────────┐
│              AES ETL Control Panel Workload                   │
│  ┌────────────────────────────────────────────────────────┐  │
│  │         Azure Static Web Apps (Global)                  │  │
│  │  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐  │  │
│  │  │   Primary    │  │  Secondary   │  │    Backup    │  │  │
│  │  │   Region     │  │   Region     │  │   Region     │  │  │
│  │  │  (West US 2) │  │  (East US 2) │  │ (West Europe)│  │  │
│  │  └──────────────┘  └──────────────┘  └──────────────┘  │  │
│  └────────────────────────────────────────────────────────┘  │
└───────────────────────────────────────────────────────────────┘
                             │
                   ┌─────────┴──────────┐
                   │  Microsoft Entra   │
                   │   Authentication   │
                   └────────────────────┘

Component Resilience

ComponentRedundancyFailoverRecovery
Frontend ApplicationMulti-region deploymentAutomatic< 5 minutes
Authentication (Entra)Microsoft-managed HAAutomatic< 1 minute
Customer Data (OneLake)Microsoft-managed HAAutomaticN/A (Always available)
AES Operational Data (7 identifiers)Multi-region backupsAES-managed restore< 30 minutes
Spark ExecutionCustomer capacityN/ACustomer-controlled
Static Content (CDN)Global edge distributionAutomatic< 1 minute

Business Continuity Strategy

Proactive Measures

1. High Availability Architecture

  • Multi-region deployment across 3 Azure regions
  • Azure Static Web Apps with automatic load balancing
  • Global CDN distribution for static assets
  • Health monitoring with automatic failover

2. Monitoring and Alerting

  • Real-time health checks every 60 seconds
  • Automated alerting for service degradation
  • Performance monitoring and threshold alerts
  • Customer-facing status page updates

3. Redundancy and Fault Tolerance

  • No single point of failure in critical path
  • Stateless frontend architecture for horizontal scaling
  • Data stored in highly available OneLake storage
  • Graceful degradation for non-critical features

Operational Resilience

Dependency Management:

  • Microsoft Fabric Platform: Customer’s Fabric capacity managed by Microsoft (99.9% SLA)
  • Azure Static Web Apps: Multi-region hosting with automatic failover
  • Microsoft Entra ID: Enterprise authentication with Microsoft’s SLA
  • Azure CDN: Global content delivery with edge caching
  • OneLake Storage: Customer’s pipeline data with Microsoft’s durability guarantees
  • AES Databases: 7 operational identifiers with automated encrypted backups

Capacity Planning:

  • Automated scaling based on usage patterns
  • Regular capacity reviews and load testing
  • Performance benchmarking and optimization
  • Resource allocation monitoring

Disaster Recovery Strategy

Disaster Scenarios

Scenario 1: Regional Azure Outage

  • Impact: Primary region unavailable
  • Detection: Health checks fail for 2 consecutive minutes
  • Response: Automatic DNS failover to secondary region
  • Recovery Time: 5-10 minutes
  • Customer Impact: Minimal, transparent failover

Scenario 2: Complete Azure Service Disruption

  • Impact: All Azure regions affected
  • Detection: Cross-region health check failure
  • Response: Activate tertiary backup region (West Europe)
  • Recovery Time: 30 minutes to 2 hours
  • Customer Impact: Service unavailable during failover

Scenario 3: Customer Data Loss (OneLake)

  • Impact: Customer’s OneLake storage affected
  • Detection: Customer report or OneLake API errors
  • Response: Customer utilizes OneLake versioning and snapshots (Microsoft Fabric-managed)
  • Recovery Time: Customer-initiated, typically 15 minutes to 1 hour
  • Customer Impact: Restore from last checkpoint (OneLake versioning)
  • AES Role: Advisory support only; customer controls OneLake data recovery

Scenario 3b: AES Operational Data Loss

  • Impact: AES operational identifiers (7 data points) corrupted or lost
  • Detection: Service health monitoring alerts
  • Response: Restore from AES encrypted backups
  • Recovery Time: 15-30 minutes
  • Customer Impact: Minimal; identifiers used for basic tracking only

Scenario 4: Security Incident

  • Impact: Potential unauthorized access
  • Detection: Security monitoring alerts
  • Response: Immediate service isolation and investigation
  • Recovery Time: 4-24 hours depending on severity
  • Customer Impact: Temporary service suspension

Scenario 5: Application Bug or Bad Deployment

  • Impact: Service degradation or outage
  • Detection: Automated testing or customer reports
  • Response: Rollback to previous stable version
  • Recovery Time: 15-30 minutes
  • Customer Impact: Brief service interruption

Recovery Procedures

Automatic Recovery (No Manual Intervention):

  1. Regional failover via Azure Traffic Manager
  2. CDN cache refresh and distribution
  3. Health check validation of new region
  4. Automatic service restoration

Manual Recovery (Requires Intervention):

  1. Detection: On-call engineer alerted via PagerDuty
  2. Assessment: Evaluate scope and impact (15 min)
  3. Communication: Update status page and notify customers (15 min)
  4. Execution: Execute recovery runbook (2-3 hours)
  5. Validation: Verify service restoration and data integrity (30 min)
  6. Notification: Confirm resolution to customers (15 min)

Data Recovery

Customer-Controlled Data Recovery (Your OneLake):

Pipeline Configurations:

  • Source: Fabric item storage in customer’s OneLake
  • Backup Method: OneLake automatic versioning (Microsoft-managed)
  • Retention: Customer-controlled via OneLake policies
  • Recovery Method: Customer restores via Fabric UI or API
  • Recovery Time: Customer-initiated, typically < 5 minutes

Execution Logs (Optional, Customer-Enabled):

  • Source: OneLake Lakehouse tables (if enabled by customer)
  • Backup Method: Delta table versioning in customer’s OneLake
  • Retention: Customer-configurable (default 90 days)
  • Recovery Method: Time-travel queries on Delta tables (customer-controlled)
  • Recovery Time: Immediate (no recovery needed)
  • Note: AES does NOT store execution logs

Business Data:

  • Source: Customer’s OneLake storage
  • Backup Method: OneLake versioning and snapshots (Microsoft-managed)
  • Retention: Customer-controlled
  • Recovery Method: Customer-managed via Fabric platform
  • Recovery Time: Customer-controlled

AES Operational Data Recovery (AES Databases):

7 Operational Identifiers:

  • Source: AES secure databases
  • Data Points: tenantId, workspaceId, graphqlitemId, JobCount, JobStepCount, LastJobAuditID, LastJobStepAuditID
  • Backup Method: Automated daily backups with multi-region redundancy
  • Encryption: AES-256 encrypted backups
  • Retention: Duration of service + 90-day grace period after termination
  • Recovery Method: Automated restore from encrypted backups
  • Recovery Time: 15-30 minutes
  • Purpose: Basic service operation tracking only

Backup and Data Protection

Data Classification

Customer-Controlled Data (Your OneLake):

Data TypeStorage LocationBackup MethodRetentionCriticality
Pipeline ConfigurationsCustomer’s OneLakeOneLake versioning (Microsoft-managed)Customer-controlledCritical
Business DataCustomer’s OneLakeOneLake versioning (Microsoft-managed)Customer-controlledCritical
Execution Logs (Optional)Customer’s OneLake LakehouseDelta table historyCustomer-configurableMedium
User PreferencesCustomer’s OneLake item propertiesOneLake versioningCustomer-controlledLow

AES-Managed Data:

Data TypeStorage LocationBackup MethodRetentionCriticality
Operational Identifiers (7)AES secure databasesAutomated daily backups (AES-256)Active + 90 days post-terminationMedium
Application CodeGitHub repositoryGit version controlIndefiniteCritical
Static AssetsAzure Static Web AppsDeployment history30 daysLow

Backup Strategy

Customer Data Backups (Customer’s OneLake):

  • Responsibility: Microsoft Fabric platform (customer-controlled policies)
  • Mechanism: OneLake automatic versioning and snapshots
  • Frequency: Real-time with every save
  • Retention: Customer-configurable via OneLake policies
  • Recovery: Customer restores via Fabric UI, API, or OneLake tools
  • Encryption: AES-256 (Microsoft-managed)
  • Data Includes: Pipeline configurations, job definitions, business data, execution logs (if enabled)

AES Operational Data Backups (AES Databases):

  • Responsibility: Assurance eServices
  • Data: 7 operational identifiers (tenantId, workspaceId, graphqlitemId, JobCount, JobStepCount, LastJobAuditID, LastJobStepAuditID)
  • Mechanism: Automated database backups with multi-region redundancy
  • Frequency: Daily automated backups
  • Retention: Duration of active service + 90-day grace period post-termination
  • Recovery: AES-managed automated restore
  • Encryption: AES-256 encrypted backups
  • Purpose: Service restoration and basic operational tracking

Application Code:

  • Repository: GitHub with branch protection
  • Backup Frequency: Real-time commits
  • Retention: Indefinite
  • Recovery: Git clone and redeploy

Data Protection Measures

Customer Data Protection (Your OneLake):

  • Encryption at Rest: AES-256 for all OneLake data (Microsoft-managed)
  • Encryption in Transit: TLS 1.2+ for all API communication
  • Key Management: Microsoft-managed encryption keys
  • Access Control: Fabric RBAC (Role-Based Access Control)
  • Authentication: Microsoft Entra ID with MFA support
  • Audit Logging: Optional customer-controlled logging to OneLake
  • Data Residency: Remains in customer’s selected Fabric capacity region
  • No Data Transfer: Zero customer data transfer to AES or third parties

AES Operational Data Protection (7 Identifiers):

  • Encryption at Rest: AES-256 for database storage
  • Encryption in Transit: TLS 1.2+ for all communication
  • Key Management: AES-managed encryption keys
  • Access Control: Strict internal access controls (least privilege)
  • Audit Logging: All access to operational data logged
  • Data Minimization: Only 7 identifiers collected (no personal data)
  • Retention: Active service + 90-day grace period
  • Deletion: Automatic deletion after grace period, or immediate upon request

Separation of Concerns:

  • Customer pipeline data and business information: 100% customer-controlled in OneLake
  • AES operational identifiers: Minimal tracking data in AES secure databases
  • No overlap: AES cannot access customer’s OneLake data
  • Customer cannot access AES operational databases

Incident Response Procedures

Incident Classification

Critical (P1):

  • Complete service outage
  • Security breach or data exposure
  • Response: Immediate escalation, 24/7 response

High (P2):

  • Partial service degradation
  • Performance issues affecting multiple customers
  • Response: Within 1 hour, business hours priority

Medium (P3):

  • Single customer impact
  • Non-critical feature failures
  • Response: Within 4 hours, business hours

Low (P4):

  • Cosmetic issues
  • Enhancement requests
  • Response: Next business day

Incident Response Workflow

Phase 1: Detection (0-5 minutes)

  1. Automated monitoring alerts triggered
  2. PagerDuty notification to on-call engineer
  3. Initial triage and classification

Phase 2: Assessment (5-20 minutes)

  1. Verify incident scope and customer impact
  2. Classify incident priority (P1-P4)
  3. Assemble response team if needed
  4. Begin incident log documentation

Phase 3: Communication (20-30 minutes)

  1. Update status page: https://status.assuranceeservices.com
  2. Send email notifications to affected customers
  3. Post updates to support portal
  4. Establish communication cadence (every 30 min for P1)

Phase 4: Mitigation (30 minutes – 4 hours)

  1. Execute recovery runbook
  2. Apply immediate workarounds if available
  3. Escalate to engineering team if needed
  4. Monitor recovery progress
  5. Validate service restoration

Phase 5: Resolution (Varies)

  1. Confirm full service restoration
  2. Verify data integrity
  3. Conduct smoke testing
  4. Update status page to “Resolved”
  5. Send resolution notification to customers

Phase 6: Post-Incident Review (Within 5 days)

  1. Document incident timeline
  2. Conduct root cause analysis
  3. Identify preventive measures
  4. Update runbooks and documentation
  5. Publish post-mortem (for P1 incidents)

Escalation Matrix

LevelRoleContact MethodResponse Time
Level 1On-Call EngineerPagerDutyImmediate
Level 2Engineering LeadPhone + Email15 minutes
Level 3Engineering DirectorPhone30 minutes
Level 4CTOPhone1 hour
Level 5CEOPhone2 hours

Communication Templates

Initial Notification:

Subject: [INCIDENT] AES ETL Control Panel - Service Disruption

We are currently experiencing an issue with the AES ETL Control Panel 
workload affecting [scope]. Our team is actively investigating.

Status: Investigating
Impact: [Description]
Started: [Timestamp]
Next Update: [Timestamp]

Updates: https://status.assuranceeservices.com
Support: support@assuranceeservices.com

Resolution Notification:

Subject: [RESOLVED] AES ETL Control Panel - Service Restored

The issue affecting the AES ETL Control Panel workload has been 
resolved. All services are now operating normally.

Resolution Time: [Timestamp]
Duration: [Duration]
Root Cause: [Brief description]

A detailed post-incident review will be published within 5 business days.

Thank you for your patience.

Testing and Maintenance

Testing Schedule

Test TypeFrequencyScopeDuration
DR Failover TestQuarterlyFull regional failover4 hours
Backup RestorationMonthlySample data recovery2 hours
Incident Response DrillQuarterlyTeam response simulation3 hours
Security TestingAnnuallyPenetration testing1 week
Load TestingMonthlyPerformance validation4 hours
Runbook ValidationQuarterlyProcedure verification2 hours

DR Testing Procedures

Quarterly Failover Test:

  1. Pre-Test Planning (1 week before):
    • Schedule testing window
    • Notify customers via email
    • Assemble testing team
    • Review runbooks
  2. Test Execution:
    • Simulate primary region failure
    • Trigger failover to secondary region
    • Validate service availability
    • Test data access and integrity
    • Measure actual RTO/RPO
  3. Post-Test Activities:
    • Document test results
    • Compare against targets
    • Identify improvement areas
    • Update runbooks
    • Report to stakeholders

Maintenance Windows

Scheduled Maintenance:

  • Frequency: Monthly (first Sunday of each month)
  • Window: 2:00 AM – 6:00 AM UTC
  • Notification: 7 days advance notice
  • Impact: Minimal, rolling updates when possible

Emergency Maintenance:

  • Trigger: Critical security patches
  • Notification: 4 hours advance notice (when possible)
  • Approval: CTO authorization required

Review and Updates

Policy Review:

  • Frequency: Semi-annually (January and July)
  • Participants: Engineering, Operations, Security, Legal
  • Deliverables: Updated policy document, change log

Runbook Updates:

  • Frequency: After each incident or test
  • Process: Document lessons learned, update procedures
  • Approval: Engineering Lead review

Roles and Responsibilities

BCDR Team Structure

Incident Commander (IC)

  • Role: Overall incident coordination
  • Responsibilities:
    • Declare and classify incidents
    • Coordinate response team
    • Make executive decisions
    • Customer communication oversight
  • Contact: Available 24/7 via PagerDuty

Operations Lead

  • Role: Technical recovery execution
  • Responsibilities:
    • Execute recovery procedures
    • Coordinate with Azure support
    • System health validation
    • Technical status updates
  • Contact: On-call engineer rotation

Communications Lead

  • Role: Customer and stakeholder communication
  • Responsibilities:
    • Status page updates
    • Customer email notifications
    • Support ticket management
    • Executive briefings
  • Contact: Support team manager

Data Recovery Specialist

  • Role: Data restoration and validation
  • Responsibilities:
    • AES operational data (7 identifiers) recovery from encrypted backups
    • Assist customers with OneLake data recovery guidance
    • Data integrity verification for AES operational databases
    • Backup validation and testing
    • Coordination with Microsoft Fabric support for OneLake issues
  • Contact: Senior engineer on-call

Security Lead

  • Role: Security incident response
  • Responsibilities:
    • Security threat assessment
    • Breach investigation
    • Compliance coordination
    • Forensics preservation
  • Contact: Security team lead

Customer Responsibilities

Customer’s Role in BCDR:

  1. OneLake Data Management:
    • Configure appropriate retention policies for your OneLake data
    • Enable and validate OneLake versioning for critical pipeline configurations
    • Test data recovery procedures using OneLake restore capabilities
    • Manage execution log retention (if enabled)
    • Control backup policies for business data
  2. Capacity Management:
    • Maintain adequate Fabric capacity for normal operations and recovery
    • Monitor capacity utilization and plan for growth
    • Ensure sufficient capacity for disaster recovery scenarios
  3. Access Management:
    • Maintain current contact information for incident notifications
    • Designate backup administrators for continuity
    • Review and update Fabric workspace permissions regularly
    • Manage Microsoft Entra ID access policies
  4. Testing Participation:
    • Participate in scheduled DR tests (optional but recommended)
    • Validate your OneLake data recovery procedures
    • Test pipeline restoration from OneLake backups
    • Report service issues promptly
  5. Data Ownership:
    • Understand that all pipeline configurations and business data remain in your OneLake
    • AES stores only 7 operational identifiers for basic service tracking
    • You control all aspects of your data lifecycle (retention, deletion, recovery)

Communication Plan

Stakeholders

Internal Stakeholders:

  • Engineering team
  • Support team
  • Executive leadership
  • Sales and customer success

External Stakeholders:

  • Customers (tenant administrators)
  • Microsoft Fabric support team
  • Azure support team
  • Compliance auditors

Communication Channels

Status Page: https://status.assuranceeservices.com

  • Real-time service status
  • Incident history
  • Scheduled maintenance calendar
  • Subscribe for email/SMS alerts

Email Notifications:

  • Incident alerts to tenant administrators
  • Maintenance window announcements
  • Post-incident summaries

Support Portal: https://assuranceeservices.com/support

  • Submit support tickets
  • Access knowledge base
  • View open incidents

Emergency Contact:

Communication Frequency

During Active Incident:

  • P1 (Critical): Every 30 minutes
  • P2 (High): Every 2 hours
  • P3 (Medium): Every 4 hours
  • P4 (Low): Daily update

Scheduled Maintenance:

  • Initial Notice: 7 days before
  • Reminder: 24 hours before
  • Start: At maintenance window start
  • Completion: Upon completion

Compliance and Audit

Regulatory Compliance

Standards Adherence:

  • SOC 2 Type II: Annual audit
  • ISO 27001: Information security management
  • GDPR: European data protection
  • CCPA: California consumer privacy
  • HIPAA: Healthcare data protection (when applicable)

Audit Requirements

BCDR Audit Activities:

  1. Quarterly DR Test Results Review
  2. Annual Penetration Testing Report
  3. Incident Response Log Review
  4. RTO/RPO Compliance Measurement
  5. Backup Validation Records
  6. Access Control Audit Logs

Documentation Retention

Retention Periods:

  • Incident Logs: 7 years
  • DR Test Results: 5 years
  • Backup Logs: 3 years
  • Communication Records: 3 years
  • Policy Revisions: Indefinite

Compliance Reporting

Monthly Reports:

  • Service availability metrics
  • Incident summary
  • RTO/RPO compliance

Quarterly Reports:

  • DR test results
  • BCDR policy compliance
  • Improvement initiatives

Annual Reports:

  • Comprehensive BCDR review
  • Audit findings and remediation
  • Strategic improvements

Data Retention After Service Termination

Customer Data (Your OneLake)

Immediate Access:

  • All pipeline configurations, business data, and execution logs remain in your OneLake under your complete control
  • AES does not and cannot delete your OneLake data
  • You manage retention and deletion based on your requirements
  • Zero impact from AES service termination

AES Operational Data (7 Identifiers)

Post-Termination Retention:

  • Grace Period: 90 days after service termination
  • Purpose: Allow service restoration if customer renews
  • Data: Only the 7 operational identifiers (tenantId, workspaceId, graphqlitemId, JobCount, JobStepCount, LastJobAuditID, LastJobStepAuditID)
  • Automatic Deletion: Automatically and permanently deleted after 90-day grace period
  • Immediate Deletion Option: Request immediate deletion by contacting privacy@assuranceeservices.com

No Data Transfer:

  • AES never transfers customer OneLake data to any location
  • No backup of customer business data by AES at any time
  • No retention of customer pipeline configurations by AES
  • Only 7 minimal identifiers retained during grace period

See our complete Privacy Policy for details on data handling practices.


Document Control

Version History

VersionDateAuthorChanges
1.0April 27, 2026AES BCDR TeamInitial publication
1.1April 28, 2026AES BCDR TeamUpdated data storage, backup, and recovery sections to align with Privacy Policy

Document Approval

RoleNameSignatureDate
CTO[Name][Signature]April 28, 2026
Engineering Director[Name][Signature]April 28, 2026
Legal Counsel[Name][Signature]April 28, 2026

Document Distribution


Contact Information

BCDR Inquiries:

Emergency Response (24/7):

Policy Questions:


Document Classification: Public
Next Review: October 27, 2026
Policy Owner: Chief Technology Officer
Document Location: https://assuranceeservices.com/bcdr-policy


This document represents Assurance eServices’ commitment to business continuity and disaster recovery for the AES ETL Control Panel workload. Regular reviews and updates ensure this policy remains current and effective.