Skip to main content

SMDS Integration Access Guide

Architectural guidance for consuming SMDS master data through EMP, API, database read replica, or datalake. This guide describes design considerations, compliance constraints, and requirements that architects should address when integrating with SMDS.

Data Consumption Pathways

Pathway selection depends on latency, volume, query patterns, and operational constraints. EMP is the recommended primary path for change-driven consumption; API for request-response; database read replica only when event streaming is not feasible; datalake for analytical or batch workloads.

Events (RETINA / EMP)

Event-driven consumption via Kafka topics provides near-real-time master data changes. Topics are log-compacted for full retention.

Consumption Strategy

  • Production: Start from the latest offset to avoid loading historical backlog and reduce initial sync time.
  • Lower environments: Starting from offset=0 can be used for load testing and validating consumption under high outbound volume.

Data Residency (GDPR)

SMDS master data must be stored exclusively within the European region. Design your persistence layer, caches, and any replicated copies to comply with EU-only storage.

EMP Access Requirements

AreaRequirement
Application identityApplication name(s), MAC Id, platform/category (TbM, FbM, MbM, Corporate Platform, BEP)
Consumption scopeRequired topics (Customer, Customer Contact, Vessel, etc.)
OwnershipPrimary and secondary contacts, team distribution list
Business caseBenefit case and utilization plan
TechnicalApplication ID(s) for consumer list
Environment timelineSIT, PP, Production consumption dates; confirm successful testing per environment
PerformanceBulk testing in SIT or PP before production high-load scenarios
GovernanceData retention, error handling, retry strategy
Data scopePrimary fields consumed; whether the application modifies master data and forwards to downstream systems
DesignArchitecture and integration design document

API

REST API access for transactional or on-demand reads. Use when you need request-response semantics or cannot consume events.

Design Considerations

  • Resilience: Define fallback or degradation when the API is unavailable. Backpressure from SMDS can affect availability.
  • Throughput: Document peak-hour and average daily load to align capacity and throttling.
  • Compliance: Confirm GDPR and data sensitivity handling for attributes consumed.

API Access Requirements

AreaRequirement
Identity & scopeRequested proxies, application(s), MAC Id, platform/category
OwnershipPrimary/secondary contacts, team distribution list
Business caseBenefit case and API usage in the solution
Environment timelineSIT, PP, Production; confirm successful testing per environment
Load profilePeak-hour load (req/hr), average daily load (req/day)
ImpactBusiness impact and major-incident risk if API is down
ResilienceAlternative or backup process when API is unavailable (mandatory)
DurationHow long API access is required
Non-functionalAvailability SLAs, security, GDPR affirmation, data sensitivity
PerformanceResponse time expectations, scalability for future load
IntegrationDownstream systems and integration points
TestingTools and processes for validation
DocumentationDesign and functional documentation level
SupportSupport and maintenance responsibilities

Database – Read Replica

Direct database access is a fallback when EMP consumption is not viable. Treat it as temporary; plan migration to event-driven consumption.

When to Use

Use the read replica only when:

  • Event streaming does not fit your architecture (e.g., legacy batch, specific query patterns).
  • You have a clear migration path to EMP or API.

Operational Impact

Read replica usage affects shared infrastructure. High query volume or complex queries can impact other consumers. Qualified data engineers should own and govern this integration.

Database Access Requirements

AreaRequirement
RationaleWhy EMP consumption is not feasible
IdentityApplication(s) accessing the database
OwnershipPrimary/secondary contacts, team distribution list
Business caseBenefit case and intended use
GovernanceConfirmation that experienced data experts will own the connection
LifecyclePlanned start and end dates; keep usage time-bound
Access patternTables, fields, and frequency (daily, hourly, ad-hoc)
PerformanceQuery volume estimate and performance impact assessment
Security & complianceEncryption, RBAC, GDPR and regional compliance
MigrationStrategy to move from database to EMP when feasible

Datalake

Datalake access supports analytical and batch workloads. Consider sync latency, freshness, and your local storage strategy.

Design Considerations

  • Sync latency: Understand impact of data sync delays beyond 24 hours.
  • Local storage: If storing master data locally, specify regions (EU, APAC, US) and ensure GDPR alignment for EU data.

Datalake Access Requirements

AreaRequirement
IdentityApplication(s), MAC Id, platform/category
Consumption scopeRequired topics (Customer, Vendor, etc.)
OwnershipPrimary/secondary contacts, team distribution list
Business caseBenefit case and utilization plan
ImpactImpact of downtime or sync delay beyond 24 hours
StorageLocal storage plans and regions
Data scopePrimary fields consumed; modification and downstream forwarding
DesignUsage of this data in your application

Integration Landscape

Platform Approvers

Access requests are reviewed by the SMDS platform team:

ApproverRoleScope
Rahul SinghSenior Platform ArchitectAPI, EMP, DB (SMDS-CMD/OPS)
Ramesh Varma KutcherlapatiSenior Solution ArchitectAPI, EMP, DB (SMDS-OPS)
Mahendra Singh PariharSenior Data EngineerAPI, EMP, DB (SMDS-OPS)
Gopala Krishnan RajendranSenior Data ArchitectAPI, EMP, DB (SMDS-CMD)
Samit SinhaData Product Delivery LeadAPI, EMP, DB (SMDS-OPS)
Ajit Vijay DawareSenior Engineering ManagerAPI, EMP, DB (SMDS-CMD)
Was this page helpful?