Technical Architecture

Runs on YOUR AWS Account

Flustack deploys entirely within your own AWS infrastructure. Your data never crosses account boundaries — full control, zero vendor lock-in on data residency, enterprise security from day one.

Zero Data Egress30+ AWS ServicesServerless-First

System Architecture

End-to-End Data Pipeline

Eight integrated layers working together — from raw source data to business-ready insights, all orchestrated within a single AWS account.

Data Quality Architecture

Medallion Architecture

A five-layer data lake guarantees every record is validated, enriched, and analytics-ready before reaching your dashboards. Bad data is isolated and traceable — never silently discarded.

Landing

Raw data as-is from every source. Immutable, append-only.

S3 · AppFlow · Kinesis · MSK
Bronze

Deduplicated, parsed, schema-normalized. Glue jobs apply initial transformations.

AWS Glue 4.0 · Apache Iceberg
Silver

Validated, enriched, anomalies flagged. SageMaker RCF runs here.

SageMaker · Bedrock · Iceberg
Gold

Analytics-ready, aggregated, domain-modeled. Athena and QuickSight consume from here.

Athena · QuickSight · EMR Serverless
Quarantine⚠ isolated

Isolated bad records with full lineage. Reprocessable after correction.

S3 · CloudWatch · DynamoDB

Quarantine sits outside the main flow — bad records are routed here automatically and reprocessed after correction.

Full AWS Stack

Every Service, Every Layer

Flustack is an opinionated architecture over 30+ AWS services — the right service for each job, connected and configured out of the box.

Ingestion
S3Landing, Bronze, Silver, Gold, Quarantine zones
AppFlowSaaS connectors (Salesforce, ServiceNow, Slack)
Kinesis Data StreamsReal-time streaming source
MSK (Kafka)Alternative streaming source
Secrets ManagerJDBC credentials, API keys, tenant API keys
Compute
AWS Glue 4.0ETL PySpark jobs — batch and streaming
EMR ServerlessHigh-volume jobs and complex joins
AWS LambdaTriggers, connectors, API handler, GC
Orchestration
Step FunctionsMain ingestion flow per source
EventBridgeScheduled triggers per tenant per source
EventBridge SchedulerCron schedules per tenant
SNSOperational alerts (KMS-encrypted)
AI / ML
Amazon BedrockSchema inference, semantic mapping, error analysis
Amazon SageMakerRCF anomaly detection, risk scoring, batch inference
Storage
Glue Data CatalogIceberg table metadata
Apache IcebergFormat for Silver and Gold layers
Governance
KMSCustomer Managed Key per tenant
Lake FormationColumn and cell-level access control
IAMLeast-privilege roles per service
Consumption
Amazon AthenaSQL queries on Iceberg tables
QuickSightBI dashboards
QuickSight QNLQ in Spanish and English (optional)
Control Plane
DynamoDBControl plane Single Table Design — source config + job state
Observability
CloudWatch LogsStructured JSON logs (13 log groups)
CloudWatch MetricsBusiness KPIs
CloudWatch InsightsPredefined operational queries
API Layer
API GatewayREST API exposure
FastAPI + MangumAPI handler (Lambda)
CI / CD
CodePipelineCI/CD pipeline per tenant
CodeBuildCDK synth + test execution
Local Dev
LocalStack ProLocal simulation of all AWS services
cdklocalCDK deploy against LocalStack

Why it matters

Three Principles That Define Flustack

Your Infrastructure

Every byte of data stays in your AWS account. No shared compute, no shared storage, no data egress to a vendor cloud. Flustack deploys via CDK into your environment and you own everything from day one.

AI-Native by Design

AI is not a bolt-on feature. Amazon Bedrock drives schema inference and semantic mapping at ingest time. SageMaker RCF runs anomaly detection at the Silver layer. Intelligence is baked into the pipeline, not added as an afterthought.

Multi-Tenant by Design

Native tenant isolation from the data model up — KMS Customer Managed Key per tenant, dedicated EventBridge schedules, isolated pipelines. Onboard a new client in minutes without touching existing tenant infrastructure.

Ready to see it in your environment?

Schedule a technical walkthrough with our engineers and see how Flustack maps to your existing AWS setup.