Modern Data+AI Stack - Spec Tables

Platform Components

| Software | Purpose | License | Version | Capability | Env | Est. Savings per Year |
| --- | --- | --- | --- | --- | --- | --- |
| DuckDB | Storage and querying of structured data | MIT | 0.9.2+ | In-process analytical database | All | $100,000 |
| dbt | Define, test, and document data transformations | Apache 2.0 | 1.6.0+ | Data transformation framework | All | $20,000 |
| SQLite | Lightweight storage for metadata | Public Domain | 3.40.0+ | Embedded database | All | $5,000 |
| LanceDB | Storage and searching of vector embeddings | Apache 2.0 | 0.5.0+ | Vector database | All | $50,000 |
| KuzuDB | Embeddable graph database | MIT | 0.0.14+ | Graph database engine | All | $40,000 |
| n8n | Integration and automation of data workflows | Fair-code (Sustainable Use License) | 1.0.0+ | Workflow automation | All | $36,000 |
| uv | Dependency management and environment setup | MIT or Apache 2.0 | 0.1.0+ | Python package manager | All | 200 eng hours |
| Evidence | Create interactive dashboards and reports | MIT | 2.0.0+ | Data visualization framework | All | $50,000 |
| Rill Data | Real-time dashboards and data exploration | Apache 2.0 | 0.40.0+ | BI and self-service analytics platform | All | $40,000 |
| Docling | Extract structured data from documents | MIT | 2.30.0+ | Document processing toolkit | All | $25,000 |
| Marimo | Data exploration and analysis | Apache 2.0 | 0.1.0+ | Interactive notebooks | All | $15,000 |
| LocalStack | Local AWS emulation for development | Apache 2.0 | 2.0.0+ | Local cloud service emulator | Local | $30,000 |
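
The minimum versions listed above can be checked against a local Python environment. The following is a minimal sketch, not part of the original spec: it covers only the components that ship as Python packages, the PyPI distribution names (`duckdb`, `dbt-core`, `lancedb`, `kuzu`, `docling`, `marimo`) are assumptions to confirm against each project's install instructions, and the helper names are hypothetical. The version comparison is deliberately simple and treats a pre-release such as `1.6.0rc1` the same as `1.6.0`.

```python
import re
from importlib import metadata

# Minimum versions copied from the Platform Components table.
# The distribution names on the left are assumptions, not part of the spec.
MINIMUMS = {
    "duckdb": "0.9.2",
    "dbt-core": "1.6.0",
    "lancedb": "0.5.0",
    "kuzu": "0.0.14",
    "docling": "2.30.0",
    "marimo": "0.1.0",
}


def parse_version(text):
    """Turn '1.6.0' into (1, 6, 0); anything after the numeric prefix is ignored."""
    match = re.match(r"\d+(?:\.\d+)*", text.strip())
    if match is None:
        return ()
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(installed, minimum):
    """Compare two dotted versions numerically, padding the shorter with zeros."""
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_minimums(minimums, lookup=metadata.version):
    """Return a {name: status} report; `lookup` can be swapped out for testing."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = lookup(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        if meets_minimum(installed, minimum):
            report[name] = "ok"
        else:
            report[name] = f"too old ({installed} < {minimum})"
    return report


if __name__ == "__main__":
    for name, status in check_minimums(MINIMUMS).items():
        print(f"{name}: {status}")
```

Numeric comparison matters here: a plain string comparison would rank `0.10.0` below `0.9.2`. For strict handling of pre-releases and other edge cases, a dedicated version-parsing library would be a better fit than this hand-rolled parser.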

Development & Support Tools

| Software | Purpose | License | Version | Capability | Env | Est. Savings per Year |
| --- | --- | --- | --- | --- | --- | --- |
| Python | Core runtime for most components | PSF License | 3.11+ | Programming language | All | $0 |
| Node.js | Required for Evidence dashboards | MIT | 18.0.0+ | JavaScript runtime | All | $0 |
| Git | Track changes to code and configuration | GPL-2.0 | 2.35.0+ | Version control system | All | $5,000 |
| Ghostty | High-performance terminal emulator | MIT | Latest | GPU-accelerated terminal | All | 200 eng hours |
| Warp | Modern, Rust-based terminal | Proprietary | Latest | AI-enhanced terminal with blocks | macOS | 200 eng hours |
| Airflow | Alternative for scheduling and monitoring workflows | Apache 2.0 | 2.6.0+ | Workflow orchestration | All | $40,000 |
| Obsidian | Documentation viewer with diagrams & wikilinks | Proprietary | 1.4.5+ | Markdown knowledge base | Local | 500 eng hours |
| Logseq | Knowledge graph and outliner | AGPL-3.0 | 0.9.0+ | Connected note-taking system | All | 300 eng hours |
| Cursor | Development environment with AI capabilities | Proprietary | Latest | AI-assisted code editor | Local | 500 eng hours |

Optional & Extension Components

| Software | Purpose | License | Version | Capability | Env | Est. Savings per Year |
| --- | --- | --- | --- | --- | --- | --- |
| MinIO | Local alternative to AWS S3 for data lake storage | AGPL-3.0 | Latest | S3-compatible object storage | Local | $10,000 |
| AWS S3 | Scalable data lake storage | N/A (Service) | N/A | Cloud object storage | Cloud | N/A |
| PostgreSQL | Alternative storage for metadata | PostgreSQL License | 14.0+ | Relational database | Any | $30,000 |
| Alembic | Database schema migrations | MIT | 1.10.0+ | Migration tool | All | $0 |
| DBeaver | Universal database tool | Apache 2.0 | 23.0.0+ | Database GUI | All | $8,000 |
| Superset | Alternative to Evidence for visualization | Apache 2.0 | 2.1.0+ | BI platform | Any | $50,000 |

AI Components

| Software | Purpose | License | Version | Capability | Env | Est. Savings per Year |
| --- | --- | --- | --- | --- | --- | --- |
| LibreChat | Self-hosted AI chat interface | MIT | 0.6.0+ | Chat interface for multiple LLMs | All | $25,000 |
| Claude Desktop | Local desktop app for Claude AI | Proprietary | Latest | Desktop client for Anthropic’s Claude | Local | $10,000 |
| CrewAI | Framework for orchestrating AI agents | MIT | 0.22.0+ | Multi-agent orchestration framework | All | $40,000 |
| Restack | Local LLM and AI stack deployment | Proprietary | Latest | One-click AI deployment platform | All | $20,000 |
| AWS Bedrock | Managed service for foundation models | Proprietary | Latest | Access to multiple foundation models | Cloud | Pay-per-use |
| Ollama | Run open-source LLMs locally | MIT | 0.1.19+ | Local LLM runner | All | $25,000 |
| LlamaIndex | Data framework for LLM applications | MIT | 0.9.0+ | RAG framework | All | $30,000 |
| GGML | Machine learning library for edge devices | MIT | 0.1.0+ | Tensor library for efficient inference | All | $15,000 |
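
Because the savings column mixes dollar figures, engineering hours, and non-numeric entries such as "Pay-per-use", a per-table total needs to keep those apart. The sketch below is illustrative only: `tally_savings` is a hypothetical helper, and it assumes that plain numeric figures are US dollars (consistent with the `$0` entries elsewhere in these tables) and accepts them with or without a leading `$`. The sample input is the savings column of the AI Components table.

```python
def tally_savings(cells):
    """Split savings cells into a dollar total, an hours total, and leftovers."""
    dollars = 0
    hours = 0
    other = []
    for cell in cells:
        text = cell.strip()
        if text.lower().endswith("eng hours"):
            # e.g. "200 eng hours": the first token is the hour count
            hours += int(text.split()[0].replace(",", ""))
            continue
        number = text.lstrip("$").replace(",", "")
        if number.isdigit():
            dollars += int(number)  # assumed to be US dollars
        else:
            other.append(text)  # e.g. "N/A" or "Pay-per-use"
    return dollars, hours, other


# Savings column of the AI Components table, top to bottom.
ai_components = [
    "$25,000", "$10,000", "$40,000", "$20,000",
    "Pay-per-use", "$25,000", "$30,000", "$15,000",
]

dollars, hours, other = tally_savings(ai_components)
print(f"${dollars:,} per year, {hours} eng hours, not counted: {other}")
```

For the AI Components table this gives $165,000 per year, with the AWS Bedrock "Pay-per-use" entry left out of the total. Dollars and hours are kept separate because the source gives no rate for converting one into the other.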