Skip to content

VelocityAI is an intelligent architecture analysis tool that provides instant, structured feedback on system designs. Built for architects, engineers, and technical leaders who think at high velocity but need comprehensive validation of their architectural decisions.

Notifications You must be signed in to change notification settings

CodeRandomMC/VelocityAI-A-Systems-Architect-Co-Pilot

Repository files navigation

VelocityAI - Systems Architect Toolset 🏗️🤖

AI-augmented architecture review companion that matches high-velocity thinking patterns

Python 3.8+ License: MIT Gradio

🎯 Mission

Democratize architectural expertise through AI augmentation. Transform architecture reviews from weeks-long processes into minute-long iterations, enabling rapid system design validation without sacrificing depth or quality.

📋 Project Overview

VelocityAI is an intelligent architecture analysis tool that provides instant, structured feedback on system designs. Built for architects, engineers, and technical leaders who think at high velocity but need comprehensive validation of their architectural decisions.

VelocityAI Interface

🎯 Core Purpose

  • Accelerate Architecture Reviews: Compress weeks of traditional review cycles into minutes
  • Scale Expertise: Get senior-level architectural insights without requiring senior architects
  • Export & Share Results: Generate professional PDF reports and documentation for stakeholders
  • Enable Rapid Iteration: Fail fast, learn fast, improve fast
  • Maintain Quality: Never sacrifice architectural rigor for speed

🚀 Key Features

  • Dual AI Provider Support: Choose between cloud (Google Gemini) or local (LM Studio) inference
  • Structured Analysis: Consistent JSON output covering 7 critical architectural dimensions
  • Real-time Feedback: Instant analysis through an intuitive web interface
  • Privacy-First Options: Keep sensitive designs internal with local LLM support
  • Zero Configuration: Works out of the box with minimal setup

🏗️ Architectural Dimensions Analyzed

  1. Scalability - Growth handling and elasticity
  2. Reliability - Fault tolerance and resilience patterns
  3. Security - Threat modeling and vulnerability assessment
  4. Performance - Latency, throughput, and resource optimization
  5. Maintainability - Code quality and evolution capability
  6. Cost Efficiency - Resource utilization and TCO optimization
  7. Observability - Monitoring, debugging, and operational insight

🛠️ Technology Stack

  • Backend: Python with modular architecture
  • Frontend: Gradio web interface
  • AI Providers: Google GenAI (Gemini), LM Studio (Local LLMs)
  • Output Format: Structured JSON → Rendered Markdown
  • Configuration: Environment-based with dynamic UI controls

🏗️ System Architecture

Frontend: Gradio Web Interface

  • Clean, responsive design optimized for rapid iteration
  • Real-time analysis feedback with streaming updates
  • Markdown input/output with syntax highlighting
  • Dynamic model selection and provider switching
  • Export controls for generating reports in multiple formats

Backend: Dual-Provider Architecture

  • Google GenAI: Cloud-based Gemini Flash/Pro models
  • LM Studio: Local model support with flexible host configuration
  • Automatic model detection and connection testing
  • Graceful fallback handling and error recovery
  • Export engine for PDF, HTML, and Markdown report generation

Export Capabilities

  • Professional PDF reports with styled formatting
  • Interactive HTML reports for digital sharing
  • Markdown exports for version control and documentation
  • In-UI previews and downloadable outputs
  • Customizable theming and templating system

Export Showcase

Core Components Structure

SystemArchticectToolset/
├── main.py              # Application orchestration and entry point
├── config.py            # Configuration constants and environment settings
├── core_logic.py        # Analysis logic and specialized prompt engineering
├── llm_clients.py       # LLM client abstractions with factory pattern
├── ui_components.py     # Gradio UI components and custom styling
├── test_refactored.py   # Comprehensive test suite and validation
├── requirements.txt     # Python dependencies
└── README.md           # This documentation

Modular Design Philosophy

The application follows a separation of concerns architecture:

  • Configuration Layer (config.py): Environment and model settings
  • Client Layer (llm_clients.py): AI provider abstractions
  • Logic Layer (core_logic.py): Analysis algorithms and prompt engineering
  • UI Layer (ui_components.py): Interface components and styling
  • Export Layer (export_utils.py): Report generation and formatting
  • Orchestration Layer (main.py): Application coordination and event handling

🚀 Quick Start

Prerequisites

pip install -r requirements.txt

Dependencies:

  • gradio - Interactive web interface framework
  • google-genai - Google Gemini API client
  • requests - HTTP client for LM Studio communication
  • python-dotenv - Environment variable management
  • bleach - Output sanitization for security

Option 1: Cloud-Based Analysis (Google GenAI)

  1. Get API Key: Visit Google AI Studio
  2. Configure Environment: Create .env file with GOOGLE_API_KEY=your_api_key_here
  3. Launch Application:
    python main.py
  4. Access Interface: Open http://localhost:7860 in your browser
  5. Select Provider: Choose "Google GenAI" and select Gemini model
  6. Start Analyzing: Paste your architecture plan and get instant feedback

Option 2: Local Analysis (LM Studio)

Local LLM Support

  1. Install LM Studio: Download from lmstudio.ai
  2. Load Model: Use 7B+ parameter models for better analysis quality
  3. Start Server: Enable LM Studio's local server (Server tab → Start Server)
  4. Launch Application:
    python main.py
  5. Configure Connection:
    • Select "LM Studio (Local)" provider
    • Set host configuration (default: localhost:1234)
    • Test connection to verify availability
  6. Select Model: Choose from automatically detected models
  7. Start Analyzing: Input architecture plans for local, private analysis

Testing Installation

# Run comprehensive tests
python test_refactored.py

# Launch application
python main.py

💡 Usage Example

Input: Architecture Plan in Markdown

# Real-time Analytics Dashboard

## Overview

Track user interactions and display analytics in real-time for a SaaS platform.

## Components

- **Frontend**: React SPA with WebSocket connections
- **API Gateway**: Single Node.js service on AWS EC2
- **Database**: PostgreSQL instance (same EC2)
- **Real-time**: WebSocket connections for live updates

## Data Flow

1. User performs action → Frontend captures event
2. Event sent via POST /events → API processes and stores in PostgreSQL
3. API broadcasts update via WebSocket → All connected dashboards update

Output: Structured AI Analysis

📋 Plan Summary Real-time analytics system with React frontend, Node.js API, and PostgreSQL database using WebSockets for live updates.

✅ Strengths

  • Simple architecture with clear data flow
  • Real-time capability through WebSocket implementation
  • Minimal technology stack reduces complexity

🔍 Areas for Improvement

  • Database Layer (HIGH): Single PostgreSQL instance creates bottleneck and single point of failure
  • Scalability (MEDIUM): Monolithic API will struggle under load
  • Security (CRITICAL): No authentication or rate limiting mentioned

🚀 Next Steps

  1. Implement database replication for high availability
  2. Add Redis for caching and session management
  3. Design API authentication and authorization strategy
  4. Plan horizontal scaling approach for API layer

🧠 AI Analysis Framework

The tool employs "Archimedes" - a specialized AI system prompt embodying 25+ years of senior principal architect experience. Analysis covers:

Structured Output Format

{
  "summaryOfReviewerObservations": "Executive summary of architectural assessment",
  "planSummary": "What the system does and its core components",
  "strengths": [
    {
      "dimension": "Scalability | Security | Performance | etc.",
      "point": "Specific strength identified",
      "reason": "Why this design choice is beneficial"
    }
  ],
  "areasForImprovement": [
    {
      "area": "Specific architectural concern",
      "concern": "Exact problem or risk identified",
      "suggestion": "Actionable, pragmatic recommendation",
      "severity": "CRITICAL | HIGH | MEDIUM | LOW",
      "impact": "Consequence if not addressed",
      "tradeOffsConsidered": "Associated trade-offs"
    }
  ],
  "strategicRecommendations": [
    {
      "recommendation": "High-level architectural improvements",
      "rationale": "Why this direction is beneficial",
      "potentialImplications": "Implementation effort required"
    }
  ],
  "nextStepsAndConsiderations": [
    "Prioritized next steps and clarifying questions"
  ]
}

🔄 Development & Contributing

Project Philosophy

Speed + Simplicity + Quality

  • Rapid iteration without sacrificing architectural rigor
  • AI augmentation to scale expertise, not replace it
  • Open source tools that democratize architectural knowledge

Architecture Principles

  1. Modular Design: Separation of concerns for maintainability
  2. Provider Agnostic: Support both cloud and local AI inference
  3. Security First: Privacy options for sensitive architectural data
  4. User Experience: Intuitive interface that matches thinking velocity
  5. Extensibility: Easy to add new providers, models, or analysis types

Contributing Guidelines

We welcome contributions that align with the project's high-velocity, AI-augmented approach:

  • Bug Reports: Use GitHub issues with detailed reproduction steps
  • Feature Requests: Propose enhancements that improve analysis speed or quality
  • Code Contributions: Follow the modular architecture patterns
  • Documentation: Help improve clarity and accessibility

Development Setup

# Clone and install dependencies
git clone <repository-url>
cd SystemArchticectToolset
pip install -r requirements.txt

# Run tests
python test_refactored.py

# Launch development server
python main.py

📄 License

MIT License - Build whatever you want with this tool.


📈 Updates & Changelog

Note: This section is for ongoing updates. Add new entries at the top of each release section.

🔥 Latest Release - v2.0.0 (July 2025)

✨ Major Features

🏗️ Modular Architecture Refactoring

  • Complete codebase restructuring into maintainable modules
  • Separation of concerns: config, clients, logic, UI, and orchestration
  • Comprehensive test suite with test_refactored.py
  • Type annotations and enhanced error handling throughout

🚀 Enhanced LM Studio Integration

  • Dynamic host configuration (not limited to localhost:1234)
  • Real-time connection testing with status feedback
  • Automatic model discovery and refresh capabilities
  • Support for remote LM Studio instances
  • Enhanced error handling for connection issues

🎨 Improved User Experience

  • Cleaner, more intuitive interface design
  • Visual connection status indicators
  • Collapsible configuration sections
  • Better error messages and user guidance
  • Streaming analysis updates for real-time feedback

🛠️ Technical Improvements

🔒 Security Enhancements

  • Output sanitization with bleach library
  • XSS protection for markdown rendering
  • Safe link handling with rel="nofollow"
  • Input validation and error boundary handling

⚡ Performance Optimizations

  • Optimized API client initialization
  • Better connection pooling and timeout handling
  • Reduced UI rendering overhead
  • Efficient model list caching

🧪 Testing & Quality

  • Comprehensive test coverage across all modules
  • Automated validation of core functionality
  • Error scenario testing
  • Integration test suite

📝 Documentation Updates

📚 Enhanced Documentation

  • Portfolio-ready README with clear mission statement
  • Structured project overview and technical specifications
  • Professional formatting with badges and visual hierarchy
  • Clear installation and usage instructions

🗂️ New Documentation Structure

  • Centralized updates section for ongoing changes
  • Technical architecture documentation
  • Contributing guidelines and development setup
  • Clear licensing and usage terms

🔮 Planned Updates (Roadmap)

Next Release (v2.1.0 - Planned)

  • Custom Model Support: Fine-tuned architecture analysis models
  • Batch Analysis: Process multiple architecture plans simultaneously
  • Export Capabilities: PDF and detailed report generation
  • Template Library: Common architectural pattern templates

Future Enhancements (v3.0.0+)

  • Team Collaboration: Multi-user architecture review workflows
  • Integration APIs: Connect with existing architecture tools
  • Metrics Dashboard: Track architecture quality over time
  • Advanced Analytics: Historical trend analysis and recommendations

Built with high-velocity thinking and AI augmentation 🚀

About

VelocityAI is an intelligent architecture analysis tool that provides instant, structured feedback on system designs. Built for architects, engineers, and technical leaders who think at high velocity but need comprehensive validation of their architectural decisions.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages