[AI SYSTEMS ENGINEER]

BUILDING
AI SYSTEMS
THAT SCALE

AI Infrastructure · GPU Optimization · ML Pipelines
3+ YEARS EXPERIENCE
$0 VENDOR LOCK-IN
24/7 AUTONOMOUS OPERATION
[01]

WHO

Software Engineer specializing in AI systems: self-hosted infrastructure, scalable backend and frontend applications, and production-ready machine learning pipelines. I focus on building systems that operate independently, avoid vendor lock-in, and keep full control of their data.

My expertise covers the full lifecycle: from GPU-optimized model deployment and high-performance APIs to CI/CD pipelines, monitoring, and automated testing. I design systems with reliability, scalability, and quality at the core, integrating strong QA practices across every layer.

Projects like Kurai-Transcribe, KuraiMusik, and KurAI2Video reflect this approach: AI systems that run 24/7 with zero API costs, optimized performance, and production-grade stability, combining advanced AI capabilities with solid engineering and automation.

Self-hosted AI: Zero API costs, complete data sovereignty, full control
GPU optimization: CUDA, memory management, performance tuning for AI workloads
Production systems: CI/CD, monitoring, and scalable deployments
[02]

EXPERIENCE

QA Automation Engineer

Quality Assurance & Automation · 2026 – Present
  • Built and maintained automated testing frameworks for 24/7 production systems
  • Designed end-to-end testing strategies for fintech platforms
  • Implemented CI/CD pipelines for reliable deployments and regression testing
  • Established QA processes focused on performance, reliability, and system optimization

Software Engineer

Fintech & Banking Systems · 2024 – 2026
  • Developed and maintained secure fintech applications for financial operations and transaction processing
  • Built scalable backend services and APIs for high-volume systems with strong consistency and reliability
  • Designed and implemented frontend interfaces for financial dashboards and internal banking tools
  • Applied best practices in authentication, authorization, and data protection for sensitive financial data
  • Implemented monitoring, logging, and automation to ensure high availability and system resilience

Full Stack Developer

Fullstack Development · 2023 – 2024
  • Developed scalable fullstack applications with modern JavaScript frameworks and RESTful APIs
  • Built and optimized backend systems with robust validation, authentication, and performance tuning
  • Designed and managed relational and NoSQL databases for high-performance data processing
  • Created responsive and user-focused interfaces with strong UX principles and real-time features
[03]

WORK

[01]

Kurai-Transcribe

2026

Production-grade self-hosted transcription API processing millions of minutes monthly. Custom vocabulary injection, named entity recognition with spaCy, and automatic punctuation restoration. Semaphore-based VRAM allocation limits how many jobs hold GPU memory at once, which prevents OOM errors. Voice activity detection combined with Pyannote speaker diarization. Real-time monitoring dashboard backed by Prometheus metrics.
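
The semaphore idea can be sketched in a few lines of Python. This is a minimal illustration, not the project's actual code: the budget figures, the `run_jobs` helper, and the `asyncio.sleep` stand-in for the GPU call are all assumptions made for the example.

```python
import asyncio

# Hypothetical figures: VRAM set aside for inference and the peak footprint of one job.
VRAM_BUDGET_MIB = 8192
VRAM_PER_JOB_MIB = 3072
MAX_CONCURRENT_JOBS = VRAM_BUDGET_MIB // VRAM_PER_JOB_MIB  # 2 slots fit in the budget


async def run_jobs(n_jobs: int, slots: int = MAX_CONCURRENT_JOBS) -> int:
    """Run n_jobs placeholder jobs and return the peak number that ran at once."""
    semaphore = asyncio.Semaphore(slots)
    active = 0
    peak = 0

    async def job() -> None:
        nonlocal active, peak
        async with semaphore:          # wait here until a VRAM slot is free
            active += 1
            peak = max(peak, active)
            await asyncio.sleep(0.01)  # stand-in for the GPU transcription call
            active -= 1

    await asyncio.gather(*(job() for _ in range(n_jobs)))
    return peak


if __name__ == "__main__":
    print(asyncio.run(run_jobs(10)))
```

Requests beyond the slot count queue at the semaphore instead of allocating memory, so a burst of traffic shows up as latency rather than as an out-of-memory crash.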

Faster-Whisper · FastAPI · spaCy · CTranslate2 · Pyannote · Prometheus
[02]

KuraiMusik

2026

Autonomous content generation system running 24/7 with zero human intervention. Dynamic content buffer system powered by advanced generative AI. Circuit breaker patterns ensure fault tolerance and reliability. Intelligent content orchestration with real-time metadata management and automated scheduling.
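
A circuit breaker of the kind mentioned above might look like the following sketch. The `CircuitBreaker` class, its thresholds, and the injectable `clock` are hypothetical names chosen for illustration; how the real system wires this into its workers is not described here.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when a call is skipped because the circuit is open."""


class CircuitBreaker:
    def __init__(self, failure_threshold=3, reset_timeout=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout
        self._clock = clock          # injectable so the timeout can be tested
        self._failures = 0
        self._opened_at = None       # None means the circuit is closed

    @property
    def state(self) -> str:
        if self._opened_at is None:
            return "closed"
        if self._clock() - self._opened_at >= self.reset_timeout:
            return "half-open"       # timeout elapsed: allow one trial call
        return "open"

    def call(self, func, *args, **kwargs):
        if self.state == "open":
            raise CircuitOpenError("circuit open; skipping call")
        try:
            result = func(*args, **kwargs)
        except Exception:
            self._failures += 1
            # A failed trial call re-opens at once; otherwise open at the threshold.
            if self._opened_at is not None or self._failures >= self.failure_threshold:
                self._opened_at = self._clock()
            raise
        # Success closes the circuit and clears the failure count.
        self._failures = 0
        self._opened_at = None
        return result
```

Wrapping a flaky dependency as `breaker.call(generate_track)` (a hypothetical function) lets an unattended system stop hammering a failing component and retry only after the cool-down.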

FastAPI · Celery · Redis · ACE-Step · PostgreSQL
[03]

KurAI2Video

2026

Enterprise-grade self-hosted video generation platform using the state-of-the-art Wan 2.1 T2V-1.3B model. Eliminates cloud API fees of $0.15-0.40 per second of generated video: 10,000 ten-second videos would cost $15,000-40,000. Advanced GPU memory management with dynamic CPU↔GPU offloading prevents OOM errors. Strict concurrency control allows only one active job at a time for predictable performance. Built-in monitoring dashboard tracks GPU utilization, VRAM usage, and model performance in real time. Production-ready Docker deployment with automated health checks and failover mechanisms.
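
The cost comparison can be checked with a short calculation. The quoted totals work out if each clip is about 10 seconds long; that clip length is an assumption inferred from the figures, and `cloud_cost_usd` is a hypothetical helper written for this example.

```python
def cloud_cost_usd(videos: int, seconds_per_video: int, cents_per_second: int) -> float:
    """Total cloud API cost in dollars, computed in whole cents to avoid float drift."""
    total_cents = videos * seconds_per_video * cents_per_second
    return total_cents / 100


VIDEOS = 10_000
SECONDS_PER_VIDEO = 10  # assumed clip length, not stated in the pricing itself

low = cloud_cost_usd(VIDEOS, SECONDS_PER_VIDEO, 15)   # $0.15 per second
high = cloud_cost_usd(VIDEOS, SECONDS_PER_VIDEO, 40)  # $0.40 per second

if __name__ == "__main__":
    print(f"${low:,.0f} to ${high:,.0f}")
```

Longer clips scale the figure linearly, so the savings grow with both volume and duration.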

Wan 2.1 T2V-1.3B · PyTorch BF16 · FastAPI · Docker · CUDA · Prometheus
[04]

Daily Journal

2025

100% private personal diary app for Android. No accounts, no subscriptions, no trackers. Your entries never leave your device. Minimalist interface, emotional analysis, dark mode, and full offline support. Developed with privacy as the top priority.

Kotlin · Room Database · Material Design · Android Jetpack
[04]

STACK

AI Systems

  • Self-hosted AI
  • GPU Optimization
  • ML Infrastructure
  • Production ML
  • Model Deployment

Testing & QA

  • Selenium
  • Java
  • API Testing
  • CI/CD
  • Test Automation

Backend & Infrastructure

  • Python
  • FastAPI
  • Docker
  • Kubernetes
  • PostgreSQL
  • Redis

AI/ML Stack

  • PyTorch
  • CUDA
  • TensorRT
  • MLflow
  • Prometheus
  • Wan 2.1 T2V
[05]

CONTACT

Available for AI infrastructure consulting and development projects.

Specializing in self-hosted AI systems, MLOps platforms, and production ML infrastructure.

Let's build AI systems that actually work at scale.