Sajal Regmi

Engineer

arkios.sh — zsh
status --project
currently leading arkios.ai at Karela Technologies Inc.
focus
LLM · AI · Infrastructure · System Architecture

Engineering Works

2026 Deep Learning

Transformer from Scratch: Building the Architecture That Powers Modern AI

A comprehensive deep-dive into implementing the Transformer architecture from the ground up. Covers self-attention mechanisms, positional encodings, multi-head attention, feed-forward networks, and the encoder-decoder structure that revolutionized NLP and became the foundation for GPT, BERT, and modern LLMs.

Transformers Deep Learning Attention Mechanism PyTorch NLP
📦 View on GitHub
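
The self-attention step at the center of the architecture described above can be sketched in a few lines. This is a minimal illustration in plain Python, not code from the article or its repository: it assumes a single head, leaves out the learned query/key/value projection matrices, and uses made-up toy embeddings.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def scaled_dot_product_attention(queries, keys, values):
    """Return (outputs, weights) for one attention head.

    queries, keys: lists of d_k-dimensional vectors
    values: list of d_v-dimensional vectors, one per key
    """
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys]
        weights = softmax(scores)
        # Each output is a weighted sum of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Toy input (assumed values): three tokens with 2-dimensional embeddings.
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = scaled_dot_product_attention(x, x, x)
```

Each row of `weights` sums to 1, so every output vector is a convex combination of the value vectors. Multi-head attention would run several such heads on different learned projections and concatenate the results.
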
2025 Real-time Collaboration

Enabling Realtime Doodling Feature on Lecturely.ai with Stroke Level Precision

Deep technical exploration of implementing pixel-perfect realtime doodling capabilities in a collaborative learning platform. Covers stroke extraction algorithms, OT/CRDT integration for conflict resolution, and performance optimization strategies for buttery-smooth 60fps rendering on mobile devices.

React Native iOS PencilKit Real-time OT/CRDT
2025 Microservices

Ins and Outs of a True Microservice Architecture

Comprehensive analysis of building production-ready microservices beyond the hype. Explores service boundaries, inter-service communication patterns, distributed tracing, circuit breakers, and the operational complexity that comes with distributed systems at scale.

Node.js Microservices System Design Architecture Distributed Systems
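
As one concrete illustration of the patterns listed above, a circuit breaker can be sketched as a small state machine. This is a hedged, minimal Python sketch rather than code from the article: the class name, the thresholds, and the injectable `clock` parameter are all assumptions made for the example.

```python
import time

class CircuitOpenError(Exception):
    """Raised when a call is rejected because the circuit is open."""

class CircuitBreaker:
    def __init__(self, failure_threshold=3, reset_timeout=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout
        self.clock = clock
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    def call(self, func, *args, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_timeout:
                raise CircuitOpenError("downstream call skipped")
            # Timeout elapsed: half-open, let one trial call through.
        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.opened_at is not None or self.failures >= self.failure_threshold:
                self.opened_at = self.clock()  # open (or re-open) the circuit
            raise
        # Success closes the circuit and clears the failure count.
        self.failures = 0
        self.opened_at = None
        return result

# Demo with a fake clock so the timeout can be simulated.
now = [0.0]
breaker = CircuitBreaker(failure_threshold=2, reset_timeout=10.0, clock=lambda: now[0])

def flaky():
    raise RuntimeError("service down")

for _ in range(2):
    try:
        breaker.call(flaky)
    except RuntimeError:
        pass  # two failures trip the breaker

try:
    breaker.call(lambda: "ok")
    rejected = False
except CircuitOpenError:
    rejected = True  # an open circuit fails fast without calling downstream

now[0] = 11.0  # pretend the reset timeout has passed
recovered = breaker.call(lambda: "ok")
```

Failing fast while the circuit is open keeps a struggling downstream service from being hit with retries, which is the main reason the pattern shows up in distributed systems.
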
2025 Mobile Development

Developing a Native Bridge Between React Native and Fabric

Technical deep-dive into building high-performance native bridges connecting React Native's JavaScript layer with iOS native capabilities. Covers Objective-C++ integration, synchronous vs asynchronous bridge patterns, memory management, and performance benchmarking for complex native modules.

React Native iOS Native Bridge Objective-C++ Fabric

Open Source & Research

React Native Markdown Editor

📦 npm Package

Comprehensive npm package for rendering markdown content in React Native applications with full scientific equation LaTeX support. Features custom parsing engine, performance-optimized rendering, and native module integration for mathematical expressions.

React Native Markdown LaTeX npm

Enhanced port of react-native-pencil-kit enabling granular stroke-level operations. Supports individual stroke addition, deletion, and management capabilities essential for building real-time collaborative drawing applications with pixel-perfect precision.

React Native PencilKit iOS Real-time Collaboration

GPT Semantic Cache

⚡ LLM Tool

Intelligent npm package for caching LLM responses based on semantic similarity rather than exact string matching. Reduces API costs and latency by 60-80% through vector embeddings, similarity search, and context-aware cache invalidation strategies.

LLM Caching Vector Embeddings npm
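
The idea of matching on semantic similarity rather than exact strings can be sketched as follows. This is an illustrative Python sketch, not the npm package's API: `toy_embed` is a bag-of-words stand-in for a real embedding model, the 0.8 threshold is an assumed value, and the linear scan stands in for a proper vector index.

```python
import math
from collections import Counter

def toy_embed(text):
    """Stand-in embedding (assumption): bag-of-words counts instead of a real model."""
    return Counter(text.lower().split())

def cosine_similarity(a, b):
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

class SemanticCache:
    def __init__(self, embed=toy_embed, threshold=0.8):
        self.embed = embed
        self.threshold = threshold
        self.entries = []  # list of (embedding, response)

    def get(self, prompt):
        """Return the response cached for the most similar prompt, or None on a miss."""
        query = self.embed(prompt)
        best_score, best_response = 0.0, None
        for vector, response in self.entries:
            score = cosine_similarity(query, vector)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None

    def put(self, prompt, response):
        self.entries.append((self.embed(prompt), response))

cache = SemanticCache(threshold=0.8)
cache.put("what is the capital of France", "Paris")
hit = cache.get("what is the capital of france ?")     # near-duplicate wording
miss = cache.get("how do I reverse a list in Python")  # unrelated prompt
```

A near-duplicate prompt clears the threshold and reuses the stored response, while an unrelated prompt falls through to the LLM. Choosing the threshold is the main trade-off: too low returns wrong answers, too high rarely hits.
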

Academic research paper presenting novel approaches to LLM response caching using semantic similarity metrics. Details architecture, algorithms, and performance benchmarks showing significant improvements in response time and cost reduction for production AI applications.

Research LLM Semantic Caching Academic

Work Experience

Engineer

  • Built a full-featured IDE for firmware engineers with integrated hardware debugging, crash analysis, and real-time serial log monitoring—enabling end-to-end embedded development in a single tool
  • Implemented live variable watch, firmware flashing with verification, and RTOS inspection (task states and per-task stack usage) to give engineers deep runtime visibility into embedded systems
  • Developed a function profiler for performance analysis, automated device connection detection, and debug adapter diagnostics tooling to accelerate the firmware development cycle
  • Engineered low-latency C++ serial communication drivers optimized for high-throughput, real-time device interaction with minimal protocol overhead
  • Architected a debug orchestration pipeline supporting multiple hardware programmers with built-in flash verification and serial-based test automation
C++ Embedded Systems RTOS Serial Communication Low-Latency Drivers Firmware Tooling

Lead Engineer & CEO

  • Led the engineering team to design and build an AI-powered enterprise platform enabling organizations to onboard teams and leverage AI agents for operations, automation, and software development, with a full role-based access control model and enterprise-grade security
  • Architected Software Foundry—a software factory module that provisions fully isolated, per-tenant cloud environments via infrastructure-as-code, allowing an AI agent to scaffold, build, and iterate on full-stack applications through natural language
  • Designed a high-throughput distributed RPC communication layer between platform servers and per-tenant compute nodes, with mutual TLS authentication and network-level isolation
  • Implemented zero-trust security architecture: certificate-based service authentication, automated secrets management at provisioning time, and full infrastructure lifecycle management (provision and teardown per tenant)
  • Engineered the core platform server handling authentication, billing, team management, and AI agent orchestration, backed by relational and in-memory data stores on cloud infrastructure
Node.js / Express.js gRPC AWS Terraform PostgreSQL Redis AI Agents Distributed Systems Enterprise Architecture TypeScript

Lecturely AI

Jan 2025 - Present

Senior LLM Engineer (Remote)

  • Architected multi-agent LLM orchestration system with knowledge-graph-enhanced RAG, achieving 40% improvement in response relevance via prompt-relevant context injection
  • Engineered React Native→Objective-C++ native bridge for PencilKit integration, implementing Fabric renderer synchronization with JSI bindings and synchronous RPC patterns, enabling less than 60ms latency for stroke capture at 120fps
  • Designed stroke-level OT/CRDT hybrid algorithm for real-time collaborative canvas, supporting 20+ concurrent users with conflict-free merging, operational transformation for stroke sequences, and delta compression
  • Built production CI/CD with blue-green deployments, implementing health checks, staged rollouts, and automatic rollback on error rate >1%, achieving 99.95% uptime SLA and zero-downtime releases
LLMs RAG (Knowledge Graphs) React Native Fabric Objective-C++ Bridge OT/CRDT AWS ECS JSI
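
To make the conflict-free merging of strokes concrete, here is a minimal sketch of a two-phase set, one of the simplest CRDTs. It is a deliberately simpler technique than the OT/CRDT hybrid described above and is not the production algorithm. The sketch assumes stroke ids are globally unique (for example, replica name plus a counter), and the class and method names are hypothetical.

```python
class StrokeSet:
    """Two-phase set of strokes: the add map and the tombstone set only ever grow."""

    def __init__(self):
        self.added = {}       # stroke_id -> tuple of points
        self.removed = set()  # tombstones for deleted stroke ids

    def add_stroke(self, stroke_id, points):
        # Ids are assumed unique, so an existing entry is never overwritten.
        self.added.setdefault(stroke_id, tuple(points))

    def delete_stroke(self, stroke_id):
        self.removed.add(stroke_id)

    def merge(self, other):
        """Fold another replica's state into this one (commutative and idempotent)."""
        for stroke_id, points in other.added.items():
            self.added.setdefault(stroke_id, points)
        self.removed |= other.removed

    def visible(self):
        """Strokes that have been added and not deleted."""
        return {sid: pts for sid, pts in self.added.items() if sid not in self.removed}

# Two replicas edit concurrently, then exchange state in both directions.
alice, bob = StrokeSet(), StrokeSet()
alice.add_stroke("alice-1", [(0, 0), (5, 5)])
bob.add_stroke("bob-1", [(2, 2), (3, 9)])
bob.merge(alice)
bob.delete_stroke("alice-1")  # Bob erases Alice's first stroke
alice.add_stroke("alice-2", [(1, 1), (1, 4)])
alice.merge(bob)
bob.merge(alice)
```

Because merging is a union of two grow-only structures, replicas converge regardless of the order in which they exchange state. The cost is that a deleted stroke id can never be re-added, and tombstones accumulate over time.
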

Creatiphe LLC

May 2024 - July 2024

Entrepreneur in Residence

  • Refined business evaluation process with rigorous testing strategies and clear success metrics, fostering data-driven culture
  • Developed MVPs with freelancers and implemented testing strategies, enhancing engagement and conversions by 30%
  • Incubated talent agency bridging Nepalese tech talent with global opportunities, overseeing recruitment for 200 candidates
  • Increased Happy O'Clock revenue by 10% through strategic automation and retention strategies
Product Management Automation Project Management

Ticketsewa Pvt Ltd. (Closed)

May 2023 - May 2024

Co-Founder & Principal Engineer

  • Architected distributed ticketing platform handling 10K concurrent requests with CQRS pattern, event sourcing, and API gateway routing across 12 microservices (Node.js/TypeScript), achieving 99% uptime and 150ms p95 latency
  • Engineered rate-limited payment orchestration integrating 7 airline GDS APIs, 10 event platforms, and 20 bus operators with circuit breaker pattern, idempotency keys, and distributed transaction compensation
  • Built real-time seat allocation engine with Redis distributed locks and optimistic concurrency control, preventing overselling with serializable isolation and implementing 2-phase commit for inventory consistency across providers
  • Led agile transformation implementing Scrum with velocity tracking, CI/CD pipelines (GitHub Actions), and automated E2E testing, improving deployment frequency from weekly to daily and reducing lead time by 65%
  • Secured $5K grant from Youth Self Employment Fund and scaled engineering team from 0→8 engineers in 6 months
Node.js/TypeScript Microservices CQRS/Event Sourcing Redis Cluster MySQL (Read Replicas) Docker/K8s CI/CD
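
The optimistic concurrency control mentioned for seat allocation can be illustrated with a version check on every write. This is a minimal in-memory Python sketch under stated assumptions, not the platform's code: the class names are hypothetical, no Redis lock is involved, and in a real database the compare-and-set would be a single atomic statement (for example, an update guarded by the expected version).

```python
class StaleVersionError(Exception):
    """Raised when a seat changed since the caller last read it."""

class SeatInventory:
    def __init__(self, seat_ids):
        # Each seat carries a version number that increases on every write.
        self.seats = {sid: {"holder": None, "version": 0} for sid in seat_ids}

    def read(self, seat_id):
        seat = self.seats[seat_id]
        return seat["holder"], seat["version"]

    def reserve(self, seat_id, customer, expected_version):
        """Compare-and-set: write only if nobody wrote since expected_version."""
        seat = self.seats[seat_id]
        if seat["version"] != expected_version:
            raise StaleVersionError(seat_id)
        if seat["holder"] is not None:
            raise ValueError(f"seat {seat_id} already sold")
        seat["holder"] = customer
        seat["version"] += 1

# Two customers read the same seat, then both try to reserve it.
inventory = SeatInventory(["12A"])
_, v_alice = inventory.read("12A")
_, v_bob = inventory.read("12A")
inventory.reserve("12A", "alice", v_alice)
try:
    inventory.reserve("12A", "bob", v_bob)
    bob_won = True
except StaleVersionError:
    bob_won = False  # Bob must re-read the seat and pick another
```

The second writer is rejected because its version is stale, so the seat cannot be sold twice. No lock is held between read and write, which suits workloads where conflicts are the exception rather than the rule.
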

PUDO Nepal Pvt Ltd

Aug 2020 - July 2021

Project Lead

  • Oversaw development of last-mile delivery platform projected to increase efficiency by 30%
  • Strategically located 170 PUDO stations to serve 500K+ potential customers in Kathmandu
  • Conducted market research influencing feature integration for 20% efficiency improvement
  • Collaborated on revenue model projecting support for 2000 stations and 300 delivery workers
  • Awarded 5% stock options for outstanding performance
Market Research Product Strategy UX Design

Karyathalo IT and Web Solutions

Aug 2020 - July 2021

Product Development Intern

  • Delivered e-governance platform for Kathmandu Metropolitan City (5M users) using WordPress multisite with custom plugin architecture; implemented role-based access control, audit logging, and form validation, reducing physical visits by 30% (200K+ requests/month)
  • Optimized MySQL queries with indexing strategies and Redis object caching, reducing page load time from 4.2s→1.1s (74% improvement) and supporting 2K concurrent users on a single VPS
  • Led requirements analysis with municipal stakeholders, translating 50+ user stories into technical specifications with Figma prototypes and API contracts
WordPress Multisite PHP/MySQL Redis Caching Figma/Prototyping Security Best Practices

Karela Games

Feb 2019 - Feb 2020

Game Developer and Product Lead

  • Executed end-to-end game development in C# Unity with 1K+ organic downloads
  • Orchestrated creation of three games: Frustration Pill, Texting and Walking, Vertical Challenge
  • Integrated Facebook analytics and conducted market research increasing downloads by 50% across titles
  • Collaborated with publisher Super Developers implementing Agile methodologies to streamline prototype development and improve team productivity
C# Unity Game Development Analytics

Education

Bachelor of Science in Biomedical Engineering

  • Engineering foundation with focus on systems thinking, biomechanics, and medical device design principles
  • Coursework in signal processing, data structures, algorithms, and computational modeling that directly translates to scalable software architecture
  • Applied engineering rigor to software development, bringing disciplined approach to system design and performance optimization
Systems Engineering Signal Processing Computational Modeling Data Structures