Sajal Regmi

Engineer

arkios.sh — zsh
status --project
currently leading arkios.ai at Karela Technologies Inc.
focus
LLM · AI · Infrastructure · System Architecture
socials
_

Engineering Works

2026 Deep Learning

Transformer from Scratch: Building the Architecture That Powers Modern AI

A comprehensive deep-dive into implementing the Transformer architecture from the ground up. Covers self-attention mechanisms, positional encodings, multi-head attention, feed-forward networks, and the encoder-decoder structure that revolutionized NLP and became the foundation for GPT, BERT, and modern LLMs.

Transformers Deep Learning Attention Mechanism PyTorch NLP
📦 View on GitHub
2025 Real-time Collaboration

Enabling Realtime Doodling Feature on Lecturely.ai with Stroke Level Precision

Deep technical exploration of implementing pixel-perfect realtime doodling capabilities in a collaborative learning platform. Covers stroke extraction algorithms, OT/CRDT integration for conflict resolution, and performance optimization strategies for buttery-smooth 60fps rendering on mobile devices.

React Native iOS PencilKit Real-time OT/CRDT
2025 Microservices

Ins and Outs of a True Microservice Architecture

Comprehensive analysis of building production-ready microservices beyond the hype. Explores service boundaries, inter-service communication patterns, distributed tracing, circuit breakers, and the operational complexity that comes with distributed systems at scale.

Node.js Microservices System Design Architecture Distributed Systems
2025 Mobile Development

Developing a Native Bridge Between React Native and Fabric

Technical deep-dive into building high-performance native bridges connecting React Native's JavaScript layer with iOS native capabilities. Covers Objective-C++ integration, synchronous vs asynchronous bridge patterns, memory management, and performance benchmarking for complex native modules.

React Native iOS Native Bridge Objective-C++ Fabric

Open Source & Research

React Native Markdown Editor

📦 npm Package

Comprehensive npm package for rendering markdown content in React Native applications with full scientific equation LaTeX support. Features custom parsing engine, performance-optimized rendering, and native module integration for mathematical expressions.

React Native Markdown LaTeX npm

Enhanced port of react-native-pencil-kit enabling granular stroke-level operations. Supports individual stroke addition, deletion, and management capabilities essential for building real-time collaborative drawing applications with pixel-perfect precision.

React Native PencilKit iOS Real-time Collaboration

GPT Semantic Cache

⚡ LLM Tool

Intelligent npm package for caching LLM responses based on semantic similarity rather than exact string matching. Reduces API costs and latency by 60-80% through vector embeddings, similarity search, and context-aware cache invalidation strategies.

LLM Caching Vector Embeddings npm

Academic research paper presenting novel approaches to LLM response caching using semantic similarity metrics. Details architecture, algorithms, and performance benchmarks showing significant improvements in response time and cost reduction for production AI applications.

Research LLM Semantic Caching Academic

Work Experience

Engineer

  • Built a full-featured IDE for firmware engineers with integrated hardware debugging, crash analysis, and real-time serial log monitoring—enabling end-to-end embedded development in a single tool
  • Implemented live variable watch, firmware flashing with verification, and RTOS inspection (task states and per-task stack usage) to give engineers deep runtime visibility into embedded systems
  • Developed a function profiler for performance analysis, automated device connection detection, and debug adapter diagnostics tooling to accelerate the firmware development cycle
  • Engineered low-latency C++ serial communication drivers optimized for high-throughput, real-time device interaction with minimal protocol overhead
  • Architected a debug orchestration pipeline supporting multiple hardware programmers with built-in flash verification and serial-based test automation
C++ Embedded Systems RTOS Serial Communication Low-Latency Drivers Firmware Tooling

Lead Engineer & CEO

  • Led the engineering team to design and build an AI-powered enterprise platform enabling organizations to onboard teams and leverage AI agents for operations, automation, and software development, with a full role-based access control model and enterprise-grade security
  • Architected Software Foundry—a software factory module that provisions fully isolated, per-tenant cloud environments via infrastructure-as-code, allowing an AI agent to scaffold, build, and iterate on full-stack applications through natural language
  • Designed a high-throughput distributed RPC communication layer between platform servers and per-tenant compute nodes, with mutual TLS authentication and network-level isolation
  • Implemented zero-trust security architecture: certificate-based service authentication, automated secrets management at provisioning time, and full infrastructure lifecycle management (provision and teardown per tenant)
  • Engineered the core platform server handling authentication, billing, team management, and AI agent orchestration, backed by relational and in-memory data stores on cloud infrastructure
Node.js / Express.js gRPC AWS Terraform PostgreSQL Redis AI Agents Distributed Systems Enterprise Architecture TypeScript

PUDO Nepal Pvt Ltd

Aug 2020 - July 2021

Project Lead

  • Oversaw development of revolutionary last-mile delivery platform, projected to increase efficiency by 30%
  • Strategically located 170 PUDO stations to serve 500K+ potential customers in Kathmandu
  • Conducted market research influencing feature integration for 20% efficiency improvement
  • Collaborated on revenue model projecting support for 2000 stations and 300 delivery workers
  • Awarded 5% stock options for outstanding performance
Market Research Product Strategy UX Design

Karyathalo IT and Web Solutions

Aug 2020 - July 2021

Product Development Intern

  • Delivered e-governance platform for Kathmandu Metropolitan City (5M users) using WordPress multisite with custom plugin architecture, implementing role-based access control, audit logging, and form validation reducing physical visits by 30% (200K+ requests/month)
  • Engineered MySQL query optimization with indexing strategies and Redis object caching, reducing page load time from 4.2s→1.1s (74% improvement) and supporting 2K concurrent users on single VPS
  • Led requirements analysis with municipal stakeholders, translating 50+ user stories into technical specifications with Figma prototypes and API contracts
WordPress Multisite PHP/MySQL Redis Caching Figma/Prototyping Security Best Practices

Karela Games

Feb 2019 - Feb 2020

Game Developer and Product Lead

  • Executed end-to-end game development in C# Unity with 1K+ organic downloads
  • Orchestrated creation of three games: Frustration Pill, Texting and Walking, Vertical Challenge
  • Integrated Facebook analytics and conducted market research increasing downloads by 50% across titles.
  • Collaborated with publisher Super Developers implementing Agile methodologies to streamline prototype development and improve team productivity.
C# Unity Game Development Analytics

Education

Bachelor of Science in Biomedical Engineering

  • Engineering foundation with focus on systems thinking, biomechanics, and medical device design principles
  • Coursework in signal processing, data structures, algorithms, and computational modeling that directly translates to scalable software architecture
  • Applied engineering rigor to software development, bringing disciplined approach to system design and performance optimization
Systems Engineering Signal Processing Computational Modeling Data Structures