Skip to content
View baudhya's full-sized avatar

Block or report baudhya

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
baudhya/README.md

Hi ๐Ÿ‘‹, I'm Siddharth Baudh

Scientist at DRDO India | Ex-IISc | Systems & ML Infra Enthusiast

baudhya


๐ŸŽ“ M.Tech in Computer Science, Indian Institute of Science (IISc), Bangalore
๐Ÿ”ฌ Scientist, Defence Research and Development Organisation (DRDO), India
๐Ÿ’ผ Former Software Engineer @ Finoit Technologies
๐Ÿ“ซ Reach me: siddharthbaudhya6216@gmail.com | LinkedIn


๐Ÿ› ๏ธ Interests & Specialties

  • Systems Programming: eBPF, Kernel Internals, OS-Level Instrumentation
  • Compilers & Security: LLVM FunctionPass, syscall tracing, sandboxing
  • Machine Learning Infra: CUDA, AVX512, parallel training, PyTorch, Ring-AllReduce
  • High-Performance Systems: Branch predictors, simulation (ChampSim), memory checkpointing
  • Languages & Tools: C++, Python3, eBPF, CUDA, Docker, Git, FastAPI, Celery

๐Ÿš€ Highlight Projects

  • ๐Ÿงฉ Memory Checkpointing with eBPF โ€“ Checkpoints heap memory at runtime using syscall-level instrumentation
  • ๐Ÿ” In-Kernel Sandboxing โ€“ Combines LLVM IR instrumentation + eBPF for enforcing syscall-level policies
  • โš™๏ธ Optimizing Dilated Convolution โ€“ CUDA + SIMD (AVX2/AVX-512) version achieving up to 800x speedup
  • ๐Ÿ“ˆ Data-Parallel LeNet โ€“ Distributed training using Ring-AllReduce; 50% faster with same accuracy

๐Ÿง  Background

I'm passionate about building robust, high-performance systems at the intersection of systems software, machine learning infrastructure, and computer security. I enjoy working close to the OS and hardware, optimizing performance, and developing infrastructure for future AI systems.


๐Ÿ“Œ Letโ€™s Connect

Open to collaborations, research projects, and deep tech discussions in OS, ML systems, compilers, and secure computing.

๐Ÿ“ฌ Email: siddharthbaudhya6216@gmail.com
๐Ÿ”— LinkedIn: linkedin.com/in/baudhya

Pinned Loading

  1. dilated-convolution-optimization dilated-convolution-optimization Public

    Optimizing Performance of Dilated Convolution (DC) - Single-threaded , multi-threaded and GPU

    C++

  2. In-Kernel-Per-Process-Sandbox In-Kernel-Per-Process-Sandbox Public

    Linux kernel-level enforcement engine to enforce policy.

    Python

  3. memory-checkpoint-ebpf memory-checkpoint-ebpf Public

    A cutting-edge eBPF-based solution for efficient memory checkpointing and restoration, enabling lightweight process monitoring, low-latency execution, and seamless state recoveryโ€”offering a modern โ€ฆ

    Python

  4. PageRank PageRank Public

    Python

  5. spinlocks spinlocks Public

    Spinlocks are synchronization primitives used in concurrent programming to achieve mutual exclusion by having threads repeatedly check (or "spin") for lock availability, which is efficient when thโ€ฆ

    C++

  6. rat-race-game rat-race-game Public

    Python