kaushal07wick/README.md

Kaushal Kumar Choudhary 👋

Python · FastAPI · ML · LLM · CUDA · Rust
Systems-first ML engineer who ships production inference, data pipelines, and fast backend services.


What I build

  • Production-grade backend services and APIs with Python and FastAPI for model serving and tooling.
  • Reliable data ingestion and ETL for analytics and retrieval-augmented workflows.
  • LLM infra: prompt iteration, eval loops, lightweight retrievers and RAG orchestration.
  • Performance work: CUDA-aware inference optimizations and low-latency Rust sidecars.

Experience highlights

  • Built FastAPI backends that expose model outputs, instruments, and real-time endpoints used by product teams.
  • Designed and ran ingestion pipelines and ETL that feed OLAP and retrieval systems.
  • Implemented evaluation and ranking loops to measure and improve LLM output quality.
  • Wrote Rust watcher/updater services for low-latency node monitoring and state propagation.

Selected projects


Tech stack

Python · FastAPI · AsyncIO · SQL · Postgres · ClickHouse · Docker · Airflow · CUDA · PyTorch · vLLM · Rust · Git · CI/CD · Logging · Metrics


Quick wins I bring

  • Make model outputs production-ready with clear APIs, validation, and observability.
  • Turn prototyped LLM flows into repeatable RAG pipelines.
  • Reduce latency and cost through targeted inference optimizations.
  • Ship small, high-impact features fast in early-stage teams.

Contact

Email: [email protected]
Twitter: https://twitter.com/ofcboogeyman
LinkedIn: https://www.linkedin.com/in/11kaushalkumar/

Pinned

  1. breeboost (Public)

    Bree data science project: detecting fraud among credit users using synthetic transactional data.

    HTML

  2. donna (Public)

    Python

  3. FamAgent (Public)

    Python

  4. FinSight (Public)

    AI-powered analysis and vector search for JPMorgan Chase & Co. earnings call transcripts.

    Python

  5. mercor_git_app (Public)

    Python

  6. quartermaster-data (Public)

    Python