Welcome tomy site
Welcome tomy site

@farooq-azam-khan

Machine Learning Engineer

MLE specializing in
delivering scalable AI solutions.

with VLLMs, Ollama, and NVIDIA TensorRT for optimized model serving pipelines.

interested in

(
graph algorithms
reinforcement learning algorithms
)
Enhanced LLM Agents

Bridging best parts of theory and practice to create impactful, intelligent systems.

Blog Highlights

Importance of Maximum Likelihood Estimation for Machine Learning
Learn about the Maximum Likelihood Estimator and the Gradient Descent Algorithm.
Scaling LLMs with Triton Inference Server: A Hands-on Guide
Get hands-on experience with deploying Large Language Models (LLMs) at scale using NVIDIA's Triton Inference Server.
Term Frequency-Inverse Document Frequency
In this tutorial we will look at what TF and IDF are and how they can be use to process text data for Machine learning.
Large Scale Vector Comparison
In this post, we will look at the quora qna dataset and aim to encode and compare all question pairs. The purpose of is to look at a real dataset.
Comparing Vectors with Cosine Simlarity Function
This tutorial will focus on the math behind text vector similarity using numpy, pytorch, and stentence-transformers libraries in python.

Personal AI Projects

    Race Car RL Agent
    Using Deep Q Learning to create a self driving car.
    Garbage Classification with CNN
    Efficient Net finetuned on Classifying different types of Garbage.
    Haiku Generator
    Prompt engineering experimentation to generate Haikus with LLMs.

Additional Projects

    D3-js Reference Tutorial
    Web app to reference D3 library and its many features.
    Chat App
    A Chat application built to explore web socket connections and server client relations with the protocol. App built with elm, expressjs, and websockets.
    Twitch Python Discord Bot
    Discord bot built with Python.
    Rust Tauri App
    Experimental app designed to test the features offered by Tauri library.

2025 VISON