Arslan Mehmood - AI Engineer | ContraWork by Arslan Mehmood

Arslan Mehmood

ML AI | Backend | Computer Vision | GenAI | LLM Agents

New to Contra

Arslan is ready for their next project!

Lahore, Pakistan

Work Posts Services About

Lahore, Pakistan

Cover image for LakeShield - AI-Powered Video Monitoring

LakeShield - AI-Powered Video Monitoring and Vessel Intelligence Platform I led the development of LakeShield as the Senior AI/ML Engineer and Lead Developer, taking the platform from initial research and experimentation to a scalable production system. My responsibilities included: 🔹 Designing the end-to-end AI and video-processing architecture 🔹 Building YOLO-based boat and vehicle detection pipelines 🔹 Developing object tracking and movement-analysis workflows 🔹 Implementing OCR for extracting boat registration information 🔹 Creating scalable pipelines for processing thousands of surveillance videos 🔹 Developing FastAPI backend services and automated data workflows 🔹 Building a Next.js analytics dashboard integrated with Supabase 🔹 Deploying and operating the AI pipeline on cloud GPU infrastructure 🔹 Optimizing model accuracy, inference speed, infrastructure costs, and reliability 🔹 Managing production monitoring, troubleshooting, maintenance, and continuous improvements The platform transforms raw surveillance footage into structured operational insights, enabling automated vessel monitoring, vehicle activity analysis, registration extraction, and reporting. This project involved complete technical ownership across Computer Vision, AI/ML, backend development, cloud infrastructure, data engineering, MLOps, and production operations. #ComputerVision #VideoAnalytics #ArtificialIntelligence #ObjectDetection #OCR #MLOps #FastAPI #NextJS #Supabase #CloudEngineering

Cover image for Shelfr - AI-Powered Retail Shelf

Shelfr - AI-Powered Retail Shelf Intelligence Platform I led the development of Shelfr as the Senior Computer Vision Engineer and Lead Developer, taking the platform from the initial idea and system architecture through development, deployment, and production operations. My work included: 🔹 Designing the complete computer vision and backend architecture 🔹 Building product detection, shelf analysis, OCR, and image-processing pipelines 🔹 Developing APIs and scalable data-processing workflows 🔹 Deploying and managing production systems on GCP cloud servers 🔹 Optimizing model accuracy, processing speed, and infrastructure performance 🔹 Managing production monitoring, reliability, troubleshooting, and ongoing improvements 🔹 Leading technical decisions across AI, backend, cloud infrastructure, and DevOps The platform converts real-world retail shelf images into structured product and shelf-level insights, helping automate retail auditing, product visibility analysis, and inventory workflows. #ComputerVision #RetailAI #LeadDeveloper #AIEngineering #GCP #MLOps #Python #CloudEngineering

⚖️ Built a French Legal AI Assistant powered by advanced RAG and LLM technology. The system enables users to ask complex legal questions and receive accurate, context-aware answers grounded in French legal documents. Key features include: 🔹 Custom legal document ingestion and chunking 🔹 Metadata-based vector search 🔹 Hybrid retrieval and reranking 🔹 Agentic RAG workflows using LangGraph 🔹 Source-grounded answers with legal references 🔹 Private deployment on an Azure VM using locally hosted LLMs The main focus was improving retrieval accuracy, reducing hallucinations, and making large collections of legal documents easier to search and understand. #LegalAI #RAG #LLM #ArtificialIntelligence #LangGraph #Azure #GenerativeAI #MachineLearning

Cover image for AI-Powered PDF Data Extraction
My role:

AI-Powered PDF Data Extraction My role: AI Data Processing and Extracton Engineer Organizations often struggle to extract structured and useful information from large volumes of unstructured PDF documents. I developed a flexible AI-powered data extraction solution that allows users to define the specific entities and fields they want to retrieve. The system processes different PDF formats, identifies relevant information, and converts it into structured, usable data. The solution reduces manual document processing, improves retrieval accuracy, and can be adapted to different document types and business requirements. A working demo link is attached.

Cover image for AI Agents & RAG Chatbots

AI Agents & RAG Chatbots with Persistent Memory I design and build intelligent AI agents and chatbots that maintain conversation context, retrieve reliable information, and interact with external tools and APIs. Core Capabilities 🔹 Persistent conversation and long-term memory 🔹 RAG-powered answers with reduced hallucinations 🔹 Tool calling, APIs, web search, and file retrieval 🔹 Multi-agent and multi-step workflows 🔹 Integration with OpenAI, Claude, Gemini, and open-source LLMs 🔹 Vector databases including pgvector, Pinecone, Weaviate, FAISS, and ChromaDB Technologies LangGraph, LangChain, Agno, PydanticAI, Haystack, FastAPI, OpenAI, Claude, Gemini, Hugging Face, PostgreSQL, pgvector, Pinecone, and Weaviate

Cover image for AI Vision for Retail, Industrial

AI Vision for Retail, Industrial & Monitoring Workflows Overview I have built and deployed multiple real-world computer vision systems for industrial inspection, retail automation, and monitoring workflows. My responsibilities covered: 🔹 Dataset preparation and labeling 🔹 Object detection model training 🔹 Segmentation model training 🔹 YOLO-based detection and tracking 🔹 Image/video inference pipeline development 🔹 Model evaluation and threshold tuning 🔹 Production deployment support 🔹 Cloud server management and optimization 🔹 Building practical AI workflows for real-world operational environments Fish Quality Inspection System - lythium.cl (http://lythium.cl) I led the development of an advanced fish quality inspection solution for an industrial workflow. The system used image analysis to monitor fish quality and support automated fish sorting based on AI predictions. 🔹 Led the development of an advanced AI-powered fish quality inspection system for an industrial workflow. 🔹 Built an image analysis pipeline to monitor fish quality from production-line images. 🔹 Trained object detection models to identify fish and relevant visual quality indicators. 🔹 Trained segmentation models to support more detailed visual inspection of fish regions. 🔹 Designed the AI workflow to support automated fish sorting based on model predictions. 🔹 Worked on inspection logic that could classify or route fish based on quality-related outputs. 🔹 Designed the system for conveyor-belt usage, where images need to be processed consistently and reliably. 🔹 Focused on production issues such as image quality, camera consistency, lighting variation, and model reliability. 🔹 Helped convert visual inspection from a manual/rule-based workflow into an AI-supported inspection pipeline. 🔹 Built the system to reduce manual inspection effort and improve production workflow efficiency. Shelfr.ai (http://Shelfr.ai) - Retail Automation Platform I developed AI image solutions for retail automation and execution. The system handled large-scale product detection across 10,575+ SKUs, price tag detection, shelf and display type detection, and gap detection for empty shelf spaces. 🔹 Developed large-scale AI image solutions for retail automation and execution. 🔹 Worked on product detection across 10,575+ SKUs, where each SKU represented a unique product. 🔹 Built object detection workflows to identify products from retail shelf images. 🔹 Developed price tag detection to locate and extract price label areas from store images. 🔹 Worked on shelf and display type detection to understand the retail environment layout. 🔹 Built gap detection logic to identify empty shelf spaces and out-of-stock areas. 🔹 Supported computer vision workflows for retail compliance, shelf monitoring, and store execution. 🔹 Worked with high-volume image data and production-level inference requirements. 🔹 Managed high-load production servers on Google Cloud Platform. 🔹 Implemented load balancing and autoscaling to improve system stability under production traffic. 🔹 Focused on scalable AI infrastructure capable of handling real-world retail image workloads. 🔹 Helped create AI systems for inventory visibility, shelf condition monitoring, and retail execution analytics. lake-shield.com (http://lake-shield.com) - USA LAKES - Boat Detection & Inspection System 🔹 Worked on a YOLO-based boat detection, tracking, and monitoring system. 🔹 Labeled datasets for boat detection and inspection model training. 🔹 Prepared image/video data for object detection training workflows. 🔹 Trained YOLO object detection models to detect boats in monitoring footage. 🔹 Built a detection pipeline capable of identifying boats from visual data. 🔹 Worked on boat tracking logic to monitor boat movement across frames. 🔹 Supported inspection and monitoring workflows using computer vision predictions. 🔹 Developed an end-to-end pipeline from labeled data to trained model and inference output. 🔹 Focused on practical model performance in outdoor environments where lighting, distance, angle, and background can vary. 🔹 Helped build a monitoring system that could support automated detection and review instead of fully manual observation. My Responsibilities Across These Projects 🔹 Led AI/computer vision system development 🔹 Designed labeling and dataset preparation workflows 🔹 Trained YOLO/object detection models 🔹 Trained segmentation models where needed 🔹 Built image and video inference pipelines 🔹 Evaluated models using practical production metrics 🔹 Improved model performance through dataset cleanup, retraining, and threshold tuning 🔹 Integrated AI models into backend or operational workflows 🔹 Supported production deployment and infrastructure optimization 🔹 Worked with real-world constraints such as lighting, camera angle, image quality, latency, and false detection rates Technologies Used 🔹 Python 🔹 YOLO / YOLOv8 🔹 Object Detection 🔹 Image Segmentation 🔹 OpenCV 🔹 PyTorch 🔹 FastAPI 🔹 Google Cloud Platform 🔹 Linux Servers 🔹 Load Balancing 🔹 Autoscaling 🔹 Custom Data Labeling Workflows 🔹 Model Training 🔹 Model Evaluation 🔹 Inference Pipeline Development 🔹 Production AI Deployment

Cover image for French Legal AI Assistant &

French Legal AI Assistant & Agentic RAG System Overview I designed, built, and deployed a specialized Legal AI Assistant for French lawyers using agentic RAG, legal data pipelines, vector search, reranking, open-source LLMs, and citation-grounded answer generation. The system allowed lawyers to ask legal questions and receive answers grounded in French law articles, legal references, and relevant judicial cases. Problem / Challenge Legal data is very different from normal document data. A generic RAG pipeline using fixed-size chunks often breaks legal meaning, misses important context, or retrieves incomplete references. The main challenges were: 🔹 Legal documents had different structures and lengths 🔹 Articles and laws could not be randomly split into fixed-size chunks 🔹 Each answer needed traceable legal references 🔹 Retrieval had to understand legal scope, not just semantic similarity 🔹 The system needed to reduce hallucinations for legal users 🔹 Deployment had to respect privacy and regulatory requirements My Expertise I worked as the Lead AI Engineer / Agentic RAG Developer responsible for the complete system design and implementation. My responsibilities included: 🔹 Legal data pipeline architecture 🔹 Document parsing and preprocessing 🔹 Custom legal chunking strategy 🔹 Vector database design 🔹 Agentic RAG workflow development 🔹 Retrieval optimization and reranking 🔹 Open-source LLM deployment 🔹 Backend API development with FastAPI 🔹 Secure Azure cloud deployment 🔹 Multi-tenant system support French Legal Data Engineering Pipeline I built an automated ETL pipeline to process thousands of French legal documents, articles, and judicial cases. The pipeline handled: 🔹 Raw legal document ingestion 🔹 Text cleaning and normalization 🔹 Legal article extraction 🔹 Section-aware document structuring 🔹 Custom chunk generation 🔹 Metadata extraction for article number, article title, section, source, and reference 🔹 Embedding generation 🔹 Vector database ingestion 🔹 Repeatable updates for future legal data expansion The chunking strategy was designed so legal articles were not cut in the middle or separated from their meaning. Agentic RAG Workflow Instead of using a simple one-step vector search, I built a LangGraph-based agentic RAG workflow. The workflow included: 🔹 User query understanding 🔹 Legal intent detection 🔹 Legal domain and scope identification 🔹 Generation of 2–5 targeted legal search queries 🔹 Retrieval of relevant chunks for each query 🔹 Deduplication of repeated results 🔹 Reranking of retrieved legal evidence 🔹 Source-grounded answer generation This improved tested retrieval accuracy from around 50% to 95%+. Retrieval, Citations & Case Law The retrieval system was designed to make answers transparent and verifiable. I implemented: 🔹 Vector search for semantic legal retrieval 🔹 Reranking to improve relevance 🔹 Metadata-based source traceability 🔹 Citation-backed answer generation 🔹 Article-level legal references 🔹 Typesense-based retrieval for French judicial cases 🔹 Supporting case law returned with legal answers This allowed lawyers to verify the exact legal source behind each generated response. Open-Source LLM & Cloud Deployment I evaluated and deployed open-source LLM infrastructure for private legal AI usage. The deployment included: 🔹 Qwen2.5:14B for French legal reasoning 🔹 Ollama and vLLM for model serving 🔹 Embedding and reranker models on a private Azure GPU VM 🔹 NVIDIA T4 16GB GPU optimization 🔹 Python/FastAPI backend APIs 🔹 Secure Azure deployment in the France region 🔹 Multi-tenant isolated access 🔹 GitHub CI/CD and Linux server management The system was designed for privacy, reliability, and regulatory compliance. Technologies Used 🔹 Python 🔹 FastAPI 🔹 LangChain 🔹 LangGraph 🔹 LangSmith 🔹 Ollama 🔹 vLLM 🔹 Qwen2.5:14B 🔹 ChromaDB 🔹 Typesense 🔹 Vector Databases 🔹 Reranking Models 🔹 Embedding Models 🔹 Azure Cloud 🔹 Linux 🔹 GitHub CI/CD Impact 🔹 Built a production-ready legal AI assistant for lawyers 🔹 Improved retrieval accuracy from ~50% to 95%+ in tested scenarios 🔹 Reduced hallucinations through citation-grounded generation 🔹 Enabled lawyers to verify answers using article and case references 🔹 Created a scalable legal data pipeline for thousands of documents 🔹 Deployed private open-source LLM infrastructure for legal compliance 🔹 Delivered a strong foundation for future legal AI workflows