AI System Optimization with TensorRT Engines
Implemented TensorRT-based neural network graph optimization for AI systems, resulting in significant performance improvements.
ML Engineer
Software Architect
AI Developer
C++
CUDA
Fine-Tuning Llama-2 for Domain-specific Question Answering
In this project, I start with the 7B-parameter Llama-2 model and fine-tune it for question answering in two new domains.
ML Engineer
Software Engineer
AI Developer
LangChain
Python
PyTorch
3D Object Detection from Fisheye Images without Training Images
Implemented a method from a published paper that detects 3D objects in fisheye camera images without training images, using a vanilla 3D object detector.
ML Engineer
Software Engineer
AI Developer
C++
Python
scikit-learn
Optimizing Neural Networks via Structural Model Pruning - PyTorch
Optimizes neural networks via model pruning. The project highlights how effectively pruning increases inference speed while retaining accuracy.
ML Engineer
Software Engineer
AI Developer
Python
PyTorch