GPT-2 from fine-tuning to production

Wasiq Malik

AI Model Developer
AI Developer
AWS
Python
PyTorch
• Fine-tuned GPT-2 to generate full-length text articles based on an input prompt and its semantics.
• Built training/inference pipelines using huggingface transformers and a REST API using cortex.dev
• Deployed on an AWS EKS cluster with V100 GPUs, resulting in a 2-4s inference time.
• Provided support for production bugs and maintenance after go-live for writeme.ai
Partner With Wasiq
View Services

More Projects by Wasiq