Llama 3, an open-source large language model, currently ranks at the top among open-source LLMs, and its next release is expected to match GPT-4. Many companies are choosing fine-tuned versions of Llama 3 over proprietary models such as ChatGPT because of privacy concerns. Deploying high-throughput LLMs, however, is complex and costly. MonsterGPT, a fine-tuning and deployment agent, lets developers deploy LLMs by asking in natural language, with no coding or infrastructure setup. With MonsterGPT, users can fine-tune an LLM on their own dataset, deploy an open-source LLM as an API endpoint, and deploy their fine-tuned LLMs as API endpoints. The system automatically recommends a GPU with enough VRAM for the selected model and returns a deployment URL that exposes granular control over generation parameters such as max tokens, streaming, top_p, and top_k. MonsterGPT is presented as the easiest, fastest, and most affordable way to deploy AI models currently available.
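
To make the generation parameters mentioned above concrete, here is a minimal Python sketch of how a client might assemble a request body for a deployed endpoint. It is an illustration only: the JSON field names (`prompt`, `max_tokens`, `stream`, `top_p`, `top_k`), the default values, and the helper names `GenerationParams` and `build_request_body` are assumptions, not MonsterGPT's documented API. Check the actual request schema of your deployment URL before using anything like this.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class GenerationParams:
    """Hypothetical container for the sampling controls named in the text."""
    max_tokens: int = 256   # upper bound on the number of generated tokens
    stream: bool = False    # whether tokens should be streamed back
    top_p: float = 0.9      # nucleus sampling probability mass
    top_k: int = 40         # sample only from the k most likely tokens

    def validate(self) -> None:
        # Reject values that make no sense for these sampling controls.
        if self.max_tokens <= 0:
            raise ValueError("max_tokens must be positive")
        if not 0.0 < self.top_p <= 1.0:
            raise ValueError("top_p must be in (0, 1]")
        if self.top_k < 1:
            raise ValueError("top_k must be at least 1")


def build_request_body(prompt: str, params: GenerationParams) -> str:
    """Return a JSON string combining the prompt with validated parameters.

    The field layout is an assumed schema, not a confirmed one.
    """
    params.validate()
    return json.dumps({"prompt": prompt, **asdict(params)})


if __name__ == "__main__":
    # Print the body a client would POST to its deployment URL.
    print(build_request_body("Summarize this ticket.", GenerationParams(max_tokens=128)))
```

The resulting string could then be sent as the body of an HTTP POST to the deployment URL with whatever authentication header the service requires. Validating on the client side before sending is a design choice that surfaces bad parameter values early, without spending a request on them.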