LLM Engineer

Company Description

Vyro is at the forefront of innovation, transforming content creation through advanced AI and Machine Learning technologies. As a rapidly growing Gen-AI and SaaS-focused company, we empower creativity across industries with state-of-the-art tools. Our flagship products include ImagineArt, an AI-powered design studio that turns text into stunning visuals, and Chatly, an intelligent multi-modal assistant leveraging frontier AI models for seamless task management and idea generation.

With 15+ products, over 2.5 billion images processed, and 800,000+ daily active users, Vyro is actively shaping the future of creative tools. Join our passionate team of Vyronauts to make an impact and innovate with us!

Role Description

This is a full-time, on-site role for an LLM Engineer based in Islamabad. The role involves designing, developing, and fine-tuning LLMs, building agentic AI workloads, implementing data-driven algorithms, and deploying scalable solutions. You will collaborate closely with cross-functional teams to integrate cutting-edge machine learning capabilities into Vyro’s products, while exploring new methods to enhance performance, reliability, and efficiency.

Qualifications

Experience & Education
• 4+ years of industry experience in Machine Learning or NLP
• Bachelor’s degree in Computer Science (BSCS) or a related field

Frontier Model Orchestration
• Deep experience leveraging closed-source SOTA models from OpenAI, Anthropic, bolthires, and xAI
• Strong understanding of complex reasoning, tool-use, and multi-step AI pipelines

Advanced Architectures
• Expert grasp of transformer variants and Mixture-of-Experts (MoE) architectures
• Proven hands-on experience with open-weight SOTA models such as Llama 3.x, Mistral Large, Qwen 2.5, Phi-4, etc.

Agentic Frameworks
• Mastery of multi-agent orchestration using frameworks like LangGraph (stateful agents), AutoGen, or CrewAI
• Experience implementing DSPy for declarative, self-optimizing prompt pipelines

Production RAG & Memory Systems
• Implementation experience with GraphRAG and hybrid retrieval strategies
• Expertise with vector stores (Qdrant, Milvus, Weaviate) and semantic caching for long-term agent memory

Inference Optimization
• Experience deploying high-throughput models using vLLM, TensorRT-LLM, or SGLang
• Familiarity with FlashAttention-2, KV caching, and quantization techniques (AWQ, EXL2)

Why Join Us?
• Work on innovative AI products like Chatly and ImagineArt that are shaping the future of user interaction and creativity
• Collaborate with a passionate, talented team that values experimentation, innovation, and data-driven decision-making
• Competitive salary and benefits package
• A growth-driven culture that encourages learning, ownership, and continuous improvement

Note: This is an onsite position at our office in H12, Islamabad, for residents of Pakistan. Candidates residing outside of Pakistan may be considered for remote work opportunities.