Develop and optimize the machine learning models that power Point11's AI Discovery and Agents products. You'll work on LLM integration, prompt engineering, retrieval-augmented generation, and real-time inference pipelines serving enterprise customers.
Responsibilities
Build and fine-tune ML models for AI discovery, conversational agents, and ad optimization
Design and implement RAG pipelines and vector search infrastructure
Optimize model inference for sub-100ms response times at scale
Evaluate and integrate new foundation models (GPT, Claude, Gemini) as they are released
Collaborate with engineering and product to translate research into production features
Qualifications
3+ years of experience in ML engineering or applied AI
Strong experience with Python, PyTorch or TensorFlow, and LLM frameworks such as LangChain or LlamaIndex
Hands-on experience with vector databases (Pinecone, Weaviate, or similar)
Understanding of transformer architectures and modern NLP techniques
Experience deploying models in production environments
What We Offer
Competitive salary and meaningful equity
Work on cutting-edge AI with real enterprise impact
Fully remote with flexible hours
Health, dental, and vision insurance
GPU credits and access to the latest models for experimentation
Interested in this role?
Apply now or reach out to learn more about working at Point11.