UUP.Labs is hiringAI Agent Engineer
Overview:
UP.Labs is a dynamic venture studio dedicated to building innovative startup companies from the ground up. Our team thrives on solving complex problems, driving technological advancements, and creating impactful digital products. We're seeking a skilled AI Agent Engineer to join our growing team and contribute to our mission of launching the next wave of successful startups.
Technical Challenge:
As an AI Agent Engineer at UP.Labs, you will design, implement, and deploy agentic AI workflows — systems where LLMs orchestrate multi-step reasoning, tool use, and decision-making — to power real-time tooling across manufacturing, logistics, and supply chain domains. You will be responsible for building solutions that behave predictably and produce near-deterministic outputs in production environments. This is a hands-on role requiring strong technical expertise, creativity, and a passion for innovation in the transportation industry.
In this role you will:
- Design, build, and deploy agentic workflows (multi-step LLM chains with tool calling, retrieval, and structured output) for real-time, business-critical use cases.
- Engineer for determinism and consistency by implementing constrained decoding, structured outputs, caching layers, and evaluation harnesses.
- Build and maintain evaluation and regression frameworks — automated pipelines that measure accuracy, latency, and behavioral consistency across prompt and model changes.
- Integrate LLM agents with external tools and APIs (databases, rules engines, business systems) using frameworks like LangFuse, LangChain, LangGraph, CrewAI, or custom orchestration.
- Deploy agentic systems on cloud infrastructure (AWS, Azure, and/or GCP), optimizing for low-latency inference and cost efficiency.
- Implement guardrails, fallback logic, and observability to ensure agents fail gracefully and every decision is traceable.
- Collaborate with data scientists, software engineers, and business stakeholders to translate business rules into agent behavior and tool definitions.
- Stay current with the latest advancements in AI agents, large language models, and cloud technologies.
Required Skills:
- Practical, hands-on experience building and deploying agentic AI systems in production environments.
- Proficiency in Python and experience building production backend systems.
- Experience with LLM APIs (OpenAI, Anthropic, etc.) and agentic frameworks (LangFuse, LangChain, LangGraph, CrewAI, AutoGen, or equivalent).
- Strong understanding of prompt engineering for reliability: structured outputs, few-shot patterns, chain-of-thought, and techniques that minimize hallucination.
- Experience building evaluation and testing pipelines for AI systems, including behavioral evals and golden-set testing.
- Expertise in at least one major cloud provider (AWS, Azure, and/or GCP).
- Familiarity with Databricks, including experience working with its data engineering and analytics capabilities.
- Familiarity with vector databases (Pinecone, Weaviate, pgvector) and retrieval-augmented generation (RAG) patterns.
- Solid knowledge of version control systems (e.g., Git) and CI/CD pipelines.
- Strong problem-solving skills and ability to work collaboratively across teams.
Preferred Expertise:
- Advanced degree (Master's or PhD) in Computer Science, Machine Learning, or a related field.
- Expertise in containerized deployment with Docker.
- Experience building systems where AI outputs feed directly into business-critical decisions.
- Experience in the transportation and logistics industry.
- Familiarity with MLOps/LLMOps tooling.
- Experience with fine-tuning or distillation to optimize for speed and cost at inference time.
- Knowledge of rules engines or constraint solvers and how to combine them with LLM reasoning.
UP.Labs Summary:
We build high-growth technology startups that enable faster, cleaner, and safer movement of people and goods. Our vision is to transform the moving world by pairing leading corporations and entrepreneurs with a proven methodology for launching and scaling software and hardware companies.
We work with corporate investors over a multi-year period to launch a portfolio of mobility-focused ventures. Our team is dedicated to the first year of a new venture’s life cycle, from ideation to minimum viable product build (and beyond) to recruiting and hiring the full-time team who will scale the business.
Location: Remote
Apply for this job
Please let UP.Labs know you discovered this position on TRYremote so we can keep providing you with quality remote tech jobs.
