EPAM is a leading global provider of digital platform engineering and development services.
We are committed to having a positive impact on our customers, our employees, and our communities.
We embrace a dynamic and inclusive culture.
Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow.
No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.We are hiring a Senior Python Engineer to help drive the evolution of an enterprise knowledge base platform.This platform is transitioning from a Confluence-focused RAG chatbot to a comprehensive agentic knowledge system.
You will play a key role in expanding the platform's capabilities, including hybrid retrieval, multi-source ingestion, evaluation pipelines, agent infrastructure, and shared chat platform primitives.ResponsibilitiesArchitect and develop backend services using Python 3.11, FastAPI, Pydantic, SQLAlchemy async, and asyncpgDesign retrieval and orchestration strategies that balance quality, latency, cost, safety, and operational simplicityBuild robust agent runtime features such as memory boundaries, tool sandboxing, permissions, and budget controlsEnhance answer grounding, failure analysis, and citation enforcement to improve reliability and accuracyEstablish observability and operational feedback loops with OpenTelemetry, Prometheus, Grafana, Docker, Helm, and GitHub ActionsCollaborate with product and engineering teams to support multiple conversational interfaces through a unified knowledge platformDevelop ingestion infrastructure for current and future content sourcesImplement observability across application, pipeline, database, and model-serving layersManage cost, latency, throughput, and failure modes for AI-intensive workloadsCreate release workflows that validate AI behavior changes beyond code compilationRequirementsMinimum 3 years of relevant professional experience in software engineeringExtensive hands‐on experience with Python in platform, automation, or infrastructure‐focused environmentsExperience building command‐line tools using Python, Golang, or RustPractical knowledge of LangGraph, LangChain, pgvector, and modern retrieval pipelinesExperience designing evaluation frameworks for LLM‐powered systems, including regression detection and quality assessmentStrong background with Docker, Helm, GitHub Actions, and Kubernetes‐based workflowsUnderstanding of embedding pipelines, vector search, and operational aspects of LLM‐driven systemsAdvanced observability skills, including metrics, tracing, dashboards, alerting, and log analysisExperience with ingestion, ETL, or large‐scale content‐processing pipelinesAbility to approach system design with considerations for reliability, cost, latency, throughput, and recoveryExcellent oral and written communication skills in English at B2+ level or higherNice to haveExperience with FastAPI for building web APIsFamiliarity with Grafana and Splunk for monitoring and log analysisKnowledge of Qdrant, Neo4j, or other vector and graph database infrastructureWe offerInternational projects with top brandsWork with global teams of highly skilled, diverse peersEmployee financial programsPaid time off and sick leaveUpskilling, reskilling and certification coursesUnlimited access to the LinkedIn Learning library and 22,000+ coursesGlobal career opportunitiesVolunteer and community involvement opportunitiesEPAM Employee GroupsAward‐winning culture recognized by Glassdoor, Newsweek and LinkedIn
#J- *-Ljbffr
By continuing you agree to our Terms & Privacy Policy.