We are looking for a skilled Python Developer to join our AI/ML engineering team and contribute to the development of intelligent systems, data pipelines, and RAG services.
Project Work Format:
Remote work from home (office setup possible upon request)
Project documentation in Confluence
Task management in Jira
Main Tasks:
Develop and maintain backend services and data pipelines using Python 3 (functions, classes, context managers, modules, dependency management).
Work within virtual environments using venv, Poetry, or Pipenv.
Build and integrate AI/NLP solutions with OpenAI API, Hugging Face Transformers, LangChain, or LlamaIndex.
Implement and optimize RAG (Retrieval-Augmented Generation) systems, including embedding generation, tokenization, and similarity search.
Work with vector databases such as Pinecone, FAISS, Chroma, Weaviate, or Qdrant for embedding storage and retrieval.
Create and maintain API wrappers and backend services using FastAPI or Flask.
Design and execute ETL pipelines, including:
Data extraction from sources such as Google Drive, Notion, and Shopify API.
Text cleaning (HTML stripping, stopword removal, normalization).
Data storage in structured databases or vector stores.
Model calls for automated Q&A and content generation.
Manage configuration and environment variables using dotenv.
Containerize and deploy services with Docker.
Deploy and maintain applications on Render, Railway, Hugging Face Spaces, or with a basic understanding of AWS.
Work with data formats such as JSON, CSV, and YAML.
Conduct code reviews and ensure adherence to best coding practices.
What We Expect from a Candidate:
Strong proficiency in Python 3 and its ecosystem.
Experience with virtual environments (venv, Poetry, or Pipenv).
Knowledge of REST / GraphQL APIs and experience integrating with external services using requests or httpx.