# Senior Data Engineer (LLM)

> Vecten · Warsaw, Poland (Remote) · — · Posted 2026-01-19

**Salary:** unknown currency 20,000–26,000

**Workplace:** remote

## Description

We are an AI-native data and technology partner for private capital and healthcare. Founded in 2010 and headquartered in Warsaw, we work with leading PE firms, VC funds, and healthcare organizations to build proprietary data infrastructure, deploy AI solutions, and drive AI-native transformation.

Our clients manage a cumulative $210B+ in assets. Our average engagement runs five years. Our NPS sits above 80. We don't need to claim credibility — we can show it.

We've also done to ourselves what we now do for clients. We've restructured our own company around AI — tools, policies, roles, delivery models. This isn't a pitch. It's a playbook we've already run, and we're hiring the engineers who will run it for others.  

### The project:

We are carrying out the project for our client, an American private equity and investment management fund - listed on the Forbes 500 list - based in New York.

We support them in the area of the infrastructure and data platform, and very recently we also build and experiment with Gen AI applications. The client operates very widely in the world of finance, loans, investments and real estate.

As a Senior Data Engineer with Frontend Focus you'll design and implement core systems that enable data science, data visualization, and agent-based applications at companies that implement data-driven decision processes to create a competitive advantage.

You'll build a data platform and internal agent-based tooling for data and business teams, including data pipeline orchestrator, data warehouses, and authenticated frontend interfaces, using:

**Technologies:** Python, Terraform, SQL, Pandas, Shell scripts, Next.js, React, FastAPI  
**Tools**: git, Docker, Snowflake, Pinecone, Neo4j, Jenkins, Jupyter Notebook, OpenAI API, Apache Airflow / Astronomer, Kubernetes, Artifactory, Linux  
**AWS**: EC2, ECS, ELB, IAM, RDS, Route53, S3, VPCs  
**Best Practices**: Continuous Integration, Code Reviews, OAuth2/OIDC authentication, observability

The ideal candidate will be well organized, eager to constantly improve and learn, driven and, most of all - a team player!

**Your responsibilities will include:**

-   Developing PoCs using latest technologies, experimenting with third party integrations
-   Delivering production grade applications once PoCs are validated
-   Creating solutions that enable data scientists and business analysts to be self-sufficient as much as possible
-   Designing and implementing secure, scalable access patterns (OAuth2/OIDC, authorization boundaries)
-   Finding new ways how to leverage Gen AI applications and underlying vector and graph data storages
-   Contributing across the stack including FastAPI backend services and agent-driven workflows
-   Contributing data technology stacks including data warehouses and ETL pipelines
-   Building data flows for fetching, aggregation and data modeling using batch and streaming pipelines
-   Documenting design decisions before implementation

## Requirements

What's important for us?

-   At least 5+ years of professional experience in data-related or full-stack engineering role
-   Undergraduate or graduate degree in Computer Science, Engineering, Mathematics, or similar
-   Expertise in Python and SQL languages
-   Experience with data warehouses (Snowflake) and different database technologies (RDBMS, vector, graphs)
-   Proven experience building secure, scalable web applications
-   Expertise in AWS stack and services, proficiency in using Docker
-   Experience with infrastructure-as-code tools, like Terraform
-   Excellent command in spoken and written English, at least C1
-   Creative problem-solving skills and excellent technical documentation
-   Ability to work with both Windows and Unix-like operating systems

**You will score extra points for:**

-   Experience with FastAPI or other Python web frameworks
-   Proficiency in Next.js / React for frontend development
-   Knowledge of OAuth2/OIDC flows, PKCE, token/session handling
-   Experience with integrating LLMs (OpenAI but also others, maybe open source)
-   Agent frameworks experience (OpenAI Agents, Agno, Strands) or MCP-style integrations
-   Understanding of LLMs fine tuning, embedding and vector semantic searching
-   Experience with Pinecone or Neo4j
-   Observability stacks experience (OpenTelemetry or similar)
-   Experience in building ETL processes and data pipelines with platforms like Airflow
-   AWS ECS, VPCs, and secure networking patterns
-   Proficiency in statistics and machine learning with Python libraries
-   Experience in working with repository manager, for example Jfrog Artifactory

## Benefits

**What do we offer?**

-   Working alongside a talented team of software engineers who are changing the image of Poland abroad
-   Culture of teamwork, professional development and knowledge sharing ([https://www.youtube.com/user/sunscraperscom](https://www.youtube.com/user/sunscraperscom))
-   Flexible working hours and remote work possibility
-   Comfortable office in central Warsaw, equipped with all the necessary tools for conquering the universe (Macbook Pro/Dell, external screen, ergonomic chairs)

  

Sounds like a perfect place for you? Don’t hesitate to click apply and submit your application today!

## Apply

[Apply at Vecten](https://apply.workable.com/vecten/j/815DFCCBEE/apply)

---
Powered by [Workable](https://www.workable.com)
