Job
- Level
- Senior
- Job Feld
- Data
- Anstellung
- Vollzeit
- Vertragsart
- Unbefristetes Dienstverhältnis
- Ort
- Berlin
- Arbeitsmodell
- Full Remote
Job Zusammenfassung
In dieser Position entwickelst du skalierbare Datenpipelines und optimierst die Dateninfrastruktur, um wachsende Datenvolumina effizient zu verarbeiten und zuverlässige Datenanalysen zu unterstützen.
Job Technologien
Deine Rolle im Team
- As a Senior Data Engineer, you'll play a key role in building and scaling the data infrastructure that powers our AI-driven platform.
- You'll be responsible for designing, implementing, and optimizing reliable and scalable data pipelines that process large volumes of structured and unstructured data, from synthetic LLM prompts to large-scale web-scraped datasets, across a growing AWS-based data ecosystem.
- This role is focused on enabling rapid scale.
- Our data volume and traffic are increasing quickly as we expand to new AI channels and data sources, and we need robust, production-grade data systems that can keep pace with that growth.
- You'll work closely with engineering, product, and go-to-market teams to ensure data is reliable, observable, and reusable across the organization.
- A core part of the role will be shaping the evolution of our data platform, including contributing to the design and implementation of our Data Lake architecture.
- You'll help ensure our pipelines can handle increasing load, maintain high data quality, and support new product capabilities as we scale.
- You'll also act as a trusted technical partner across teams, helping establish data best practices, improving operational reliability, and enabling teams to use data effectively in both product and business contexts.
- Design, build, and maintain scalable data pipelines that ingest, transform, and validate large volumes of data across multiple sources and channels.
- Improve the scalability, reliability, and performance of our data pipelines to support rapidly growing workloads and new data streams.
- Contribute to the design and implementation of our Data Lake architecture, enabling reliable data storage and reuse across teams.
- Manage and optimize data ingestion workflows, including data collected from web scrapers, third-party vendors, and internal systems.
- Monitor pipeline health, investigate incidents, and implement improvements to increase system reliability and observability.
- Support the onboarding and integration of new AI channels and data sources into the platform.
- Collaborate with teams across the organization to ensure data generated by different systems can be reused effectively for analytics and business intelligence.
- Identify and resolve performance bottlenecks in distributed systems, including rate limiting, concurrency, and throughput constraints.
- Advise engineering and product teams on data architecture, data quality, and best practices for managing scalable data workflows.
- Continuously evaluate and improve our data platform to support the company's rapid growth and evolving product needs.
Unsere Erwartungen an dich
Qualifikationen
- Proficiency in Python for data processing and automation.
Erfahrung
- Strong experience building and operating scalable data pipelines in production environments.
- Hands-on experience working with Data Lakes or Data Warehouses (e.g., AWS Athena or similar technologies).
- Experience with data transformation and modeling.
- Strong experience working with AWS.
- Experience using Infrastructure-as-Code tools to manage cloud infrastructure.
- Experience working with distributed systems and managing large-scale data workflows.
- Experience implementing monitoring, observability, and incident response practices for data systems.
- Nice to have: Experience working with large-scale web scraping or external data ingestion systems.
- Nice to have: Experience supporting systems with rapidly increasing traffic or data volume.
Unser Angebot
- This role is remote in Germany.
Themen mit denen du dich im Job beschäftigst
Job Standorte
Das ist dein Arbeitgeber
Bluefish AI
Bluefish AI ist ein dynamisches Unternehmen mit Sitz in New York, das eine fortschrittliche AI-Marketing-Plattform für große Unternehmen entwickelt. Diese Plattform hilft Firmen dabei, ihre Markenpräsenz in unterschiedlichen AI-Systemen zu überwachen und zu verbessern. Mit einem bemerkenswerten Wachstum und Kunden aus etwa 10% der Fortune 500 hat sich Bluefish AI als bedeutender Player im Bereich der AI-gestützten Marketingtechnologie positioniert.
Description
- Unternehmenstyp
- Startup
- Arbeitsmodell
- Full Remote, Hybrid, Onsite
- Branche
- Werbung, Marketing, PR