Logo Aleph Alpha

Senior AI Researcher

Job

  • Level
    Senior
  • Job Feld
    Data
  • Anstellung
    Vollzeit
  • Vertragsart
    Unbefristetes Dienstverhältnis
  • Ort
    Heidelberg
  • Arbeitsmodell
    Hybrid, Onsite
  • Job Zusammenfassung

    In diesem Job entwickelst du innovative Algorithmen zur Datenkurierung und verbesserst die Methodik für Vortraining-Korpora, während du an skalierbaren Pipelines arbeitest und verschiedene ML-Experimente analysierst.

    Job Technologien

    Deine Rolle im Team

    • As a Senior AI Researcher for Pre-training Data, you will shape and improve the underlying scientific methodology behind our pre-training corpora while also co-engineering the software and systems that enable this.
    • Working with engineers and other researchers to build scalable pipelines, you will focus on relevant theoretical and empirical research required to understand which data makes models perform best on our targeted capabilities.
    • In your day-to-day, you will design targeted ablations across various scales, derive and test hypotheses from training dynamics, develop novel algorithms for estimating data quality and performing data curation, and contribute to a range of engineering tasks which facilitate these research directions.
    • Together with a collaborative team of engineers and researchers, you will have a direct impact on the fundamental knowledge and capabilities of the models we ship.
    • You will also help or lead the writing of technical reports for internal and external readers, as well as presenting at and contributing to technical meetings and conferences on an as-needed basis.

    Unsere Erwartungen an dich

    Qualifikationen

    • A deep understanding of machine learning theory, specifically regarding foundation model training dynamics, scaling laws, and data-centric AI.
    • Familiarity with statistical methods for evaluation and experiment design.
    • Ability to reason about the information-theoretic properties of a dataset and its predictive power for evaluated tasks: not just processing data, but understanding its signal.
    • Strong Python skills and comfort with ML tooling and deep learning frameworks (especially PyTorch).
    • Willingness to relocate to Heidelberg or travel at least fortnightly.
    • A history of contributions to top-tier venues (NeurIPS, ICML, ICLR, ACL, etc.) specifically regarding data curation, scaling laws, synthetic data, or LLM pre-training.
    • Bonus, but not required: German language proficiency can be helpful for curating and assessing German-language data.

    Erfahrung

    • Experience designing and evaluating complex ML experiments related to data composition, curriculum learning, or data quality on language model training.
    • PhD in machine learning, NLP, or equivalent research experience focusing on large-scale language modeling or data curation.
    • Experience training foundation models from scratch and diagnosing data-induced training pathologies.

    Unser Angebot

    • 30 days of paid vacation.
    • Access to a variety of fitness & wellness offerings via Wellhub.
    • Mental health support through nilo.health.
    • Substantially subsidized company pension plan for your future security.
    • Subsidized Germany-wide transportation ticket.
    • Budget for additional technical equipment.
    • Flexible working hours for better work-life balance and hybrid working model.
    • Virtual Stock Option Plan.
    • JobRad Bike Lease.

    Benefits

    Gesundheit, Fitness & Fun

    Themen mit denen du dich im Job beschäftigst

    Job Standorte

    • Standort Heidelberg

      Baden-Württemberg

      Deutschland

    Das ist dein Arbeitgeber

    Aleph Alpha

    Aleph Alpha

    Als deutsches KI-Startup mit Sitz in Heidelberg fokussiert sich Aleph Alpha auf die Entwicklung von großen Sprachmodellen und generativer KI. Es bietet Lösungen für Unternehmen, die ihre eigenen KI-Kompetenzen aufbauen und ihre Daten schützen möchten.

    Description

  • Unternehmenstyp
    Startup
  • Arbeitsmodell
    Hybrid, Onsite
  • Branche
    Internet, IT, Telekom
  • Logo Aleph Alpha

    Senior AI Researcher

    Ort
    Heidelberg
    Arbeitsmodell
    Hybrid, Onsite
    Diversität
    Für alle Personen geeignet (m/w/d)

    Weitere Jobs