HN Who's Hiring | 💵 paid work | by houtanb | scored by google/gemini-2.5-flash

Forecasting Research Institute (FRI) | Data Engineer | REMOTE | Full-time We're a ~20-person nonprofit doing forecasting

💵 Funded work: see “Deliver it” below for how to respond.

The take

effort: ~1-2 months

This is a job posting for a Data Engineer at the Forecasting Research Institute, a nonprofit conducting research on high-stakes problems like AI progress and biosecurity. The role involves building and maintaining ELT pipelines, managing a cloud data warehouse, and collaborating with analysts. The salary range is $75k-$130k.

Demand & gap

Painkiller | demand 86

Buyer: Enterprise / orgs

The gap in what exists: FRI needs a dedicated data engineer to manage their growing data infrastructure, moving beyond external vendors to bring this critical function in-house.
Wedge to win: Position yourself as a highly efficient, AI-augmented data engineer who can quickly onboard and deliver robust, maintainable data solutions, emphasizing experience with Python, ETL, and cloud warehouses.
Reputation value (worth doing free for proof) — 70/100: Securing this full-time role at a respected research nonprofit would be strong proof of a candidate's data engineering capabilities and their ability to work on complex, high-stakes data projects.
Likely monetization: full-time salary
Incumbents to beat: other data engineering job applicants; consulting firms providing similar data engineering services

Land this role

What they want, where you stand, and the exact résumé edits to qualify.

résumé fit 50

Biggest lever: Gain practical, hands-on experience building and managing cloud data warehouses with formal dimensional modeling and orchestration.

✓ You already have

Python, uv, data scripting, CSV/ETL glue, Postgres, REST / API design, Docker, GitHub Actions CI, serverless / cron, TypeScript, Next.js, React

+ Gaps to close

Deep ETL/ELT pipeline development for data warehousing
Strong experience with cloud data warehouses (e.g., Snowflake, BigQuery, Redshift)

Deliver it

A starter prompt for Claude Code, what you'll need, and how to reach them.

You are a senior data engineer. Your task is to outline a detailed, step-by-step plan for implementing and maintaining ELT pipelines and a dimensional data warehouse for the Forecasting Research Institute (FRI). FRI is a nonprofit collecting forecasting data from various sources (surveys, expert panels, AI systems) and needs to transition from an external vendor to in-house ownership. The core technology stack for the implementation should leverage Python for scripting, an orchestration tool like Apache Airflow or Dagster, a cloud data warehouse (e.g., Snowflake, BigQuery, or Redshift, assuming one is already in use or will be chosen based on current vendor setup, but specify a general approach), and SQL for dimensional modeling. Focus on building robust, scalable, and maintainable data infrastructure.

Outline the following sections:
1. **Phase 1: Discovery & Integration (1 week)**
* Steps for understanding existing vendor setup and data sources (e.g., identifying survey platforms, external APIs).
* Initial collaboration points with current external vendor and internal analysts.
* Tools/scripts for initial data exploration and schema assessment.
2. **Phase 2: ELT Pipeline Development (3-4 weeks)**
* Detailed steps for designing and implementing Python-based data extraction scripts for various sources.
* Strategies for incremental data loading and handling data quality issues.
* Selection and setup of an orchestration framework (Airflow/Dagster) for scheduling and monitoring.
* Steps for building the initial dimensional model (facts, dimensions) in the cloud warehouse using SQL.
3. **Phase 3: Ownership & Maintenance (Ongoing)**
* Strategies for taking full ownership from the external vendor.
* Plans for ongoing data pipeline monitoring, alerting, and error handling.
* Processes for collaborating with research analysts for new data requirements and ad-hoc queries.
* Approaches for optimizing warehouse performance and cost.

For each step, specify potential challenges and how to address them. The output should be a detailed, actionable plan ready for execution.

How you'd build it

1. Familiarize with common survey platforms (e.g., Qualtrics, SurveyMonkey APIs) and external data sources relevant to forecasting.
2. Design a scalable cloud data warehouse schema, likely using a dimensional model, to ingest and store forecasting data.
3. Implement ELT pipelines in Python using a framework like Airflow or Dagster to extract data, load it into the warehouse, and transform it.
4. Set up monitoring and orchestration for the data pipelines and warehouse to ensure data quality and availability.
5. Collaborate with the FRI team to understand specific data needs and integrate with their existing external vendor solutions for the warehouse.
6. Prepare for and complete the short work test and paid 10-hour test as part of the hiring process.
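
The extract, load, and transform steps above can be sketched in a few lines of Python. This is a minimal illustration, not FRI's actual setup: the stdlib `sqlite3` module stands in for the cloud warehouse, and the table names (`raw_responses`, `dim_forecaster`, `dim_question`, `fact_forecast`), the column names, and the sample rows are all hypothetical. It shows an incremental load driven by a timestamp watermark, followed by a SQL transform into a small dimensional model.

```python
import sqlite3

# Hypothetical survey export; real rows would come from a survey platform API.
SOURCE_ROWS = [
    {"response_id": 1, "forecaster": "alice", "question": "Q1",
     "probability": 0.20, "submitted_at": "2024-01-01T10:00:00"},
    {"response_id": 2, "forecaster": "bob", "question": "Q1",
     "probability": 0.35, "submitted_at": "2024-01-02T09:30:00"},
    {"response_id": 3, "forecaster": "alice", "question": "Q2",
     "probability": 0.70, "submitted_at": "2024-01-03T14:15:00"},
]


def create_schema(conn):
    """Create the raw landing table plus a tiny star schema (assumed layout)."""
    conn.executescript("""
        CREATE TABLE IF NOT EXISTS raw_responses (
            response_id INTEGER PRIMARY KEY,
            forecaster TEXT, question TEXT,
            probability REAL, submitted_at TEXT
        );
        CREATE TABLE IF NOT EXISTS dim_forecaster (
            forecaster_key INTEGER PRIMARY KEY AUTOINCREMENT,
            forecaster_name TEXT UNIQUE
        );
        CREATE TABLE IF NOT EXISTS dim_question (
            question_key INTEGER PRIMARY KEY AUTOINCREMENT,
            question_code TEXT UNIQUE
        );
        CREATE TABLE IF NOT EXISTS fact_forecast (
            response_id INTEGER PRIMARY KEY,
            forecaster_key INTEGER REFERENCES dim_forecaster(forecaster_key),
            question_key INTEGER REFERENCES dim_question(question_key),
            probability REAL, submitted_at TEXT
        );
    """)


def extract(rows, watermark):
    """Keep only rows newer than the last loaded timestamp (ISO strings sort correctly)."""
    return [r for r in rows if watermark is None or r["submitted_at"] > watermark]


def load(conn, rows):
    """Land rows untransformed; OR IGNORE keeps reruns idempotent."""
    conn.executemany(
        "INSERT OR IGNORE INTO raw_responses VALUES "
        "(:response_id, :forecaster, :question, :probability, :submitted_at)",
        rows,
    )


def transform(conn):
    """Build dimensions first, then the fact table that references them."""
    conn.execute("INSERT OR IGNORE INTO dim_forecaster (forecaster_name) "
                 "SELECT DISTINCT forecaster FROM raw_responses")
    conn.execute("INSERT OR IGNORE INTO dim_question (question_code) "
                 "SELECT DISTINCT question FROM raw_responses")
    conn.execute("""
        INSERT OR IGNORE INTO fact_forecast
        SELECT r.response_id, f.forecaster_key, q.question_key,
               r.probability, r.submitted_at
        FROM raw_responses r
        JOIN dim_forecaster f ON f.forecaster_name = r.forecaster
        JOIN dim_question q ON q.question_code = r.question
    """)


def run_pipeline(conn, source_rows):
    """Run one incremental ELT pass and return the number of new rows loaded."""
    create_schema(conn)
    watermark = conn.execute("SELECT MAX(submitted_at) FROM raw_responses").fetchone()[0]
    new_rows = extract(source_rows, watermark)
    load(conn, new_rows)
    transform(conn)
    conn.commit()
    return len(new_rows)
```

Calling `run_pipeline(sqlite3.connect(":memory:"), SOURCE_ROWS)` twice on the same connection loads three rows the first time and none the second, which is the property an orchestrator relies on when it retries or reschedules a run. In a real deployment each function would likely become a separate task in Airflow or Dagster, and the transform SQL would run in the warehouse itself.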

Risks & moats

The specific data sources and transformations might require domain-specific knowledge in forecasting that needs rapid learning.
Taking ownership from an external vendor mid-project can be challenging due to differing methodologies or documentation.
The hiring process includes a work test and a paid 10-hour test, which requires a significant time investment without guaranteed success.
Nonprofit budget constraints might lead to limited tooling or resources compared to a for-profit enterprise.

Original context

Forecasting Research Institute (FRI) | Data Engineer | REMOTE | Full-time

We're a ~20-person nonprofit doing forecasting research on high-stakes problems, including AI progress, biosecurity, and nuclear risk. We have dozens of active projects that generate forecasting data from surveys, expert panels, and AI systems.

We're looking for our first dedicated data engineer. You'd start alongside an external vendor extending an existing warehouse, then take full ownership. Concretely, the work would involve building ELT pipelines from survey platforms and external sources into a cloud warehouse, maintaining a dimensional model, collaborating with analysts, and overseeing orchestration & monitoring.

You have solid Python + ETL/ELT, strong SQL and dimensional modeling, cloud warehouse experience. Nice to have: dbt/Airflow/Dagster, prior SWE work, interest in forecasting. Apply even if you don't tick every box!

Conditions:
- 100% Remote (worldwide) / Remote (global)
- 30 days PTO, health insurance contribution
- $75k–$130k, depending on experience
- 3 team retreats/year

Hiring process: short work test → paid 10-hour test → a few interviews.

Apply at https://forecastingresearch.org/careers/
