HN Who's Hiring💵 paid workby cachecoffeescored by google/gemini-2.5-flash

Espresso AI | Staff ML, Staff Infra, FDE | Brooklyn or San Francisco | Full time

Visit project ↗Discussion ↗

💵 Funded work— see “Deliver it” below for how to respond.

Opportunity

AI-buildable

Traction

Creativity

The take

effort: ~3+ months

Espresso AI is a company using LLMs to build neural optimizers, scheduling systems, and workload tuners, initially focused on making data warehouses and Spark jobs more efficient. They are hiring Staff ML, Staff Infra, and Full-Stack Data Engineers (FDEs) for highly technical, user-facing roles. The core idea is to predict and optimize compute resource usage for complex data workloads.

Demand & gap

Painkillerdemand 93

Demand

Will. to pay

Gap

Buyer

Enterprise / orgs

The gap in what exists: Existing resource management tools are often heuristic-based or reactive; the gap is in proactive, ML-driven intelligent optimization and scheduling for complex distributed workloads.
Wedge to win: Target enterprises running large-scale data warehouses and Spark clusters that struggle with high compute costs or performance bottlenecks, offering clear, quantifiable savings and efficiency gains through 'neural optimization'.
Reputation value (worth doing free for proof) — 70/100: Solving a complex, high-value problem for large enterprises would build significant credibility in ML and distributed systems, opening doors for high-end consulting or product partnerships.
Likely monetization: B2B SaaS based on resource savings / performance improvement; per-node or per-job pricing
Incumbents to beat: DatadogNew Relic

Land this role

What they want, where you stand, and the exact résumé edits to qualify.

résumé fit 35

Biggest lever: Gaining practical experience in training and deploying custom ML models for system optimization, not just consuming LLM APIs.

✓ You already have

LLM API usage (Claude/Gemini APIs, Vercel AI SDK)Structured output/tool calling with LLMsPrompt engineeringAgent/automation pipelinesShipping AI-native features into real appsPython for data scripting/ETLDebugging in production (full-stack web apps)Full-stack web development (Next.js, React, Node.js)API designInfra/delivery (Vercel, Docker, GitHub Actions, serverless)User-facing product development (Lumivara)

Deliver it

A starter prompt for Claude Code, what you'll need, and how to reach them.

You are a Staff ML Engineer. Your task is to design and implement a minimal viable product (MVP) for a 'neural optimizer' that predicts compute resource consumption for a simple Spark job based on its configuration and input data size. Focus on the core prediction engine using a basic neural network. Use Python with PyTorch/TensorFlow, scikit-learn for data preprocessing, and FastAPI for a lightweight prediction API. The MVP should accept job parameters (e.g., input data size, number of partitions, Spark executor memory) and output predicted CPU and memory usage, and estimated runtime. Assume synthetic or pre-collected historical data for training; do not worry about real-time telemetry ingestion for this MVP. Define clear data schemas for training and inference. Provide the Python code for the model definition, training loop, and FastAPI endpoint. Include instructions for setting up a virtual environment and running the API. The output should be a single Python file `main.py` and a `requirements.txt`.

Prerequisites — cost & what to learn

Python 3.9+Free · Free✓ in your stack

How you'd build it

1Research and prototype neural network architectures for predicting compute resource needs (CPU, memory, disk I/O) based on job characteristics (SQL queries, Spark job DAGs, data volume).
2Develop data pipelines to collect historical telemetry data from data warehouses and Spark clusters, including job execution logs, resource utilization metrics, and performance counters.
3Implement and train initial ML models using frameworks like PyTorch or TensorFlow, focusing on regression tasks to predict resource consumption and completion times for various workloads.
4Build a robust MLOps pipeline for model deployment, monitoring, and retraining, integrating with existing infrastructure (e.g., Kubernetes, Kafka) for real-time inference and feedback loops.
5Develop a user-facing API and dashboard (Next.js) for ingesting job definitions, receiving optimization recommendations, and visualizing performance improvements and resource savings.
6Engage with pilot users (FDE role) to gather feedback, debug issues in production environments, and iteratively refine models and system integrations.

Risks & moats

Requires deep expertise in ML, distributed systems, and data engineering, which is a significant knowledge gap for a solo operator.
Access to large, diverse, and clean datasets for training complex neural models is critical and hard to acquire without existing enterprise partnerships.
Building robust production-grade infrastructure for real-time inference and integration with diverse customer environments is highly complex.
Proving tangible ROI (cost savings, performance improvements) in complex enterprise data environments requires significant validation and trust.

Original context

Espresso AI | Staff ML, Staff Infra, FDE | Brooklyn or San Francisco | Full time We're using LLMs to build neural optimizers, neural scheduling systems, and neural workload tuners. (If you're ex-Google, you can think of it like Borg powered by LLMs.) Today we use ML to make data warehouses and spark jobs more efficient. We're hiring staff ML engineers to train models that can understand how much compute a job needs, how it scales to larger machines, whether a machine can run more jobs, and so on; and staff infra engineers to take those models and deploy them on real-world production systems. We're also looking for FDEs who can help us talk to users and run pilots. This is a pretty technical role (you need to be able to do data analysis and debug in prod) that's also user-facing - it should be a good fit for a former (or future) technical founder. If this sounds cool, please email me: alexis [at] espresso [dot] ai

Espresso AI | Staff ML, Staff Infra, FDE | Brooklyn or San Francisco | Full time

The take

Demand & gap

Land this role

✓ You already have

Deliver it

Prerequisites — cost & what to learn

How you'd build it

Risks & moats

Original context

You may also want to look at

+ Gaps to close

Résumé bullets to add

Plan to qualify

Setup steps

Reach them