HN Who's Hiring💵 paid workby jaimemedicalbnbscored by google/gemini-2.5-flash

Railtown AI | Head of Developer Relations | REMOTE (North America preferred) / Vancouver, BC | Full-time | $130k-$180k +

Visit project ↗Discussion ↗

💵 Funded work— see “Deliver it” below for how to respond.

Opportunity

AI-buildable

Traction

Creativity

The take

effort: ~3+ months

Railtown AI is hiring a Head of Developer Relations for their platform focused on AI-powered software development, including developer productivity, backend infrastructure for agents, a Python framework, and evaluation workflows. The role involves shaping the developer narrative and driving adoption through content, demos, docs, community, and open-source growth. This is a full-time, well-paid role with equity.

Demand & gap

Painkillerdemand 82

Demand

Will. to pay

Gap

Buyer

Enterprise / orgs

The gap in what exists: Enterprises and professional teams are actively seeking robust, production-grade tools for AI agent development, observability, and evaluation, as many existing solutions are fragmented or immature.
Wedge to win: Focus on a niche within agent evaluation or observability (e.g., specific compliance needs, very high-throughput agents) for enterprises willing to pay for specialized tooling.
Reputation value (worth doing free for proof) — 85/100: Building a credible, open-source tool or framework for AI agent evaluation or production deployment would establish the operator as a leading voice in the rapidly growing AI engineering space, attracting potential clients and partners.
Likely monetization: B2B seat-based SaaS, usage-based
Incumbents to beat: LangChainLlamaIndex

Land this role

What they want, where you stand, and the exact résumé edits to qualify.

résumé fit 40

Biggest lever: Gain dedicated experience in developer advocacy, community building, and scaling a technical audience.

✓ You already have

AI engineering: Claude / Gemini APIs, structured output / tool calling, RAG & embeddings, prompt engineering, agent/automation pipelinesWeb / full-stack: TypeScript, Next.js, React, Node.jsPythonInfra / delivery: Docker, GitHubClear communication (implied by solo product ownership and client interaction via SAP)Technical content creation (via product documentation/marketing of Lumivara)Ability to write code

+ Gaps to close

Developer Advocacy / DevRel / Technical Product Marketing (5+ years of dedicated experience)Scaling a developer platform or technical audience (dedicated experience)

Deliver it

A starter prompt for Claude Code, what you'll need, and how to reach them.

You are an expert full-stack developer. Your task is to conceptualize and outline the MVP for a 'solo-operator friendly' version of an AI agent evaluation and observability platform, inspired by the problem space Railtown AI addresses, but sharply niched for a solo founder. Focus on a specific pain point in agent evaluation that is underserved by existing open-source and commercial tools. The goal is to build a web-based tool where users can upload or define an AI agent's input/output pairs, run predefined or custom evaluation metrics (e.g., correctness, latency, cost), and visualize the results over time.

Your MVP should target the following:

1. **Core Functionality**: Allow users to define a 'test suite' for an AI agent. This includes:
* Input data (e.g., user prompts, context).
* Expected output or criteria for success (e.g., regex match, sentiment score threshold, human feedback collection point).
* Integration points for various LLMs (Anthropic Claude, OpenAI, Gemini via AI SDK v6) and potentially custom agent endpoints.
2. **Evaluation Metrics**: Implement basic metrics like:
* Latency (response time).
* Token usage (input/output counts).
* Correctness (against expected output/criteria).
* Cost estimation per run.
3. **Visualization**: A simple dashboard showing evaluation results over time for an agent, highlighting regressions or improvements.
4. **Tech Stack**: Next.js 16 App Router (React 19), Tailwind v4 for UI, AI SDK v6 with Gemini for LLM integrations, Neon Postgres for data storage. Use Prisma ORM. Deploy to Vercel.

**MVP Slice**: Focus purely on the 'Agent Evaluation' aspect. Users should be able to define a single agent, upload a small CSV of test cases, run evaluations, and see a basic tabular/chart summary of latency, tokens, and a 'pass/fail' based on simple output matching criteria. Assume the agent endpoint is a user-provided URL.

**Build Gate**: A user can define an agent, upload 10 test cases, run evaluations, and see a summary table within 5 minutes. The UI should be responsive and visually clear.

How you'd build it

1Analyze Railtown AI's existing products (Conductr, Railengine, Railtracks, Agent Evaluations) to understand their core value proposition and gaps.
2Develop a specialized AI agent evaluation framework as a standalone product, focusing on specific metrics or agent types currently underserved.
3Build a Python SDK/framework that integrates with popular AI model providers (e.g., Anthropic, OpenAI) and offers superior developer experience for agent development.
4Create a 'mini-platform' focusing on observability specifically for AI agent interactions and performance, distinct from general APM tools.
5Target indie hackers and small dev shops building AI agents by offering a generous free tier or open-sourcing key components to gain initial adoption.

Risks & moats

Building a comprehensive AI development platform is a massive undertaking, far beyond a solo operator's scope for competitive parity.
The market for AI agent development and evaluation is rapidly evolving and highly competitive, with many well-funded players.
Requires deep expertise in AI/ML, MLOps, and developer tools, which goes beyond typical web application development.
Successfully driving adoption for a new developer platform requires significant DevRel and community building effort, which is difficult for a solo operator.

Market it to your portfolio

fit 75

Agent Eval LabMCP Kitaimon

The audience for Railtown AI (developers building and evaluating AI agents) is a direct fit for the operator's agent-eval-lab, mcp-kit, ai-usage-monitor, and forge-kit. The operator could showcase how their tools enhance or complement aspects of the Railtown AI stack or address niche gaps.

Original context

Railtown AI | Head of Developer Relations | REMOTE (North America preferred) / Vancouver, BC | Full-time | $130k-$180k + equity | https://employmenthero.com/en-ca/jobs/position/railtown-ai-h... Railtown AI is building a unified platform for modern AI-powered software development. Our products span developer productivity engineering, backend infrastructure for agent and app development, a Python framework for production-ready agents, and structured evaluation workflows for responsible AI deployment. We're looking for a Head of Developer Relations to help make Railtown AI the default stack for building, observing, and evaluating AI agents. You'll own the developer narrative across Conductr, Railengine, Railtracks, and Agent Evaluations, then turn that story into adoption through technical content, demos, docs, community programs, open source growth, and direct feedback loops with product and engineering. Good fit if you've spent 5+ years in developer advocacy, DevRel, technical product marketing, or a similar developer-facing role; have scaled a developer platform or technical audience; understand AI tooling, agents, and observability; and can write code, demo live, and communicate

Railtown AI | Head of Developer Relations | REMOTE (North America preferred) / Vancouver, BC | Full-time | $130k-$180k +

The take

Demand & gap

Land this role

✓ You already have

+ Gaps to close

Deliver it

How you'd build it

Risks & moats

Market it to your portfolio

Original context

You may also want to look at

Résumé bullets to add

Plan to qualify

Prerequisites — cost & what to learn

Setup steps

Reach them