Skip to content

prox-technologies/prox-challenge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Prox Founding Engineer Challenge

Vulcan OmniPro 220 Vulcan OmniPro 220 — inside panel

The Product

The Vulcan OmniPro 220 is a multiprocess welding system sold by Harbor Freight. It supports four welding processes (MIG, Flux-Cored, TIG, and Stick), runs on both 120V and 240V input, and has an LCD-based synergic control system.

Its owner's manual is 48 pages of dense technical content. Duty cycle matrices across multiple voltages and amperages, polarity setup procedures that differ per welding process, wire feed mechanisms with specific tensioner calibrations, wiring schematics, troubleshooting matrices, weld diagnosis diagrams, and a full parts list.

This is exactly the kind of product Prox exists for. Nobody knows how to use this machine straight out of the box but has time to read 48 page manual, but a complicated machine needs expert-level support.

Additional video: https://www.youtube.com/watch?v=kxGDoGcnhBw

Your Job

Build a multimodal reasoning agent for the Vulcan OmniPro 220 using the Claude Agent SDK. The agent must be able to answer deep technical questions about this product accurately, helpfully, and not just in text.

The manuals are in the files/ directory.

There is no limit to how far you can go. You can integrate voice. You can build a full interactive experience. Sky is the limit. The more ambitious and polished, the better.

What We're Testing

1. Deep Technical Accuracy

Your agent needs to answer questions like these correctly:

  • "What's the duty cycle for MIG welding at 200A on 240V?"
  • "I'm getting porosity in my flux-cored welds. What should I check?"
  • "What polarity setup do I need for TIG welding? Which socket does the ground clamp go in?"

We will test with questions that require cross-referencing multiple manual sections, understanding visual content (diagrams, schematics, charts), and handling ambiguous questions that need clarification from the user.

2. Multimodal Responses

This is the most important part. Your agent must not be text-only.

  • If someone asks about polarity setup, the agent should draw or show a diagram of which cable goes in which socket, not just describe it.
  • If the answer relates to a specific image in the manual (the wire feed mechanism, the front panel controls, the weld diagnosis examples), the agent should surface that image.
  • If a question is complex enough, the agent should generate interactive content: a duty cycle calculator, a troubleshooting flowchart, a settings configurator that takes process + material + thickness and outputs recommended wire speed and voltage.

When something is too cognitively hard to explain in words, the agent should draw it. Real-time diagrams, interactive schematics, visual walkthroughs generated through code.

For your agent to handle these responses well you need to reverse engineer Claude artifacts. Here are two places where you can start:

3. Tone and Helpfulness

Imagine your user just bought this welder and is standing in their garage trying to set it up. They're not an idiot, but they're not a professional welder either.

4. Knowledge Extraction Quality

The manual has a mix of text, tables, labeled diagrams, schematics, and decision matrices. Some critical information exists only in images (the welding process selection chart, the weld diagnosis photos, the wiring schematic). We want to see that your agent understands and presents the visual content, not just the text.

Tech Requirements

  • Use the Anthropic Claude Agent SDK as the foundation for your agent.
  • The project must run locally with a single API key provided via .env.
  • You are responsible for your own API costs during development.

How to Present Your Work

This matters. Your submission is not just the code — it's how you present it.

  • Build a frontend. The best way for us to evaluate your agent is if it has a clean, simple UI we can run immediately. This is realistically the only way to properly demo an agent like this.
  • Hosting is a plus. If you host it somewhere we can access without cloning, that's a strong signal. Not required, but it removes friction and shows initiative.
  • Write a clear README. Explain how your agent works, what design decisions you made, how knowledge is extracted and represented, and how to run it. Your documentation will be evaluated — we want to see how you think and communicate, not just how you code.
  • Video walkthrough is a huge plus. Record yourself demoing the agent and explaining your approach. Walk through the hard questions, show how it handles multimodal responses, explain your architecture. This gives us a much richer picture of your work than code alone.

We should be running your agent within 2 minutes of cloning your repo:

git clone <your-fork>
cd <your-fork>
cp .env.example .env   # we plug in our own Anthropic API key
# your install command (npm install, uv install, etc.)
# your run command (npm run dev, python app.py, etc.)

If it takes longer than that to set up, that's a problem.

What to Submit

  1. Fork this repo.
  2. Build your solution.
  3. Submit your fork URL through the form at useprox.com/join/challenge.

What Happens Next

We review submissions on a rolling basis and respond to every single one within a few days. Good luck.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages