AI Test Engineer
Zams is an AI command center for B2B sales teams. It connects to the tools you already use (Salesforce, HubSpot, Slack, Apollo, Gong, and more) and turns them into one seamless system. Instead of clicking through tools, you just say what you want in English and Zams does it.
We’re a small, scrappy team backed by some of the top venture capital firms in the US, moving fast with a strong bias for action and attention to detail. Joining Zams now means stepping onto the ground floor of a company with a massive mission and momentum. You’ll have the chance to make an outsized impact, see your ideas go from concept to reality in weeks, and play a direct role in shaping how companies everywhere adopt AI. It’s the perfect environment for someone who thrives in fast-paced startups, wants to learn quickly, and is ready to grow alongside a team that’s defining the future of sales.
About You
We’re looking for a creative and curious Backend & AI Agent Testing Engineer to join our team. You’ll work hands-on with our AI agents and backend services, writing code, debugging, scripting tests, and evaluating LLM prompts to ensure our systems behave reliably — even in unpredictable scenarios.
This is not a traditional backend or QA role. You’ll quickly learn new tools, explore unfamiliar apps, and design tests from a user’s perspective to help tame and improve our cutting-edge AI systems.
Responsibilities
Build and Maintain
- Write, debug, and maintain backend code in Python or JavaScript, building test cases and backend scripts to support robust and experimental systems.
- Implement APIs and verify that authentication workflows behave as expected, including across third-party integrations.
AI Agent Testing & Prompt Evaluation
- Design and execute creative test strategies for AI agent behavior, particularly around LLM-based (GenAI) agents, ensuring systems behave reliably despite their unpredictable nature.
- Evaluate AI agent outputs and prompts, contributing to LLM evaluation and metrics using tools like Deepchecks.
Scripting & Automation
- Write scripts and automation to test AI agents and backend workflows.
- Build lightweight automation frameworks and develop or extend test infrastructure; familiarity with Selenium or similar is a plus.
Experimentation & Exploration
- Dive into new, unfamiliar apps and services quickly — learning them and building tests as if you were an end-user.
- Get into the “user’s shoes” to anticipate edge cases and potential failure modes.
- Tame the “beast” — creatively managing and testing AI systems that may behave inconsistently.
Tools & Integrations
- Work hands-on with integrations (e.g., HubSpot, Salesforce; bonus points for experience here) and AI-assisted coding tools like Cursor or Windsurf.
- Test and validate backend workflows that connect names, emails, and other critical user data across systems.
Collaboration & Continuous Learning
- Collaborate closely with cross-functional teams in a startup environment, embracing rapid experimentation and iteration.
- Continuously learn new frameworks, tools, and approaches without handholding, demonstrating a strong growth mindset.
Requirements
- Experience Level: ~0–3 years in backend development or testing, ideally in a startup or experimental role.
- Backend Engineering Basics: Able to write backend code, debug, and write test cases in Python or JavaScript.
- Testing Creativity: Not a traditional tester; you think creatively, experiment boldly, and approach testing like teaching a child.
- AI Agent Experience: Exposure to or experience testing and prompting AI agents, especially GenAI/LLM-based systems.
- Automation & Scripting: Comfortable writing backend scripts, automating tests, and creating testing frameworks.
- Integrations & APIs: Exposure to integrations (bonus if HubSpot, Salesforce), understanding of API interactions and authentication.
- Tools: Familiarity with tools like Cursor or Windsurf preferred.
- Knowledge of modern automation frameworks (e.g., Selenium).
- Experience in B2B product environments.
- LLM Evaluation & Metrics: Bonus if you’ve worked with LLM evaluation frameworks, metrics, or MCPs.
- Mindset:
  - Curious and fast learner: you’re comfortable diving into completely unknown tools or apps and figuring them out.
  - Super creative in designing tests beyond “clicking around”; you understand that testing AI systems means “taming the beast.”
  - Willing to experiment, work on ambiguous problems, and wear multiple hats.
What We Offer
- Unlimited PTO
- Health Benefits
- Work at the intersection of backend engineering, AI, and creative testing.
- A fast-moving, supportive startup culture that values experimentation and creativity.
- Opportunity to work with cutting-edge AI systems, tools, and frameworks.
- Learn and grow rapidly alongside a talented and collaborative team.