AI QA Research Practice
Independent practice run by Carlos García in Monterrey. I build evaluation workflows, observe real human-AI interactions, and publish everything openly on GitHub.
Current Focus
Inspect, Promptfoo, custom harnesses.
Claude API + Playwright, end to end.
Observed interaction patterns from real AI-assisted working sessions.
File and line, not vibes.
Public Artifacts
Claude API + Playwright pipeline. User story in, test plan, Playwright specs, and bug report out.
Field observations on AI-human behavioral patterns from real working sessions. 20 years of QA practice applied to AI systems.
Runnable evaluations derived from observed patterns. Work in progress.
About
20 years in QA. Now focused on testing, evaluating, and validating AI systems in production. Holteck is where I run the experiments, build the tools, and publish what works. Everything goes on GitHub. Based in Monterrey, Mexico. Interested in AI Product QA, LLM Evaluation, and RLHF/Quality problems. Remote only.