I lead the Responsible AI Team at SB Intuitions, working on AI Safety, especially red-teaming and guardrails for LLMs and VLMs. Previously worked on fairness and privacy‑preserving computation. My recent work studies LLM‑as‑a‑Judge biases (verbosity / self‑preference), jailbreak attacks, and LLM fingerprinting.
Email: wataoka.koki@gmail.com
Google Scholar · GitHub · X