BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Department of Computer Science - ECPv6.16.2//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Department of Computer Science
X-ORIGINAL-URL:https://www.cs.jhu.edu
X-WR-CALDESC:Events for Department of Computer Science
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20250309T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20251102T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20260308T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20261101T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20270314T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20271107T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20260115T121500
DTEND;TZID=America/New_York:20260115T133000
DTSTAMP:20260520T061353
CREATED:20260106T160520Z
LAST-MODIFIED:20260106T160520Z
UID:1992777-1768479300-1768483800@www.cs.jhu.edu
SUMMARY:CS Seminar Series: Making Robust AI Safeguards Run Deep
DESCRIPTION:Refreshments are available starting at 12:15 p.m. The seminar will begin at 12:30 p.m. \nAbstract\nIn 2025\, frontier AI developers started warning that their AI systems were beginning to cross risk thresholds for dangerous cyber\, chemical\, and biological capabilities. This is unfortunate given how closed-weight AI systems are persistently vulnerable to prompt-injection attacks and open-weight systems are persistently vulnerable to malicious fine-tuning. Reinforcement learning from human feedback and refusal training aren’t enough. This presentation will focus on adversarial attacks that target model internals and their uses for making frontier AI safeguards “run deep.” In particular\, we will focus on what technical tools can help us make open-weight AI systems safer. Along the way\, we will discuss what AI safety can learn from the design of lightbulbs and why you should keep a close eye on Arkansas Attorney General Tim Griffin in 2026. \nSpeaker Biography\nStephen “Cas” Casper is a final-year PhD student at the Massachusetts Institute of Technology in the Algorithmic Alignment Group\, where is he advised by Dylan Hadfield-Menell. Casper leads a research stream for the MATS Program and mentors for ERA and GovAI. He is also a writer for the International AI Safety Report and the Singapore Consensus on Global AI Safety Research Priorities. Casper’s research focuses on AI safeguards and governance\, with features in the Conference on Neural Information Processing Systems; the Association for the Advancement of Artificial Intelligence Conference on Artificial Intelligence; Nature; the ACM Conference on Fairness\, Accountability\, and Transparency; the Conference on Empirical Methods in Natural Language Processing; the Institute of Electrical and Electronics Engineers Conference on Secure and Trustworthy Machine Learning; Transactions on Machine Learning Research; and the Iranian Scholars Chapter of the Association for Information Systems Annual Conference on Information Systems—as well as in a number of workshops and over 20 press articles and newsletters. Learn more on his Google Scholar page or personal website. \nZoom link »
URL:https://www.cs.jhu.edu/event/cs-seminar-series-making-robust-ai-safeguards-run-deep/
LOCATION:228 Malone Hall
CATEGORIES:Seminars and Lectures
END:VEVENT
END:VCALENDAR