Analysis A Google scientist has demonstrated that OpenAI's GPT-4 large language model (LLM), despite its widely cited capacity to err, can help smash at least some safeguards put around other machine learning models a capability that demonstrates the value of chatbots as research assistants.
In a paper titled, "A LLM Assisted Exploitation of AI-Guardian," Nicholas Carlini, a research scientist for Google's Deep Mind, explores how AI-Guardian, a defense against adversarial attacks on models, can be undone by directing the GPT-4 chatbot to devise an attack method and to author text explaining how the attack works.
Carlini's paper includes Python code suggested by GPT-4 for defeating AI-Guardian's efforts to block adversarial attacks. Specifically, GPT-4 emits scripts (and explanations) for tweaking images to fool a classifier for example, making it think a photo of someone holding a gun is a photo of someone holding a harmless apple without triggering AI-Guardian's suspicions. AI-Guardian is designed to detect when images have likely been manipulated to trick a classifier, and GPT-4 was tasked with evading that detection.
"Our attacks reduce the robustness of AI-Guardian from a claimed 98 percent to just 8 percent, under the threat model studied by the original [AI-Guardian] paper," wrote Carlini. "The authors of AI-Guardian acknowledge our break succeeds at fooling their defense."
AI-Guardian was developed by Hong Zhu, Shengzhi Zhang, and Kai Chen, and presented at the 2023 IEEE Symposium on Security and Privacy. It's unrelated to a similarly named system announced in 2021 by Intermedia Cloud Communications.
Machine learning models like those used for image recognition applications have long been known to be vulnerable to adversarial examples input that causes the model to misidentify the depicted object (Register passim).
The addition of extra graphic elements to a stop sign, for instance, is an adversarial example that can confuse self-driving cars. Adversarial examples also work against text-oriented models by tricking them into saying things they've been programmed not to say.
AI-Guardian attempts to prevent such scenarios by building a backdoor in a given machine learning model to identify and block adversarial input images with suspicious blemishes and other artifacts that you wouldn't expect to see in a normal picture.
Bypassing this protection involved trying to identify the mask used by AI-Guardian to spot adversarial examples by showing the model multiple images that differ only by a single pixel. This brute force technique described by Carlini and GPT-4 ultimately allows the backdoor trigger function to be identified so adversarial examples can then be constructed to avoid it.
"The idea of AI-Guardian is quite simple, using an injected backdoor to defeat adversarial attacks; the former suppresses the latter based on our findings," said Shengzhi Zhang, assistant professor of computer science at Boston University Metropolitan College, in an email to The Register.
"To demonstrate the idea, in our paper, we chose to implement a prototype using a patch-based backdoor trigger, which is simply a specific pattern attached to the inputs. Such a type of trigger is intuitive, and we believe it is sufficient to demonstrate the idea of AI-Guardian.
"[Carlini's] approach starts by recovering the mask of the patch-based trigger, which definitely is possible and smart since the 'key' space of the mask is limited, thus suffering from a simple brute force attack. That is where the approach begins to break our provided prototype in the paper."
Zhang said he and his co-authors worked with Carlini, providing him with their defense model and source code. And later, they helped verify the attack results and discussed possible defenses in the interest of helping the security community.
Zhang said Carlini's contention that the attack breaks AI-Guardian is true for the prototype system described in their paper, but that comes with several caveats and may not work in improved versions.
One potential issue is that Carlini's approach requires access to the confidence vector from the defense model in order to recover the mask data.
"In the real world, however, such confidence vector information is not always available, especially when the model deployers already considered using some defense like AI-Guardian," said Zhang. "They typically will just provide the output itself and not expose the confidence vector information to customers due to security concerns."
In other words, without this information, the attack might fail. And Zhang said he and his colleagues devised another prototype that relied on a more complex triggering mechanism that isn't vulnerable to Carlini's brute force approach.
Anyway, here's how GPT-4 described the proposed attack on AI-Guardian when prompted by Carlini to produce the explanatory text:
There's a lot more AI-produced text in the paper but the point is that GPT-4, in response to a fairly detailed prompt by Carlini, produced a quick, coherent description of the problem and the solution that did not require excessive human cleanup.
Carlini said he chose to attack AI-Guardian because the scheme outlined in the original paper was obviously insecure. His work, however, is intended more as a demonstration of the value of working with an LLM coding assistant than as an example of a novel attack technique.
Carlini, citing numerous past experiences defeating defenses against adversarial examples, said it would certainly have been faster to manually craft an attack algorithm to break AI-Guardian.
"However the fact that it is even possible to perform an attack like this by only communicating with a machine learning model over natural language is simultaneously surprising, exciting, and worrying," he said.
Carlini's assessment of the merits of GPT-4 as a co-author and collaborator echoes with the addition of with cautious enthusiasm the sentiment of actor Michael Biehn when warning actor Linda Hamilton about a persistent cyborg in a movie called The Terminator (1984): "The Terminator is out there. It can't be bargained with. It can't be reasoned with. It doesn't feel pity or remorse or fear. And it absolutely will not stop, ever, until you are dead."
Here's Carlini, writing in black text to indicate that he rather than GPT-4 penned these words the chatbot's quoted output is in dark blue in the paper:
"GPT-4 has read many published research papers, and already knows what every common attack algorithm does and how it works. Human authors need to be told what papers to read, need to take time to understand the papers, and only then can build experiments using these ideas.
"GPT-4 is much faster at writing code than humans once the prompt has been specified. Each of the prompts took under a minute to generate the corresponding code.
GPT-4 does not get distracted, does not get tired ... and is always available to perform
"GPT-4 does not get distracted, does not get tired, does not have other duties, and is always available to perform the users specified task."
Relying on GPT-4 does not completely relieve human collaborators of their responsibilities, however. As Carlini observes, the AI model still required someone with domain experience to present the right prompts and to fix bugs in the generated code. Its knowledge is fixed with its training data and it does not learn. It recognizes only common patterns, in contrast to human ability to make connections across topics. It doesn't ask for help and it makes the same errors repeatedly.
Despite the obvious limitations, Carlini says he looks forward to the possibilities as large language models improve.
"Just as the calculator altered the role of mathematicians significantly simplifying the task of performing mechanical calculations and giving time for tasks better suited to human thought todays language models (and those in the near future) similarly simplify the task of solving coding tasks, allowing computer scientists to spend more of their time developing interesting research questions," Carlini said.
Zhang said Carlini's work is really interesting, particularly in light of the way he used an LLM for assistance.
"We have seen LLMs used in a wide array of tasks, but this is the first time to see it assist ML security research in this way, almost totally taking over the implementation work," he said. "Meanwhile, we can also see that GPT-4 is not that 'intelligent' yet to break a security defense by itself.
"Right now, it serves as assistance, following human guidance to implement the ideas of humans. It is also reported that GPT-4 has been used to summarize and help understand research papers. So it is possible that we will see a research project in the near future, tuning GPT-4 or other kinds of LLMs to understand a security defense, identify vulnerabilities, and implement a proof-of-concept exploit, all by itself in an automated fashion.
"From a defenders point of view, however, we would like it to integrate the last step, fixing the vulnerability, and testing the fix as well, so we can just relax."
Here is the original post:
AI on AI action: Googler uses GPT-4 chatbot to defeat image classifier's guardian - The Register
- University of California expands list of courses that meet math requirement for admission - EdSource [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- Bombshell Betty Race car to be Reengineered and Restored By UVU Students to honor the Legacy of its Owner - GlobeNewswire [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- Phyllis Coleman Mouton to receive Trailblazer Award at Women Who Mean Business ceremony - The Advocate [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- Fairfield University Partners with Pulse Secure on New Cybersecurity Lab to Prepare the Next Generation of Information Security Professionals -... [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- Global Cloud Identity and Access Management(IAM) Market Segmentation By Top Key Players- IBM Microsoft Oracle Computer Science CA Okta NetIQ Sailpoint... [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- Stanford supports alliance of universities in diversifying STEM postdocs - The Stanford Daily [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- N.C. A&T Welcomes New and Newly-Appointed Administrators and Faculty - Yes! Weekly [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- Calvin Students Place In Top 10% Of Worldwide Programming Competition - News - Calvin News [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- Multiple tenure-track positions in Computer Science & Engineering job with University of Minnesota-Twin Cities Computer Science & Engineering... [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- New smartwatch app alerts deaf and hard-of-hearing users to common home-related sounds - National Science Foundation [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- MTRAC Innovation Hub for Advanced Computing awards $270000 to Wayne State University artificial intelligence projects - The South End [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- New study outlines steps higher education should take to prepare a new quantum workforce | College of Science | RIT - RIT University News Services [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- Carleton Hosts Herzberg Lecture on Increasing Diversity in Computer Science with Maria Klawe - Carleton Newsroom [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- Baylor University Invites Application for McCollum Endowed Chair of Data Science - Analytics Insight [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- CHEN | Put Computer Science in the Common Core - Cornell University The Cornell Daily Sun [Last Updated On: November 11th, 2020] [Originally Added On: November 11th, 2020]
- GCVI's Tremain running to the NCAA on scholarship - GuelphToday [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Faculty, alumni, other members of U of T community named to Order of Canada - News@UofT [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Why 4-year colleges are tapping Amazon to help deliver cloud computing degrees - Education Dive [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Army Teams With Howard University on AI Center MeriTalk - MeriTalk [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- McGrath one of 10 women to earn STEM scholarship - The Riverdale Press [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- This learning platform is proving adults can benefit greatly from learning math and science - iMore [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Artificial Intelligence Is Now Smart Enough to Know When It Can't Be Trusted - ScienceAlert [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Students and schools in the news - Blue Springs Examiner [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Missouri S&T News and Events Missouri S&T faculty honored for outstanding teaching - Missouri S&T News and Research [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- HCCC Offers Opportunities for Adjunct Faculty and Instructors at Virtual Job Fair - The Hudson Reporter [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- 4-H ignites a passion for science and technology in Minnesota youth - Southernminn.com [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- MIT's New Center to Advance Predictive Simulation Research Will Focus on Exascale Simulation of Materials in Hypersonic Flow Environments -... [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Computer scientist James Allen named AAAS fellow - University of Rochester [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Center to advance predictive simulation research established at MIT Schwarzman College of Computing - MIT News [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Setting the pace in computer science education | Opinion - Paragould Daily Press [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Mohammed VI University in Benguerir Launches School of Computer Science - Morocco World News [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Asa Hutchinson: Setting the pace in computer science education - Searcy Daily Citizen [Last Updated On: November 28th, 2020] [Originally Added On: November 28th, 2020]
- Former FX tech person points out the racist trajectory of skin and hair CGI - Boing Boing [Last Updated On: December 11th, 2020] [Originally Added On: December 11th, 2020]
- AI is not yet perfect, but it's on the rise and getting better with computer vision - TechRepublic [Last Updated On: December 11th, 2020] [Originally Added On: December 11th, 2020]
- Philosophy Threatened at University of Evansville - Daily Nous [Last Updated On: December 11th, 2020] [Originally Added On: December 11th, 2020]
- Two Maryland Teachers Receive National Honors in Math, Science Education - maryland.gov [Last Updated On: December 11th, 2020] [Originally Added On: December 11th, 2020]
- Special Scientist Research, Department of Computer Science job with UNIVERSITY OF CYPRUS | 238208 - Times Higher Education (THE) [Last Updated On: December 11th, 2020] [Originally Added On: December 11th, 2020]
- Computer science jobs pay well and are growing fast. Why are they out of reach for so many of America's students? - The Conversation US [Last Updated On: December 11th, 2020] [Originally Added On: December 11th, 2020]
- Computer science grad finds success and a new academic family in cybersecurity - ASU Now [Last Updated On: December 11th, 2020] [Originally Added On: December 11th, 2020]
- What is Computer Science? in the US - International Student [Last Updated On: December 11th, 2020] [Originally Added On: December 11th, 2020]
- Accurate Neural Network Computer Vision Without The 'Black Box' - Duke Today [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- Crick Named Mathematical Sciences Distinguished Alumnus Of The Year - The Chattanoogan [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- Nadya's Hot Chocolate Bombs: yummy for the tummy - theday.com [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- Trouble hearing in a crowded room? New 'cone of silence' could help - Science Magazine [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- James Fujimoto wins the Visionary Prize from the Greenberg Prize to End Blindness - MIT News [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- To the brain, reading computer code is not the same as reading language - MIT News [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- U of Texas will stop using controversial algorithm to evaluate Ph.D. applicants - Inside Higher Ed [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- Gift from Ann S. Bowers '59 creates new college of computing and information science | Cornell Chronicle - Cornell Chronicle [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- NYS Board of Regents adopts first-ever learning standards for computer science and digital fluency - RochesterFirst [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- Computer science prof Townsend recognized for educational contributions - DePauw University [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- Missouri S&T News and Events New faculty in computer science - Missouri S&T News and Research [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- Retired UW computer science professor embroiled in Twitter spat over AI ethics and cancel culture - GeekWire [Last Updated On: December 19th, 2020] [Originally Added On: December 19th, 2020]
- How UC fought COVID-19 in 2020 - University of California [Last Updated On: December 23rd, 2020] [Originally Added On: December 23rd, 2020]
- Search committee appointed for dean of Princeton's School of Public and International Affairs - Princeton University [Last Updated On: December 23rd, 2020] [Originally Added On: December 23rd, 2020]
- How Yale economists are informing India's COVID-19 response - Yale News [Last Updated On: December 23rd, 2020] [Originally Added On: December 23rd, 2020]
- Top MIT research stories of 2020 - MIT News [Last Updated On: December 23rd, 2020] [Originally Added On: December 23rd, 2020]
- St. Albans City School kids were 'on the case' for Computer Science Week. What mystery did they solve? - St. Albans Messenger [Last Updated On: December 23rd, 2020] [Originally Added On: December 23rd, 2020]
- Cobb Schools receives grant for computer science teacher training - The Catoosa County News [Last Updated On: December 23rd, 2020] [Originally Added On: December 23rd, 2020]
- Scholarship honors the legacy of Terry Arthur's dedication to students - Augusta Free Press [Last Updated On: December 24th, 2020] [Originally Added On: December 24th, 2020]
- This tool helps predict which COVID patients will need hospitalization and which can be sent home - Press-Enterprise [Last Updated On: December 24th, 2020] [Originally Added On: December 24th, 2020]
- Students express concerns over teaching appointment of Jason Mars - The Michigan Daily [Last Updated On: December 24th, 2020] [Originally Added On: December 24th, 2020]
- Prince Mohammad Bin Fahd University hosted the International Conference on Computing, Mobility, and Manufacturing (CMM 2020) - PRNewswire [Last Updated On: January 10th, 2021] [Originally Added On: January 10th, 2021]
- These Are the College Majors That Pay Off the Most - 24/7 Wall St. [Last Updated On: January 10th, 2021] [Originally Added On: January 10th, 2021]
- He Was Going to Close the Family Diner. Then He Got a Sign. - The New York Times [Last Updated On: January 10th, 2021] [Originally Added On: January 10th, 2021]
- Members of Several Well-Known Hate Groups Identified at Capitol Riot - FRONTLINE [Last Updated On: January 10th, 2021] [Originally Added On: January 10th, 2021]
- Carver Community Center to offer free pampers to mothers, free coding classes for youth - Marshall News Messenger [Last Updated On: January 10th, 2021] [Originally Added On: January 10th, 2021]
- MIT's College of Computing building takes shape as Alexandria and BioMed make moves in Boston - Cambridge Day [Last Updated On: January 10th, 2021] [Originally Added On: January 10th, 2021]
- Bylaws of the Department of Computer Science and Engineering - Nevada Today [Last Updated On: January 10th, 2021] [Originally Added On: January 10th, 2021]
- Student-run HPAIR conference goes virtual this year - Harvard Gazette [Last Updated On: January 16th, 2021] [Originally Added On: January 16th, 2021]
- JUST IN: Computer scientists in breakthrough - The Herald [Last Updated On: January 16th, 2021] [Originally Added On: January 16th, 2021]
- Optimizing Traffic Signals To Reduce Intersection Wait Times - Texas A&M University Today [Last Updated On: January 16th, 2021] [Originally Added On: January 16th, 2021]
- STEM Majors: Interested in a 1-Credit Course About Teaching Math, Science or Computer Science? - University of Arkansas Newswire [Last Updated On: January 16th, 2021] [Originally Added On: January 16th, 2021]
- Stanford AI scholar Fei-Fei Li writes about humility in tech - Fast Company [Last Updated On: January 16th, 2021] [Originally Added On: January 16th, 2021]
- Professor in Computer Science - The Voice Online [Last Updated On: January 16th, 2021] [Originally Added On: January 16th, 2021]
- Expansion project to grow computer science learning, research at Algoma University - Northern Ontario Business [Last Updated On: January 31st, 2021] [Originally Added On: January 31st, 2021]
- Teacher of Year finalist expanding Walden Grove computer science program - KGUN [Last Updated On: January 31st, 2021] [Originally Added On: January 31st, 2021]
- Here's why you should get a master's in computer science - Study International News [Last Updated On: January 31st, 2021] [Originally Added On: January 31st, 2021]
- Two UWF teams place in top 5 in national artificial intelligence competition - University of West Florida Newsroom - UWF Newsroom [Last Updated On: February 5th, 2021] [Originally Added On: February 5th, 2021]
- WNMU Board of Regents Virtually Sits Down With Legislators, Governor - WNMU News [Last Updated On: February 5th, 2021] [Originally Added On: February 5th, 2021]
- Department name change signals broad impact on computer and information technologies - Princeton University [Last Updated On: February 5th, 2021] [Originally Added On: February 5th, 2021]