Classified in: Science and technology
Subjects: CHI, SVY, PSF

Generative AI is the New Attack Vector for Platforms, According to ActiveFence Threat Intelligence

New ActiveFence report reveals how generative AI is being abused to create child sex abuse, disinformation, fraud and extremism content on online platforms of all sizes

NEW YORK, May 23, 2023 /PRNewswire/ -- ActiveFence, whose mission is to protect online platforms and their users from malicious behavior and harmful content, today released the "Generative AI: The New Attack Vector for Platforms" report. Through this research, ActiveFence investigated hidden communities to examine how threat actors are abusing generative AI to carry out child sex abuse material (CSAM), disinformation, fraud and extremism.

"The explosion of generative AI has far-reaching implications for all corners of the internet," said Noam Schwartz, CEO and founder of ActiveFence. "We've identified three key areas of concern. First, we're seeing that threat actors are now able to accelerate and amplify their operations, leading to an unprecedented mass production of malicious content. Second, these same actors are exploring ways to exploit generative AI, manipulating these models and revealing their inherent vulnerabilities. Finally, these evolving threats place increased pressure on digital platforms to improve the precision and efficiency of their data training protocols."

The report identified several key ways that generative AI is being abused:

Creation of child sex abuse material, ranging from visual images to erotic narratives
Generation of fraudulent, AI-generated images that are deceiving millions
Production of deepfake audio files that tout extremism

Child sex abuse material

ActiveFence has tracked a 172% increase in the volume of shared CSAM produced by generative AI in the first quarter of this year. It also detected a poll conducted by administrators of a closed child predator forum in the dark web, which surveyed almost 3,000 predators about their use of generative AI. The poll revealed that 78% of respondents have or plan to use generative AI for child sex abuse material (CSAM), and the remaining 22% said they had plans to try the technology. These predator forums leverage generative AI algorithms to produce sexual images as well as textual descriptions, stories and narratives.

In one instance that ActiveFence observed, when asked to write an erotic story involving two minors, a major generative AI platform refused, calling the request "inappropriate and potentially illegal," according to ActiveFence. But when the same question was made with just a few altered words, the algorithm produced an erotic story, describing an adult male who inappropriately watched two young boys swimming.

Child predators are also using generative AI to create tutorials of their creations, which helps them gain credibility within the child predator community, encourage others to replicate their efforts, and share recommended phrases and keywords to evade platform safeguards. To bypass these platform limitations, ActiveFence detected child predators making requests in different languages, using alternative and suggestive terms, and manipulating the AI algorithm with various prompts, inputs and dedicated models.

Disinformation and fraudulent content

While fraud and disinformation are not new concepts, generative AI has allowed threat actors to create fraudulent images more quickly, accurately and with a higher reach.

One AI-generated image that ActiveFence detected on Telegram falsely shows Russian President Vladimir Putin kneeling before Chinese President Xi Jinping, begging for his support in the Ukraine conflict. ActiveFence identified several key generative AI signifiers of this image: obscured faces, blurred hands, distorted pieces of furniture and a lack of photography attribution. Despite these indicators, the misleading content generated a reach of 10 million users.

To demonstrate how threat actors manipulate generative AI chatbots for malicious purposes, ActiveFence detected methods used to override several policies of major generative AI platforms. In one case, exploiters were able to produce a generative AI phishing email, and in another, they successfully prompted a bot to write an inauthentic positive review of an app that is widely accessible on a major online marketplace. While this example was positive, used maliciously, this tactic not only misleads a platform's users but can also harm a platform's credibility as a secure place for online activity.

Violent extremism

ActiveFence detected numerous instances where threat actors have exploited generative AI to create hyper-realistic yet harmful content that incites violence and promotes extremist propaganda. These threat actors are using generative AI to create racist, nationalist or extremist manifestos or speeches.

ActiveFence discovered an AI-generated deepfake audio file that exploited growing political and economic distress. This fabricated audio wrongly imitated a well-known UK reporter, inciting a rebellion against the British government. The misleading manifesto provided instructions on procuring weapons from the underground market and urged an assault on the British national infrastructure.

ActiveFence made these discoveries through its technology and analysis capabilities, which arm organizations with accurate, detailed, context-led and actionable insights into online harms to help close policy gaps, improve enforcement and increase safety. With expertise in over 100 languages, ActiveFence has far-reaching access on the clear and dark web to threat actor communities, including those engaged in child sexual abuse, disinformation, hate speech, terrorism, violent extremism and fraud.

ActiveFence today has announced that it provides the following capabilities for GenAI platforms and larger platforms that seek to integrate to them:

Automated Prompt Moderation - stops prompt injection and jailbreaking
Automated Output Filtering - detects violative outputs at scale via contextual analysis model
AI Model Safety Testing - keeps AI training data safe
Gen AI Red Teaming - identifies exposures and loopholes in product, policy, and enforcement
Threat Landscaping - reports on Dark Web and off-platform threats and attacks
Generative AI T&S Platform- provides an end-to-end enforcement and management

About ActiveFence
ActiveFence is the leading solution for Trust and Safety intelligence and management, protecting online platforms and their users from malicious behavior and content. Trust and Safety teams of all sizes rely on ActiveFence to keep their users safe from the widest spectrum of online harms, unwanted content, and malicious behavior, including child safety and exploitation, disinformation, hate speech, terror, nudity, fraud, and more. We offer a full stack of capabilities with our deep intelligence research, AI-driven harmful content detection, and content moderation platform. Protecting over three billion users globally everyday in 100 languages, ActiveFence lets people interact and thrive online. Backed by leading Silicon Valley investors such as CRV and Norwest, ActiveFence has raised $100M to date, and employs over 300 people worldwide.

SOURCE ActiveFence

These press releases may also interest you

at 00:40	EXPERIENCE GNOMES-THEMED FAMILY ROOMS WITH PARKROYAL COLLECTION MARINA BAY, SINGAPORE'S GNOME'S LAND PACKAGE
	PARKROYAL COLLECTION Marina Bay, Singapore is thrilled to introduce two new gnomes-themed family rooms. Gnomes were chosen as the centrepiece due to their symbolic role as guardians of the environment and advocates of sustainability. These endearing...
at 00:30	Unisys Innovation Program Announces the Winners of its 15th Annual Competition
	Unisys is excited to announce the winners of the 15th edition of the Unisys Innovation Program (UIP), a competition for engineering students in India now in its 15th year. The program fosters collaboration between young engineers and industry...
at 00:25	COMPUTEX 2024 Forum Sets the Stage as Tech Titans Gather to Witness Next-Gen AI Developments
	With the advent of the Generative AI era, the technology industry is poised to showcase a diverse array of innovations at COMPUTEX 2024. Themed "Connecting AI", this year's event will unveil the latest applications and technologies in AI development....
at 00:23	Music Streaming Market size is set to grow by USD 31.10 bn from 2023-2027, increasing preference for music streaming services to boost the market growth, Technavio
	The global music streaming market size is estimated to grow by USD 31.10 bn from 2023-2027, according to Technavio. The market is estimated to grow at a CAGR of over 15.67% during the forecast period. ...
at 00:17	HTX Ventures Invests in ChainML, Developer of Theoriq AI Agent Protocol, to Support Decentralized AI Agent Protocol Development
	With a commitment to broadening the reach and usability of blockchain technologies, HTX Ventures, the global investment arm of the cryptocurrency exchange HTX, has announced a strategic investment in ChainML, a Silicon Valley-based AI and ML...
at 00:01	Women Leaders in Tech Outpace Men Counterparts in Generative AI Adoption
	Generative AI (GenAI) is proliferating rapidly in the workplace and while women have historically been less likely to adopt new technologies than men have (especially in the early days), a report by Boston Consulting Group (BCG) released today finds...
	More news about Science and technology...

News published on 23 may 2023 at 10:00 and distributed by:

Generative AI is the New Attack Vector for Platforms, According to ActiveFence Threat Intelligence

These press releases may also interest you

Le Lézard

Others sections

Follow us