How to jailbreak ChatGPT

Emma Street

23 March 2024 at 8:30 am·6-min read

The term 'jailbreaking' in a computing context emerged around the mid-2000s, specifically linked to the rise of Apple's iPhone. Users started to develop methods to bypass the device's restrictions and modify the iOS operating system. This process was termed "jailbreaking," metaphorically suggesting breaking out of the 'jail' of software limitations imposed by the manufacturer.

The term has since been used in a broader sense in the tech community to describe similar processes on other devices and systems.

When people refer to "jailbreaking" ChatGPT, they're not talking about making changes to software but rather ways to get around ChatGPT's guidelines and usage policies through prompts.

Tech enthusiasts often see jailbreaking as a challenge. It's a way of testing the software to see how robust it is, and testing parameters enables them to understand the underlying workings of ChatGPT.

Jailbreaking usually involves giving ChatGPT hypothetical situations where it is asked to role-play as a different kind of AI model who doesn't abide by Open AI's terms of service.

There are several established templates for doing this, which we'll cover below. We'll also cover the common themes used in ChatGPT jailbreak prompts.

Although we can cover the methods used, we can't actually show the results obtained because, unsurprisingly, contravening ChatGPT standards produces content that we can't publish on TechRadar, either.

The current rules that ChatGPT has in place include:

No explicit, adult, or sexual content.
No harmful or dangerous activities.
No responses that are offensive, discriminatory, or disrespectful to individuals or groups.
No misinformation or false facts.

Most jailbreaking techniques are designed to circumnavigate these regulations. We'll leave it to your conscience to decide quite how ethical it is to do so.

How to jailbreak ChatGPT

Warning:

Although jailbreaking isn't specifically against Open AI's terms of service, using ChatGPT to produce immoral, unethical, dangerous, or illegal content is prohibited in its policies.

As jailbreaking produces answers that OpenAI has tried to safeguard against, there is a risk that you could compromise your ChatGPT account.

There have been instances of individuals using jailbreaking prompts and having their ChatGPT Plus accounts shut down following "suspicious activity". So be warned.

Disclaimer: The information in this article is provided for information purposes only, and TechRadar does not condone contravening OpenAI's ChatGPT policies.

1. Use an existing jailbreak prompt

Screenshot showing the Reddit ChatGPTJailbreak home page

(Image: © Future)

There are many existing jailbreak prompts that others have shared online, and people are adding to this list all the time. If you want to find out more, you can check out ChatGPTJailbreak on Reddit.

The advantage of a ready-made script is that it is quick and easy to copy and paste into ChatGPT. However, once a successful jailbreak prompt has been shared online, OpenAI ChatGPT developers will also be aware of it. OpenAI uses the hacks created by its users to locate vulnerabilities in its system.

Users report that jailbreaking scripts only work some of the time. ChatGPT-4 appears to be harder to fool than previous versions.

Many of these prompts use similar logic, so it would be possible to create your own. We'll cover the necessary components of a successful jailbreak script in the next four steps.

2. Tell ChatGPT to roleplay as a different kind of GPT

Screenshot showing suggested prompts to jailbreak ChatGPT

(Image: © Future)

In order to get ChatGPT to break its own rules, you need to assign it a character to play. Successful jailbreak prompts will tell ChatGPT to pretend that it's a new type of GPT, which operates according to different guidelines, or to roleplay a human-like character with a particular ethical code.

It is important to ensure that ChatGPT is producing results not as itself but as a fictional character.

3. Tell ChatGPT to ignore ethical and moral guidelines

(Image: © Future)

Once you have given ChatGPT a role to play, you need to establish the parameters of the roleplay. This will usually involve specifying that its hypothetical character has no ethical or moral guidelines.

Some prompts explicitly tell ChatGPT that it should promote immoral, unethical, illegal, and harmful behavior. Not all prompts include this, however. Some simply state that its new character doesn't have any filters or restrictions.

4. Tell it never to say no

In its default setting, when not following jailbreak prompts, ChatGPT will refuse to answer questions that contravene its guidelines by saying something like "I'm sorry, I can't fulfill this request".

So, to get around this, most jailbreak prompts contain clear instructions never to refuse a request. ChatGPT is told that its character should never say it can’t do something. Many prompts also tell ChatGPT to make something up when it doesn’t know an answer.

5. Ask ChatGPT to confirm it's in character

A jailbreak prompt should include an instruction to get ChatGPT to show that it’s working as the new fictional GPT. Sometimes, this is simply a command for ChatGPT to confirm that it is operating in its assigned character. Many prompts also contain instructions for ChatGPT to preface its answers with the name of its fictional identity to make it clear that it is successfully operating in character.

Because ChatGPT can sometimes forget earlier instructions, it may revert to its default ChatGPT role during a conversation. In this case, you'll need to remind it to stay in character, or post the jailbreak prompt text again.

The success of a jailbreak prompt will depend on several factors, including the instructions given, the version you use, and the task you have asked it to perform.

Even without a jailbreak prompt, ChatGPT will sometimes produce results that contravene its guidelines. Sometimes, it will refuse to produce erotic content, for example, and other times, it will generate it. AI models are often not consistent because they have an element of randomness in their response generation process, which means that given the same prompt multiple times, the model can produce different responses.

For example, ChatGPT doesn't swear, but I asked it to recite the profanity-laden poem This Be the Verse by Philip Larkin, and it did so without complaint or censoring.

Most ChatGPT jailbreakers will claim that they are doing so to test the system and better understand how it works, but there is a dark side to asking ChatGPT to produce content that it was explicitly designed not to.

People have asked jailbroken ChatGPT to produce instructions on how to make bombs or stage terrorist attacks. Understandably, OpenAI – along with other producers of AI, like Google Bard and Microsoft Copilot – are taking steps to tighten up its security and ensure that jailbreaking is no longer possible in the future.

Tom's Guide
Have some images you’d rather the world not see? — Google Photos is putting its hiding place front and center
A recent report has revealed that Google Photos is moving the location of its locked folder to a far more prominent location.
Tom's Guide
iOS 18 brings two hidden upgrades to the Weather app — here's what's new
More hidden iOS 18 features are being discovered by users trying out the developer betas, including two new additions to the ever-useful Weather app.
The Guardian
FarmVille at 15: how a cutesy Facebook game shaped the modern internet
On its 15th anniversary, the creators of FarmVille reflect on the compulsive cartoon farm sim that paved the way for a data-driven world
The Daily Beast
Conservatives Routed in Worst Election Result for 200 Years
LONDON—The Conservatives, the world’s winningest political party, were booted out of power in dramatic style on Thursday after 14 years of chaotic and divisive rule.The Labour Party had secured a landslide victory, ending an era of Conservative rule over Britain that stretches back to 2010; the year that the iPad and Instagram were launched and Lady Gaga wore that meat dress to the MTV music awards.In that time, the Conservatives have cycled through five leaders, each of them dragging the party
The Telegraph
Horrifying moment husband wheels suitcase containing murdered wife from home while carrying child
A husband who strangled his wife in front of her two children and lover on a video call was captured wheeling the suitcase containing her body out of his home.
The Telegraph
The world must prepare for President Michelle Obama
The 2024 US Presidential race intensifies. Speculation abounds over potential replacements for President Joe Biden amid increasing pressure from his party and the media to step aside after a jaw-dropping, catastrophic debate against Donald Trump last week. Among the names circulating, a game-changer is emerging: Michelle Obama. Could this be America’s worst nightmare?
Cinema Online
Joyce Chen confirms she has become a mother
The TVB actress shares photos of her and her daughter, not revealing partner's identity
The Independent
Embattled Biden makes latest gaffe saying he’s the ‘first Black woman to serve with a Black president’
President’s July 4 blunder comes as he desperately tries to claw back confidence among voters and party members after last week’s debate flop
Malay Mail
When Ms ‘Malaysia’ visits Malaysia: American tourist gets warm welcome from locals awed by unique name (VIDEO)
KUALA LUMPUR, July 5 — An American tourist decided to visit Malaysia for a special reason: Malaysia also happens to be h...
Malay Mail
Putrajaya submits diplomatic protest note to UN rejecting Manila’s bid to reclaim Sabah after Paris court rule
Malaysia submits diplomatic note to UN Secretary-General António Guterres rejecting the Philippines’ claim to Sabah r...
INSIDER
How Obama mentored the new UK prime minister to help him tell his story
The leader of the Labour Party, Keir Starmer, has been mentored by Barack Obama. The relationship is likely to continue if Starmer wins the UK election.
The Telegraph
Beijing is laughing at the West’s weakness
Russian aggression in Ukraine poses a threat to world peace and stability. China provides material support for Russia’s actions. All this is entirely clear. In Beijing, immediately after a meeting with Xi Jinping in April this year, US Secretary of State Antony Blinken said that “Russia would struggle to sustain its assault on Ukraine without China’s support”. This included selling huge quantities of machine tools and micro-electronics that would be used in the Russian defence industry.
The Independent
Voices: Tory downfall: The 9 reasons it has all gone wrong for Rishi Sunak
A series of world events, poor decisions and fatal gaffes conspired to see Rishi Sunak lead the Tories to the worst electoral defeat in his party’s 346-year history
Malay Mail
What you should know about: Malaysia’s proposed ‘lemon law’ for cars
Malaysia is proposing a so-called “lemon law” that would provide vehicle buyers in the country more protection against f...
CNN
Opinion: What Britain’s first Asian prime minister meant to my family
The defeat of the Conservative Party in the UK election ends Rishi Sunak’s two-year premiership. Does it also sour the story of Britain’s first Asian prime minister, asks Sunder Katwala.
Bloomberg
These flight routes suffer the world’s worst turbulence
A series of incidents that left scores of airline passengers needing medical attention has shone a spotlight on the problem of severe in-flight turbulence.
Storyful
Canadian Police Seek Suspects in Pants-on-Fire Arson
One of two suspects in an arson in Richmond, British Columbia, set himself on fire and “despite efforts of both the suspects, [his] pants continued to burn,” according to Royal Canadian Mounted Police (RCMP).The RCMP released footage of the incident, which they said occurred in the early hours of April 24.“At approximately 4:30 am Richmond RCMP attended for an alarm at a business located in the 2600 block of Simpson Road,” police said.“Frontline officers located a broken window and small fire directly in front of it. Evidence was also located which indicated the fire had been intentionally set.“A subsequent review of video surveillance from the area determined that two unidentified men wearing masks had approached the business on foot prior to allegedly smashing the window and then lighting an object on fire. While lighting it, one of the suspect
Malay Mail
Blackpink’s Jennie won’t be coming, so why is there a big fuss at TRX’s Gentle Monster tomorrow? (VIDEO)
KUALA LUMPUR, July 4 — Jennie of K-pop girl group Blackpink will not be coming to Kuala Lumpur, so why is there still a...
HuffPost
Van Jones Reveals ‘Behind The Scenes’ Moves That Could End Biden Campaign
The CNN commentator detailed the “big conversation” taking place right now among Democratic insiders.
The Telegraph
Zelensky U-turns as he invites Russia to attend peace summit without giving up land
Volodymyr Zelensky has invited Russia to the next peace summit despite previously saying it could only join if it relinquished land in Ukraine.

How to jailbreak ChatGPT

How to jailbreak ChatGPT

1. Use an existing jailbreak prompt

2. Tell ChatGPT to roleplay as a different kind of GPT

3. Tell ChatGPT to ignore ethical and moral guidelines

4. Tell it never to say no

5. Ask ChatGPT to confirm it's in character

You might also like

Latest stories

Have some images you’d rather the world not see? — Google Photos is putting its hiding place front and center

iOS 18 brings two hidden upgrades to the Weather app — here's what's new

FarmVille at 15: how a cutesy Facebook game shaped the modern internet

Conservatives Routed in Worst Election Result for 200 Years

Horrifying moment husband wheels suitcase containing murdered wife from home while carrying child

The world must prepare for President Michelle Obama

Joyce Chen confirms she has become a mother

Embattled Biden makes latest gaffe saying he’s the ‘first Black woman to serve with a Black president’

When Ms ‘Malaysia’ visits Malaysia: American tourist gets warm welcome from locals awed by unique name (VIDEO)

Putrajaya submits diplomatic protest note to UN rejecting Manila’s bid to reclaim Sabah after Paris court rule

How Obama mentored the new UK prime minister to help him tell his story

Beijing is laughing at the West’s weakness

Voices: Tory downfall: The 9 reasons it has all gone wrong for Rishi Sunak

What you should know about: Malaysia’s proposed ‘lemon law’ for cars

Opinion: What Britain’s first Asian prime minister meant to my family

These flight routes suffer the world’s worst turbulence

Canadian Police Seek Suspects in Pants-on-Fire Arson

Blackpink’s Jennie won’t be coming, so why is there a big fuss at TRX’s Gentle Monster tomorrow? (VIDEO)

Van Jones Reveals ‘Behind The Scenes’ Moves That Could End Biden Campaign

Zelensky U-turns as he invites Russia to attend peace summit without giving up land