
The biggest weakness of the ChatGPT webapp is the inability to tinker with the system prompt, which is what really controls how ChatGPT behaves and can correct many of the undesired behaviors people keep complaining about.

This is most likely implemented as system prompt engineering on the backend, so hopefully, by finally giving people a mechanism to customize output significantly, it will also make them realize that prompt engineering is not a meme. It also explains why it's limited to paying customers: it can be used to work around certain safety constraints, and I am skeptical that this new tool is hardened enough against that.



When I saw the headline, I was hoping this was the ability to tinker with the system prompt, or some sort of switch between "Lobotomized Response" and "Not Lobotomized Response" for people who want "safe" responses and those who don't want filters. Going to stick with open source for now until there are un-lobotomized commercial offerings.


The API lets you set whatever system prompt you want and could be considered an un-lobotomized version; it's nowhere near as likely to return "safe" responses when you use the API.


A message with role=system gets added to the chat context and doesn't get a response. It's just a message.

You can just say the same thing as role=user and I think it has the same effect, though the assistant will answer to confirm it.

When building custom apps, I use the system role to load in context and retrieval results; the user doesn't see it (see the sketch at the end of this comment).

But I think the user could just say "belay that order!" and take over.

I think they intend to change this.

Anybody else know more about this?
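
A minimal sketch of the call shape with the 2023-era openai Python library (the model name, retrieved context, and prompt text here are placeholders I made up, not anything specific):

    # pip install openai  (0.x-era client)
    import openai

    retrieved_context = "..."  # e.g. chunks pulled from a vector store (hypothetical)

    response = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[
            # The system message is just another message in the chat context,
            # but it carries instructions/context the end user never sees.
            {"role": "system",
             "content": "Answer using only this context:\n" + retrieved_context},
            {"role": "user", "content": "What does the document say about pricing?"},
        ],
    )
    print(response["choices"][0]["message"]["content"])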


It's not just any message; it's a special message (in that vocabulary) that can set the tone for subsequent assistant messages, for example. Generally, pre-GPT-4 (so GPT-3.5, for example), the system prompt wasn't as strong as it is with GPT-4. The model pays closer attention to whatever you put in the system prompt than to other user messages.

For example, you can use the system message to force the assistant to always return JSON, instead of just plain text. Then, whatever the user asks for, it'll always output a JSON message, even if they ask for YAML. You might have to tune the system prompt to be resilient to user requests, but GPT-4 is a lot better at this than GPT-3.

> The system message helps set the behavior of the assistant. For example, you can modify the personality of the assistant or provide specific instructions about how it should behave throughout the conversation. However note that the system message is optional and the model’s behavior without a system message is likely to be similar to using a generic message such as "You are a helpful assistant."

https://platform.openai.com/docs/guides/gpt/chat-completions...
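
As a concrete sketch (the exact wording of the system prompt is my own, not from the docs):

    import openai

    response = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[
            # The system message pins the output format.
            {"role": "system",
             "content": "You are an API. Respond only with valid JSON, never plain "
                        "text, no matter what format the user asks for."},
            # The user tries to override it...
            {"role": "user", "content": "List the three largest countries, in YAML please."},
        ],
    )
    # ...but GPT-4 will typically still answer in JSON; GPT-3.5 is much
    # easier to talk out of it.
    print(response["choices"][0]["message"]["content"])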


Is that a special call? The API by default still gives me safe responses.


You have to do some intense prompt engineering in the system prompt. The model treats the system role as (roughly) trusted; some of the early 2023 exploits with ChatGPT still work if you do them in the system prompt.


What’s the best option for unfiltered?


Another element that's missing is control over output sampling. LLMs don't actually produce text; they produce a word probability map. Essentially a huge table of every single word (token) they know and the probability of it coming next. You run the model again and again to get each next word. You don't have to pick the most probable word each time; doing that is called greedy decoding. You can randomize a bit and pick one of the less likely words if they have similar probabilities, which sometimes makes the output "more creative". There are also more advanced ways of steering the model, like maintaining a list of candidate sentences (beam search) and switching from one to another if it scores better, or running a smaller model on the output so far to judge whether the answer is becoming inappropriate, etc.

Output decoding is a huge lever for controlling the answers, and most users aren't even aware it exists. It's one of the reasons comparing "naked" open source models to ChatGPT is unfair: ChatGPT has all these extras on top.
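
A toy illustration of greedy decoding versus temperature/top-k sampling over a single step's probability table (tiny made-up vocabulary and numbers):

    import numpy as np

    rng = np.random.default_rng(0)

    # One decoding step: a probability for every token in the vocabulary.
    vocab = ["the", "a", "cat", "dog", "pizza"]
    probs = np.array([0.40, 0.35, 0.15, 0.07, 0.03])

    # Greedy decoding: always take the single most probable token.
    greedy = vocab[int(np.argmax(probs))]          # "the", every time

    def sample(probs, temperature=1.0, top_k=None):
        # Temperature reshapes the distribution: >1 flattens it
        # ("more creative"), <1 sharpens it toward greedy.
        logits = np.log(probs) / temperature
        if top_k is not None:
            # Keep only the top_k most likely tokens, drop the rest.
            cutoff = np.sort(logits)[-top_k]
            logits = np.where(logits >= cutoff, logits, -np.inf)
        p = np.exp(logits - logits.max())
        p /= p.sum()
        return vocab[rng.choice(len(vocab), p=p)]

    print(greedy)
    print(sample(probs, temperature=1.5, top_k=3))  # sometimes "a" or "cat" instead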


I've heard "prompt engineering" described as just knowing how to craft prompts well.

I hope we converge on a better definition than that soon. Maybe the name can change too.


I think the people who know what they're doing will probably come up with quantitative ways to evaluate the performance of prompts. That's where the real engineering will come in. You could even have ChatGPT generate prompts for you and keep the best-performing ones (sketch below).

Any moron can make up a prompt and say "yeah looks good enough".
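
For instance, a hypothetical evaluation loop: score each candidate prompt against a small labeled test set and keep the winner (the test cases, candidate prompts, and scoring rule here are all made up):

    import openai

    def ask_model(system_prompt, question):
        r = openai.ChatCompletion.create(
            model="gpt-4",
            messages=[{"role": "system", "content": system_prompt},
                      {"role": "user", "content": question}],
        )
        return r["choices"][0]["message"]["content"]

    # Tiny labeled test set; a real one would be much larger.
    test_cases = [("What is 2+2?", "4"),
                  ("What is the capital of France?", "Paris")]

    # Candidate prompts -- these could themselves be generated by ChatGPT.
    candidates = ["You are a terse assistant. Answer in one word.",
                  "Answer as briefly and accurately as possible."]

    def score(prompt):
        # Fraction of test cases where the expected answer appears in the reply.
        hits = sum(expected.lower() in ask_model(prompt, q).lower()
                   for q, expected in test_cases)
        return hits / len(test_cases)

    best = max(candidates, key=score)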


It's just social engineering, codified. Even in that context it feels cheap.

Lawyers do the same thing as "prompt engineers": they command mastery of English to achieve a desired outcome. We don't call them Litigation Engineers.


We call them Wordsmiths.


I’ll take it. I spend about half my time developing/promptsmithing and the other half lawyering. “Wordsmith” sure beats some of the other lawyer epithets out there


So prompt engineers could be promptsmiths?


I think "prompt craft" is a better term. Engineering implies that there is a quantifiable aspect, and by their very nature LLMs are somewhat inscrutable black boxes, so "craft" feels more appropriate.


prompt-fu, like Google-fu? I have a friend who’s gotten very good at prompt-fu.


I think that's a bit of a mouthful for casual use. I like "prompting" as an equivalent to "googling", although it is a bit more dependent on context.


It's really a form of rhetoric tailored for LLMs.

> Rhetoric aims to study the techniques writers or speakers utilize to inform, persuade, or motivate particular audiences in specific situations


LLM whisperers?



