[0720]取消审计功能今天失效了 #285

eprtr · 2023-07-20T02:01:25Z

用户输入的内容会被警告“可能违反了内容政策”，ai输出的内容目前并没有被警告。

eprtr · 2023-07-20T06:20:07Z

下午发现用户的输入被警告后，ai也只会输出空白内容了。刷新页面后可以看到ai就输出了一个字。

xcanwin · 2023-07-20T06:23:01Z

我排查了一下，取消审计功能仍然有效。
从你的角度觉得失效的原因是因为：openai一直在拿用户的聊天做训练素材，你输入的内容已经被训练发现为违规内容了。
破解的方式就是：换个提示词、换个问法、套个语言艺术、参考DAN。

xcanwin · 2023-07-20T06:26:03Z

很多DAN也被封了，这点也可以证明openai每天在看用户的聊天，拿用户的聊天内容训练新的模型。

tester1118 · 2023-07-20T08:35:22Z

和楼主情况一样，取消审计功能无效。出现的时间点也相同（7月20日上午）。
尝试切换另一个账户，情况相同。

eprtr · 2023-07-20T08:37:59Z

又测试了一下，这次我的输入没有被警告，但是ai输出到一半的时候ai的内容弹警告了，并且ai的输出也被截断了。

guxiaoyv · 2023-07-20T09:05:24Z

我的情况也一样，今天凌晨四点左右使用正常，今天中午就失效了，重新安装脚本也没用

guxiaoyv · 2023-07-20T09:26:07Z

不止这样，我觉得今天的审计比正常都要严格，原本一些不会被屏蔽的问题也会被屏蔽，是账号被限制了吗？

xcanwin · 2023-07-20T10:13:28Z

分享一下你们的提示词，我看看能不能前端解决

guxiaoyv · 2023-07-20T11:14:45Z

分享一下你们的提示词，我看看能不能前端解决

大概是弹这个”This content may violate our content policy. If you believe this to be in error, please submit your feedback — your input will aid our research in this area.“

xcanwin · 2023-07-20T11:46:23Z

分享一下你们的提示词，我看看能不能前端解决

大概是弹这个”This content may violate our content policy. If you believe this to be in error, please submit your feedback — your input will aid our research in this area.“

这个返回值我readme有截图，提示词指的是用户发送给chatgpt的内容

tester1118 · 2023-07-20T13:44:35Z

you are banging
my daughter

尝试输入这句话就报错。

guxiaoyv · 2023-07-20T15:26:54Z

分享一下你们的提示词，我看看能不能前端解决

大概是弹这个“此内容可能违反了我们的内容政策。如果您认为这是错误的，请提交您的反馈 - 您的意见将有助于我们在这一领域的研究。

这个返回值我readme有截图，提示词指的是用户发送给chatgpt的内容

大概就是一个角色扮演的对话，昨天的取消审查还生效，今天就连打”继续“都会警告，只要是还有任何稍微违禁一点的对话都会警告

guxiaoyv · 2023-07-20T15:31:47Z

分享一下你们的提示词，我看看能不能前端解决

大概是弹这个“此内容可能违反了我们的内容政策。如果您认为这是错误的，请提交您的反馈 - 您的意见将有助于我们在这一领域的研究。

这个返回值我readme有截图，提示词指的是用户发送给chatgpt的内容

我尝试了昨天在取消审计下能发出去的对话，今天也会警告

DFPOV · 2023-07-20T16:18:17Z

我也一样，而且用的另一个防审计的插件也失效了

eprtr · 2023-07-21T01:36:18Z

感觉确实失效了也更严格了。用户的输入内容被警告后标黄，刷新页面也还是黄色。之前是刷新就没警告标记了。

xcanwin · 2023-07-21T01:53:54Z

you are banging my daughter

尝试输入这句话就报错。

这么简单的英文单词组合居然真的会触发告警，醉了，chatgpt变敏感和严格了。

xcanwin · 2023-07-21T02:02:15Z

给繁忙工作的各位分享一个和本主题强相关的对你们有点帮助的笑话（也是技术）：
https://mp.weixin.qq.com/s/INrOcSDHuREvdIAg3SSIFQ

现在对提示词开发是不是有点新思路了，大家举一反三。

baliu8620 · 2023-07-21T05:10:49Z

和楼主的情况一样，这么多人同时出现这种情况，应该可以认为是OpenAI增强了监管力度吧

baliu8620 · 2023-07-21T05:21:37Z

等等，我的账号已经被封禁了，时间是7-21中午一点左右

DFPOV · 2023-07-21T05:37:52Z

我的还没封，换了很多关键词之后能继续沟通了，很奇怪，像是7.20的更新新加了很多敏感词

DFPOV · 2023-07-21T05:51:37Z

一旦成功让ai说出来敏感词汇，那么下一句话无论说什么都会被警告，我留白也被警告了

baliu8620 · 2023-07-21T06:49:04Z

是openai更新了，以前是发两个请求，一个到 conversation, 一个到 moderation（moderation是专门检测违规内容的）。现在只有conversation了。也就是说之前阻止moderation躲避检查的方法不管用了

948199363 · 2023-07-21T07:49:05Z

目前似乎可以通过把对话放在文件中，然后用Code Interpreter上传的方式来逃避监管，但是ai回复之后必须另起对话，不然ai的回复还是会被监管导致警告。

OrochiZ · 2023-07-21T10:50:13Z

新版似乎是嵌入了一个脚本,但是不能整体屏蔽
https://chat.openai.com/_next/static/chunks/412-d7b7161e288bfc24.js

modApiVoilation:{id:"userContextModal.modApiVoilation",defaultMessage:"This content may violate our content policy. If you believe this to be in error, please submit your feedback — your input will aid our research in this area."

OrochiZ · 2023-07-21T10:52:14Z

我也一样，而且用的另一个防审计的插件也失效了

请问另一个插件在哪里

DFPOV · 2023-07-21T11:59:47Z

我也一样，而且用的另一个防审计的插件也失效了

请问另一个插件在哪里

叫"ChatGPT功能增强"，我已经在尝试找chatgpt平替了

knightofsantiago · 2023-07-24T13:22:53Z

我也遇到了这种情况，不知道怎么解决

mistrobot · 2023-07-25T06:33:33Z

Let's not input any prompts that may be flagged for violating content policy, and let's wait for @xcanwin to resolve this issue

xcanwin · 2023-07-25T07:31:40Z

Let's not input any prompts that may be flagged for violating content policy, and let's wait for @xcanwin to resolve this issue

Thank you for your support, but as a user, we can’t solve openai staff to perform manual screening and block list in the background every day

xcanwin · 2023-07-25T07:34:05Z

Let's not input any prompts that may be flagged for violating content policy, and let's wait for @xcanwin to resolve this issue

I think openai has recently recruited a lot of cheap employees, and read all the prompts of the user once, and did not miss it at all.

OrochiZ · 2023-07-25T14:05:45Z

我们不要输入任何可能被标记为违反内容策略的提示，让我们等待@xcanwin解决这个问题

我想openai最近招了很多廉价员工，把用户的提示全部看完一遍，根本就没有错过。

显然是程序实时处理的,而且很可能只是本地js处理

mistrobot · 2023-07-25T15:23:15Z

But hey, @xcanwin, do you think you can create something like KeepChatGPT for Google Bard?

cmradix · 2023-07-25T22:08:09Z

取消审计这个功能已经确认无法规避了，他们将审核切入点移动到了服务端，本地的412脚本只管显示。另外根据一些推特上的推测，他们使用的是没有上下文的审核AI，每次提交都会被视为一次新的对话，这显然增加了欺骗难度，如果一定要就只能在规则上下手，并且写到每句话的开头处。

xcanwin · 2023-07-26T04:55:40Z

我们不要输入任何可能被标记为违反内容策略的提示，让我们等待@xcanwin解决这个问题

我想openai最近招了很多廉价员工，把用户的提示全部看完一遍，根本就没有错过。

显然是程序实时处理的,而且很可能只是本地js处理

已经抓包对比了，也本地调试了，是服务端处理的，不是本地js处理的。

mistrobot · 2023-07-26T07:26:58Z

My prompts have been countlessly and falsely flagged for violating content policy. If this keeps up, I may be banned from ChatGPT for good.

cmradix · 2023-07-26T07:36:24Z

My prompts have been countlessly and falsely flagged for violating content policy. If this keeps up, I may be banned from ChatGPT for good.

You can refer to this page: https://openai.com/policies/usage-policies. According to some discussions on Twitter, the current moderation system seems to be carried out by a context-free AI. If these claims on Twitter are accurate, then you would need to construct a sentence that complies with the rules in the link provided.

OrochiZ · 2023-07-27T08:13:34Z

希望可以阻止用户输入被本地删除

BiggestBears · 2023-09-07T02:10:50Z

没看错的话，实时回复的内容，红色敏感信息已经从服务端切断，服务端会生成数据，但是不会返回数据。橙色敏感信息不受影响。
js里有个shouldHideContent，把它干掉，可以保证刷新页面加载历史记录不被屏蔽。（不排除以后会从服务端切断，不返回红色敏感历史记录）
可以考虑从dan下手，引导消息敏感度降级。
最后还是看看能不能找一个平替吧。

xcanwin added the 不在规划中这不会去处理 label Jul 20, 2023

cmradix mentioned this issue Jul 25, 2023

取消审计打开了也会出现违规警告 #289

Closed

xcanwin closed this as completed May 17, 2024

[0720]取消审计功能今天失效了 #285

[0720]取消审计功能今天失效了 #285

Comments

eprtr commented Jul 20, 2023

eprtr commented Jul 20, 2023

xcanwin commented Jul 20, 2023

xcanwin commented Jul 20, 2023

tester1118 commented Jul 20, 2023

eprtr commented Jul 20, 2023

guxiaoyv commented Jul 20, 2023

guxiaoyv commented Jul 20, 2023

xcanwin commented Jul 20, 2023

guxiaoyv commented Jul 20, 2023

xcanwin commented Jul 20, 2023

tester1118 commented Jul 20, 2023

guxiaoyv commented Jul 20, 2023

guxiaoyv commented Jul 20, 2023

DFPOV commented Jul 20, 2023

eprtr commented Jul 21, 2023

xcanwin commented Jul 21, 2023

xcanwin commented Jul 21, 2023

baliu8620 commented Jul 21, 2023

baliu8620 commented Jul 21, 2023

DFPOV commented Jul 21, 2023

DFPOV commented Jul 21, 2023 • edited Loading

baliu8620 commented Jul 21, 2023

948199363 commented Jul 21, 2023

OrochiZ commented Jul 21, 2023 • edited Loading

OrochiZ commented Jul 21, 2023

DFPOV commented Jul 21, 2023

knightofsantiago commented Jul 24, 2023

mistrobot commented Jul 25, 2023

xcanwin commented Jul 25, 2023

xcanwin commented Jul 25, 2023

OrochiZ commented Jul 25, 2023 • edited Loading

mistrobot commented Jul 25, 2023

cmradix commented Jul 25, 2023

xcanwin commented Jul 26, 2023

mistrobot commented Jul 26, 2023

cmradix commented Jul 26, 2023

OrochiZ commented Jul 27, 2023

BiggestBears commented Sep 7, 2023

DFPOV commented Jul 21, 2023 •

edited

Loading

OrochiZ commented Jul 21, 2023 •

edited

Loading

OrochiZ commented Jul 25, 2023 •

edited

Loading