Back to All Events

Conversational AI Safety: GPT-4

  • Fureai Space Nishishinjuku 1-19-2 Seiko Building 5F Shinjuku City, Tokyo, 160-0023 Japan (map)

*** English follows Japanese ***


午後のひととき、形式にとらわれない気軽な会話を楽しみませんか。コーヒー、スナック、プロンプトを用意し、新しいアイデアについて新鮮な顔ぶれと話すことができるようにします:

イメージはDreamStudio betaで起こしました

  • GPTを何に使ったことがありますか?校正、プログラミング、詩、陶芸?

  • GPTの信頼性についてどう思いますか?

  • GPTに対して、どんな新しい規制や法律が提案されていますか?その背景は?

  • GPTから予見されるリスク:大量誤報?技術的な失業?みんな授業でAを取れるようになる?

  • AGI研究の一時停止を求めるFLIの公開書簡

  • GPTをどのように改善すればいいのですか?何が足りないのですか?

スケジュール

13:00–13:15: 歓迎
13:15–14:00: 10分間のLightning Talks
14:30–16:00: 会話と交流


Join us for an afternoon of mostly unstructured casual conversation. We’ll provide coffee, snacks and prompts, to get you talking to some fresh faces about new ideas:

Image generated using DreamStudio beta

  • For what have you used GPT? Proofreading, programming, poetry, pottery?

  • How reliable have you found GPT?

  • What new regulations and laws are being passed in response to GPT? Why?

  • Foreseeable risks from GPT: Mass misinformation? Technological unemployment? Everyone gets an A in their coursework?

  • FLI’s open letter to pause AGI research

  • How can we improve on GPT? What is it missing?

Schedule

13:00–13:15: Welcome, mingle
13:15–14:00: 10-minute lightning talks
14:30–16:00: Conversation & networking


Access instructions for the venue.

We demonstrate that, beyond its mastery of language, GPT-4 can solve novel and difficult tasks that span mathematics, coding, vision, medicine, law, psychology and more, without needing any special prompting. Moreover, in all of these tasks, GPT-4’s performance is strikingly close to human-level performance, and often vastly surpasses prior models such as ChatGPT. Given the breadth and depth of GPT-4’s capabilities, we believe that it could reasonably be viewed as an early (yet still incomplete) version of an artificial general intelligence (AGI) system.

"Sparks of Artificial General Intelligence: Early experiments with GPT-4", Bubeck et al. 2023

First, we highlight safety challenges presented by the model’s limitations (e.g., producing convincing text that is subtly false) and capabilities (e.g., increased adeptness at providing illicit advice, performance in dual-use capabilities, and risky emergent behaviors). […] Finally, we demonstrate that while our mitigations and processes alter GPT-4’s behavior and prevent certain kinds of misuses, they are limited and remain brittle in some cases. This points to the need for anticipatory planning and governance.

"GPT-4 System Card", OpenAI 2023

I think that we are hearing the last winds start to blow, the fabric of reality start to fray. This thing alone cannot end the world, but I think that probably some of the vast quantities of money being blindly and helplessly piled into here are going to end up actually accomplishing something.

Eliezer Yudkowsky @ Bankless Podcast 2023

Previous
Previous
11 March

TEDxOtemachi

Next
Next
25 June

Conversational AI Safety: Is AI as dangerous as pandemics? AIはパンデミックと同じくらい危険なのか?