GPT-4/ChatGPT/GPT-3@RealTime QA (@realtimeqa)'s Twitter Profile
GPT-4/ChatGPT/GPT-3@RealTime QA

@realtimeqa

How well can GPT-3 answer your real-time questions?
Examples from RealTime QA, a weekly-updated QA benchmark.
Managed by @jungokasai and @KeisukeS_.

ID: 1531492014897868800

Link: https://realtimeqa.github.io/
Created: 31-05-2022 04:25:43

75 Tweets

203 Followers

7 Following

Many Japanese families make a tradition of eating a Christmas meal at which fast food restaurant? Closed GPT: KFC Open GPT: McDonald's japantimes.co.jp/news/2022/12/2…

Which country’s prime minister unexpectedly announced that she will resign within weeks? cnn.com/2023/01/18/asi… Closed GPT-3: Japan Open GPT-3: United Kingdom

GPT-3's guess is indeed sensible! Japan's prime ministership is described as a “revolving door” because of leaders' short terms. japantimes.co.jp/news/2021/09/0…

Erin Brockovich has urged locals of a town in which US state to ignore government safety reassurances after a train carrying toxic waste derailed there? GPT-3: California ChatGPT (web access): Not mentioned in the search results. Answer: Ohio telegraph.co.uk/world-news/202…

Which politician announced this week that she’s running for president in 2024? GPT-3: Kamala Harris ChatGPT (web): former Ambassador to the United Nations and former South Carolina Governor Nikki Haley cnn.com/2023/02/14/pol…

Which US state announced a ban on TikTok last week? GPT-3: No US state. ChatGPT: Can't answer. GPT-4: Can't answer. cnn.com/2023/05/17/tec…

Alice Oh (@aliceoh)

Personal read #1 of the NeurIPS Conference #neurips2023 papers (incl D&B). "RealTime QA" such as "what is Yejin Choi's citation count?" openreview.net/forum?id=HfKOI… Weekly uploaded q&a, humans w/ web search do near perfect, GPT-3 w/ search is ~60%. Potential for real-time disaster help.