时间:2025-04-26 13:04:55 来源:网络整理编辑:休閑
Just when you started coming to terms with ChatGPT's eerie capabilities, OpenAI dropped a new versio
Just when you started coming to terms with ChatGPT's eerie capabilities, OpenAI dropped a new version of its AI language model.
OpenAI says GPT-4 is much more advanced than GPT-3, which powers ChatGPT. And to prove it, they made GPT-4 sit down for a bunch of exams. OpenAI tested GPT-4 with a variety of standardized tests from high school to graduate to professional level and spanning across mathematics, science, coding, history, literature, and even the one you take to become a sommelier. The exams were comprised of multiple choice and free-response question and GPT-4 was scored using the standard methodology for each exam.
SEE ALSO:How to get access to GPT-4 right nowPut your pencil down, GPT-4, it's time to see check your scores.
GPT-4 didn't just get into law school, it passed the bar. The AI language model scored in the 88th percentile on the LSATs (Law School Admission Test) and did even better on the Bar (Uniform Bar Exam) by scoring in the 90th percentile. By comparison, GPT-3 was in the bottom 40 percent of the LSATs and 10 percent on the Bar.
GPT-4 took both the math and reading/writing sections of the SATs and all three sections of the GREs which are broken down into quantitative, verbal, and writing skills. It scored in the 80th or 90th percentile of all sections except for the writing section of the GREs... which it kind of bombed in the 54th percentile.
The quintessential overachiever, GPT-4 also took allthe AP (Advanced Placement) high school exams. It aced most of them, scoring between the 84th and 100th, except for a few outliers.
GPT-4 scored 44th in AP English Language and a measly 22nd in AP English Literature. So all you wordsmiths out there might have some more time before GPT-4 replaces you. GPT-4 didn't do so hot on AP Calculus BC scoring between 43rd and 59th, proving that even for a supercomputer, calculus is not easy. But that still earns GPT-4 a four, so it might still place out of college calculus.
GPT-4 still has some work to do with its coding skills, which is curious since one of its marketed uses is for helping developers. Its rating for Codeforces, which hosts competitive programming events, is 392, which puts it way down in the Newbie category of anything below 1199.
It did pretty well on the easy level of the Leetcode (31 out of 41 problems solved) but struggled when it came to medium or hard level of difficulty (21/80 and 3/45 respectively). As we saw in the developer demo livestream, GPT-4 is fully capable of writing Python, but required some manual tweaking to set the right parameters, which might explain some these test scores. Or maybe it didn't eat breakfast that morning.
GPT-4 passed the sommelier exams with flying colors. It placed lowest (77th percentile) in the most advanced sommelier exam. But for a non-human entity that's never tasted wine, we'll let that one slide.
OpenAI has released a full breakdown of how GPT-4 performed. GPT-4 might not write the next great American novel...yet, but GPT-4's future as a mathematically brilliant lawyer and wine connoisseur looks pretty bright.
TopicsArtificial IntelligenceChatGPT
17 questions you can answer if you're a good communicator2025-04-26 12:49
足協正加緊研究廣州城抗議函 此前曾通知僅郭田雨滿足條件2025-04-26 12:49
曝國安輸球後謝峰徹夜難眠 一旦再有閃失將會更加複雜2025-04-26 12:48
體能教練海切曼加盟國安 曾在河北隊輔佐謝峰2025-04-26 12:35
Tyler, the Creator helped Frank Ocean celebrate 'Blonde' release in a delicious way2025-04-26 12:34
中衛連續兩場踩踏染紅 衛冕冠軍壓力大致心態失衡2025-04-26 12:04
足協官方 :徐新嚴重犯規+踢碎玻璃門 停賽2場罰款2萬2025-04-26 11:37
姆皇登基太後垂簾 巴黎三叉戟都被爹媽包辦了人生2025-04-26 10:34
Two astronauts just installed a new parking spot on the International Space Station2025-04-26 10:25
大連人VS廣州城首發:林良銘領銜 童磊李提香登場2025-04-26 10:21
How Hyperloop One went off the rails2025-04-26 13:00
若塔評1億歐鋒霸 :我不能說啥 範迪克:他是當世前52025-04-26 12:54
薩拉赫已告知密友加盟巴薩 他和馬內到底誰搭萊萬 ?2025-04-26 12:29
足協罰單 :鄭錚暴力行為停賽4場 罰款人民幣4萬元2025-04-26 12:28
Airbnb activates disaster response site for Louisiana flooding2025-04-26 12:11
滬媒:不管是先前的犯規還是之後鼓掌 石柯必會受到追加處罰2025-04-26 11:51
紅牌天天見 !中超開賽連續五個比賽日出現紅牌2025-04-26 10:40
曼聯紐卡最接近於簽下努涅斯 本菲卡標價6700萬鎊2025-04-26 10:35
Balloon fanatic Tim Kaine is also, of course, very good at harmonica2025-04-26 10:35
武漢長江VS河北隊首發:雙外援PK全華班 福布斯出戰2025-04-26 10:27