Time: 2026-03-18 12:40:58 Source: compiled from the web Editor: Fashion
Just when you started coming to terms with ChatGPT's eerie capabilities, OpenAI dropped a new version of its AI language model.
OpenAI says GPT-4 is much more advanced than GPT-3.5, which powers ChatGPT. And to prove it, they made GPT-4 sit down for a bunch of exams. OpenAI tested GPT-4 on a variety of standardized tests, from high school to graduate to professional level, spanning mathematics, science, coding, history, literature, and even the one you take to become a sommelier. The exams consisted of multiple-choice and free-response questions, and GPT-4 was scored using the standard methodology for each exam.
Put your pencil down, GPT-4, it's time to check your scores.
GPT-4 didn't just get into law school, it passed the bar. The AI language model scored in the 88th percentile on the LSAT (Law School Admission Test) and did even better on the bar (Uniform Bar Exam), scoring in the 90th percentile. By comparison, GPT-3.5 was in the bottom 40 percent on the LSAT and the bottom 10 percent on the bar.

GPT-4 took both the math and reading/writing sections of the SAT and all three sections of the GRE, which is broken down into quantitative, verbal, and writing skills. It scored in the 80th percentile or higher on every section except the writing section of the GRE... which it kind of bombed, landing in the 54th percentile.
The quintessential overachiever, GPT-4 also took all the AP (Advanced Placement) high school exams. It aced most of them, scoring between the 84th and 100th percentiles, except for a few outliers.
GPT-4 scored in the 44th percentile in AP English Language and a measly 22nd in AP English Literature. So all you wordsmiths out there might have some more time before GPT-4 replaces you. GPT-4 didn't do so hot on AP Calculus BC either, scoring between the 43rd and 59th percentiles, proving that even for a supercomputer, calculus is not easy. But that still earns GPT-4 a four, so it might still place out of college calculus.
GPT-4 still has some work to do on its coding skills, which is curious since one of its marketed uses is helping developers. Its rating on Codeforces, which hosts competitive programming events, is 392, which puts it way down in the Newbie category, reserved for ratings below 1199.
It did pretty well on the easy-level Leetcode problems (31 out of 41 solved) but struggled at the medium and hard levels of difficulty (21/80 and 3/45 respectively). As we saw in the developer demo livestream, GPT-4 is fully capable of writing Python, but it required some manual tweaking to set the right parameters, which might explain some of these test scores. Or maybe it didn't eat breakfast that morning.
GPT-4 passed the sommelier exams with flying colors. It placed lowest (77th percentile) in the most advanced sommelier exam. But for a non-human entity that's never tasted wine, we'll let that one slide.
OpenAI has released a full breakdown of how GPT-4 performed. GPT-4 might not write the next great American novel...yet, but GPT-4's future as a mathematically brilliant lawyer and wine connoisseur looks pretty bright.