Time: 2025-04-26 13:02:06 Source: compiled from the web Editor: Fashion
Just when you started coming to terms with ChatGPT's eerie capabilities, OpenAI dropped a new version of its AI language model.
OpenAI says GPT-4 is much more advanced than GPT-3.5, which powers ChatGPT. And to prove it, they made GPT-4 sit down for a bunch of exams. OpenAI tested GPT-4 on a variety of standardized tests, from high school to graduate to professional level, spanning mathematics, science, coding, history, literature, and even the one you take to become a sommelier. The exams consisted of multiple-choice and free-response questions, and GPT-4 was scored using the standard methodology for each exam.
Put your pencil down, GPT-4, it's time to check your scores.
GPT-4 didn't just get into law school, it passed the bar. The AI language model scored in the 88th percentile on the LSAT (Law School Admission Test) and did even better on the bar (Uniform Bar Exam), scoring in the 90th percentile. By comparison, GPT-3.5 scored in the 40th percentile on the LSAT and the 10th percentile on the bar.
GPT-4 took both the math and reading/writing sections of the SAT and all three sections of the GRE, which are broken down into quantitative, verbal, and writing skills. It scored at or above the 80th percentile in every section except the writing section of the GRE... which it kind of bombed, landing in the 54th percentile.
The quintessential overachiever, GPT-4 also took all the AP (Advanced Placement) high school exams. It aced most of them, scoring between the 84th and 100th percentiles, except for a few outliers.
GPT-4 scored in the 44th percentile in AP English Language and a measly 22nd percentile in AP English Literature. So all you wordsmiths out there might have some more time before GPT-4 replaces you. GPT-4 didn't do so hot on AP Calculus BC either, scoring between the 43rd and 59th percentiles, proving that even for a supercomputer, calculus is not easy. But that still earns GPT-4 a four, so it might still place out of college calculus.
GPT-4 still has some work to do on its coding skills, which is curious since one of its marketed uses is helping developers. Its rating on Codeforces, which hosts competitive programming events, is 392, which puts it way down in the Newbie category, the tier for any rating below 1200.
It did pretty well on the easy level of LeetCode (31 out of 41 problems solved) but struggled at the medium and hard levels of difficulty (21/80 and 3/45 respectively). As we saw in the developer demo livestream, GPT-4 is fully capable of writing Python, but it required some manual tweaking to set the right parameters, which might explain some of these test scores. Or maybe it didn't eat breakfast that morning.
GPT-4 passed the sommelier exams with flying colors. It placed lowest (77th percentile) in the most advanced sommelier exam. But for a non-human entity that's never tasted wine, we'll let that one slide.
OpenAI has released a full breakdown of how GPT-4 performed. GPT-4 might not write the next great American novel...yet, but GPT-4's future as a mathematically brilliant lawyer and wine connoisseur looks pretty bright.