Index Investing News
Tuesday, May 26, 2026
No Result
View All Result
  • Login
  • Home
  • World
  • Investing
  • Financial
  • Economy
  • Markets
  • Stocks
  • Crypto
  • Property
  • Sport
  • Entertainment
  • Opinion
  • Home
  • World
  • Investing
  • Financial
  • Economy
  • Markets
  • Stocks
  • Crypto
  • Property
  • Sport
  • Entertainment
  • Opinion
No Result
View All Result
Index Investing News
No Result
View All Result

OpenAI GPT 4o ranked as finest AI mannequin for writing Solidity sensible contract code by IQ

by Index Investing News
October 21, 2024
in Cryptocurrency
Reading Time: 3 mins read
A A
0
Home Cryptocurrency
Share on FacebookShare on Twitter


Receive, Manage & Grow Your Crypto Investments With Brighty

SolidityBench by IQ has launched as the primary leaderboard to judge LLMs in Solidity code technology. Accessible on Hugging Face, it introduces two progressive benchmarks, NaïveJudge and HumanEval for Solidity, designed to evaluate and rank the proficiency of AI fashions in producing sensible contract code.

Developed by IQ’s BrainDAO as a part of its forthcoming IQ Code suite, SolidityBench serves to refine their very own EVMind LLMs and evaluate them towards generalist and community-created fashions. IQ Code goals to supply AI fashions tailor-made for producing and auditing sensible contract code, addressing the rising want for safe and environment friendly blockchain functions.

As IQ informed CryptoSlate, NaïveJudge provides a novel method by tasking LLMs with implementing sensible contracts based mostly on detailed specs derived from audited OpenZeppelin contracts. These contracts present a gold normal for correctness and effectivity. The generated code is evaluated towards a reference implementation utilizing standards resembling practical completeness, adherence to Solidity finest practices and safety requirements, and optimization effectivity.

The analysis course of leverages superior LLMs, together with totally different variations of OpenAI’s GPT-4 and Claude 3.5 Sonnet as neutral code reviewers. They assess the code based mostly on rigorous standards, together with implementing all key functionalities, dealing with edge instances, error administration, correct syntax utilization, and total code construction and maintainability.

Optimization issues resembling gasoline effectivity and storage administration are additionally evaluated. Scores vary from 0 to 100, offering a complete evaluation throughout performance, safety, and effectivity, mirroring the complexities {of professional} sensible contract improvement.

Which AI fashions are finest for solidity sensible contract improvement?

Benchmarking outcomes confirmed that OpenAI’s GPT-4o mannequin achieved the best total rating of 80.05, with a NaïveJudge rating of 72.18 and HumanEval for Solidity move charges of 80% at move@1 and 92% at move@3.

Curiously, newer reasoning fashions like OpenAI’s o1-preview and o1-mini had been crushed to the highest spot, scoring 77.61 and 75.08, respectively. Fashions from Anthropic and XAI, together with Claude 3.5 Sonnet and grok-2, demonstrated aggressive efficiency with total scores hovering round 74. Nvidia’s Llama-3.1-Nemotron-70B scored lowest within the high 10 at 52.54.

SolidityBench scores for LLMs (Hugging Face)
SolidityBench scores for LLMs (Hugging Face)

Per IQ, HumanEval for Solidity adapts OpenAI’s unique HumanEval benchmark from Python to Solidity, encompassing 25 duties of various problem. Every activity contains corresponding checks suitable with Hardhat, a well-liked Ethereum improvement atmosphere, facilitating correct compilation and testing of generated code. The analysis metrics, move@1 and move@3, measure the mannequin’s success on preliminary makes an attempt and over a number of tries, providing insights into each precision and problem-solving capabilities.

Targets of using AI fashions in sensible contract improvement

By introducing these benchmarks, SolidityBench seeks to advance AI-assisted sensible contract improvement. It encourages the creation of extra subtle and dependable AI fashions whereas offering builders and researchers with worthwhile insights into AI’s present capabilities and limitations in Solidity improvement.

The benchmarking toolkit goals to advance IQ Code’s EVMind LLMs and likewise units new requirements for AI-assisted sensible contract improvement throughout the blockchain ecosystem. The initiative hopes to handle a important want within the trade, the place the demand for safe and environment friendly sensible contracts continues to develop.

Builders, researchers, and AI lovers are invited to discover and contribute to SolidityBench, which goals to drive the continual refinement of AI fashions, promote finest practices, and advance decentralized functions.

Go to the SolidityBench leaderboard on Hugging Face to be taught extra and start benchmarking Solidity technology fashions.

🤖 Prime AI Crypto Belongings

View All

Talked about on this article



Source link

Tags: CodecontractGPTModelOpenAIrankedSmartSolidityWriting
ShareTweetShareShare
Previous Post

NFL Ticket Value Inflation Over The Final Decade – FREEDOMBUNKER

Next Post

Mitigate the affect of world shocks on India’s monetary sector

Related Posts

Binance Denies WSJ Report Alleging 0M in Iran-Linked Crypto Transactions

Binance Denies WSJ Report Alleging $850M in Iran-Linked Crypto Transactions

by Index Investing News
May 23, 2026
0

Binance CEO Richard Teng has pushed back against a new Wall Street Journal investigation claiming the exchange processed $850 million...

Crypto gives back gains as macro headwinds overwhelm regulatory optimism

Crypto gives back gains as macro headwinds overwhelm regulatory optimism

by Index Investing News
May 15, 2026
0

For about 48 hours, crypto had something genuine to celebrate. The CLARITY Act, a landmark piece of stablecoin regulation, cleared...

Bitcoin ETF Issuers Are Predicting ,000,000 Per Coin As Inflows Accelerate

Bitcoin ETF Issuers Are Predicting $1,000,000 Per Coin As Inflows Accelerate

by Index Investing News
May 11, 2026
0

Trusted Editorial content, reviewed by leading industry experts and seasoned editors. Ad Disclosure VanEck’s Matthew Sigel has become the latest...

Retail Traders Get Crypto Access as Morgan Stanley Follows SoFi in Trading Push

Retail Traders Get Crypto Access as Morgan Stanley Follows SoFi in Trading Push

by Index Investing News
May 7, 2026
0

FM Daily Brief - 6 May 2026 FM Daily Brief - 6 May 2026 FM Daily Brief - 6 May...

Why Cross-Chain DEX Trading Is Becoming the New Default in Crypto

Why Cross-Chain DEX Trading Is Becoming the New Default in Crypto

by Index Investing News
May 3, 2026
0

Image source: GeminiThe manner in which individuals conduct crypto trading has changed. Not slightly but structurally. A decentralized exchange platform which...

Next Post
Mitigate the affect of world shocks on India’s monetary sector

Mitigate the affect of world shocks on India’s monetary sector

Why Housing Is Artificially Costly and What Can Be Accomplished About It (with Bryan Caplan)

Why Housing Is Artificially Costly and What Can Be Accomplished About It (with Bryan Caplan)

RECOMMENDED

Kamala Harris: ‘Do they need ZERO Catholic votes for Harris?’: Christian officers slam governor’s viral video mocking holy communion

Kamala Harris: ‘Do they need ZERO Catholic votes for Harris?’: Christian officers slam governor’s viral video mocking holy communion

October 13, 2024
Japan Earthquake: 6.1-magnitude earthquake strikes off central Japan, no tsunami advisory | World News

Japan Earthquake: 6.1-magnitude earthquake strikes off central Japan, no tsunami advisory | World News

November 14, 2022
Bitcoin (BTC) Tops K Forward of Jackson Gap as Ether (ETH) ETFs Prolong Report Outflow Streak

Bitcoin (BTC) Tops $61K Forward of Jackson Gap as Ether (ETH) ETFs Prolong Report Outflow Streak

August 23, 2024
Zombie Punk Sells For .08M

Zombie Punk Sells For $1.08M

May 19, 2025
10 Horror Movies That’ll Make You Never Want to Pick Up the Phone Again

10 Horror Movies That’ll Make You Never Want to Pick Up the Phone Again

October 7, 2023
Monetary Freedom in 11 Years Because of This “Excellent” Rental Technique

Monetary Freedom in 11 Years Because of This “Excellent” Rental Technique

March 10, 2025
Palestinian activist’s deportation a ‘violation of free speech,’ says advocate | Hanomansing Tonight

Palestinian activist’s deportation a ‘violation of free speech,’ says advocate | Hanomansing Tonight

April 12, 2025
He Wanted His Texas Home to Resemble a ‘Groovy’ Opium Den

He Wanted His Texas Home to Resemble a ‘Groovy’ Opium Den

September 5, 2023
Index Investing News

Get the latest news and follow the coverage of Investing, World News, Stocks, Market Analysis, Business & Financial News, and more from the top trusted sources.

  • 1717575246.7
  • Browse the latest news about investing and more
  • Contact us
  • Cookie Privacy Policy
  • Disclaimer
  • DMCA
  • Privacy Policy
  • Terms and Conditions
  • xtw18387b488

Copyright © 2022 - Index Investing News.
Index Investing News is not responsible for the content of external sites.

No Result
View All Result
  • Home
  • World
  • Investing
  • Financial
  • Economy
  • Markets
  • Stocks
  • Crypto
  • Property
  • Sport
  • Entertainment
  • Opinion

Copyright © 2022 - Index Investing News.
Index Investing News is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In