Anthropic’s Claude adds a prompt playground to quickly improve your AI apps

5:11 PM PDT • July 9, 2024

Prompt engineering became a hot job last year in the AI industry, but it seems Anthropic is now developing tools to at least partially automate it.

Anthropic released several new features on Tuesday to help developers create more useful applications with the startup’s language model, Claude, according to a company blog post. Developers can now use Claude 3.5 Sonnet to generate, test and evaluate prompts, using prompt engineering techniques to create better inputs and improve Claude’s answers for specialized tasks.

Language models are pretty forgiving when you ask them to perform some tasks, but sometimes small changes to the wording of a prompt can lead to big improvements in the results. Normally you’d have to figure out that wording yourself, or hire a prompt engineer to do it, but this new feature offers quick feedback that could make finding improvements easier.

The features are housed within Anthropic Console under a new Evaluate tab. Console is the startup’s test kitchen for developers, created to attract businesses looking to build products with Claude. One of the features, unveiled in May, is Anthropic’s built-in prompt generator; this takes a short description of a task and constructs a much longer, fleshed-out prompt using Anthropic’s own prompt engineering techniques. While Anthropic’s tools may not replace prompt engineers altogether, the company said they would help new users and save time for experienced prompt engineers.

Within Evaluate, developers can test how effective their AI application’s prompts are in a range of scenarios. Developers can upload real-world examples to a test suite or ask Claude to generate an array of test cases. They can then compare the effectiveness of various prompts side by side and rate sample answers on a five-point scale.

Image caption: A prompt being fed generated data to find good and bad responses.

In an example from Anthropic’s blog post, a developer identified that their application was giving answers that were too short across several test cases. The developer was able to tweak a line in their prompt to make the answers longer, and apply it simultaneously to all their test cases. That could save developers lots of time and effort, especially ones with little or no prompt engineering experience.
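The workflow described here (run each prompt variant over the same test cases, rate the answers on a five-point scale, compare the results) can be sketched in a few lines of Python. This is only an illustration and not Anthropic's implementation: `stub_model`, `rate_answer`, `evaluate`, the two prompt templates and the test cases are all hypothetical, the model call is a local stand-in rather than a real API, and the length-based score stands in for the human ratings the article describes.

```python
from statistics import mean

# Hypothetical stand-in for a model call; a real app would call its LLM client here.
def stub_model(prompt: str) -> str:
    # For illustration only: answer at length when the prompt asks for detail.
    if "detailed" in prompt:
        return "A longer answer with several sentences of explanation. " * 3
    return "Short answer."

# Hypothetical automated rating on a 1-5 scale, based purely on answer length.
# In the workflow described above, a person assigns these ratings instead.
def rate_answer(answer: str) -> int:
    words = len(answer.split())
    return min(5, max(1, 1 + words // 5))

def evaluate(template: str, cases: list[str]) -> float:
    """Fill the template with every test case and return the average rating."""
    ratings = [rate_answer(stub_model(template.format(question=c))) for c in cases]
    return float(mean(ratings))

# One shared test suite, applied to every prompt variant.
cases = [
    "What is a test suite?",
    "Why do prompts matter?",
    "How are answers rated?",
]

# Two prompt variants that differ by a single tweaked line.
templates = {
    "baseline": "Answer the question: {question}",
    "revised": "Answer the question in a detailed paragraph: {question}",
}

results = {name: evaluate(t, cases) for name, t in templates.items()}
for name, score in results.items():
    print(f"{name}: average rating {score:.1f}")
```

Because every variant runs against the same list of cases, a one-line change to a template is automatically re-checked across the whole suite, which is the time saving the example points to.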

In an interview at Google Cloud Next earlier this year, Anthropic CEO and co-founder Dario Amodei said prompt engineering was one of the most important factors in widespread enterprise adoption of generative AI. “It sounds simple, but 30 minutes with a prompt engineer can often make an application work when it wasn’t before,” said Amodei.
